feat(vosk): add link to whisper

This commit is contained in:
Pavel Korytov 2022-10-13 23:08:19 +03:00
parent e58d4a810c
commit 75f73fe07a
2 changed files with 4 additions and 0 deletions

View file

@ -6,6 +6,8 @@ tags = ["emacs", "elfeed"]
draft = false
+++
**Edit <span class="timestamp-wrapper"><span class="timestamp">&lt;2022-10-13 Thu&gt;</span></span>:** Just a couple of days after this post, OpenAI released a speech recognition model called [Whisper](https://openai.com/blog/whisper/), which is so much better than anything I've ever seen before. I've decided to leave this post as it is, but check the [Emacs config](https://sqrtminusone.xyz/configs/emacs/#podcast-transcripts) for the updated version.
In my experience, finding something in a podcast is particularly troublesome. For example, occasionally I want to refer to some line in the podcast to make an [org-roam](https://github.com/org-roam/org-roam) node, e.g. I want to check that I got that part right.
And I have no reasonable way to get there because audio files in themselves don't allow for [random access](https://en.wikipedia.org/wiki/Random_access), i.e. there are no "landmarks" that point to this or that portion of the file. At least if nothing like a transcript is available.

View file

@ -6,6 +6,8 @@
#+HUGO_TAGS: elfeed
#+HUGO_DRAFT: false
*Edit <2022-10-13 Thu>:* Just a couple of days after this post, OpenAI released a speech recognition model called [[https://openai.com/blog/whisper/][Whisper]], which is so much better than anything I've ever seen before. I've decided to leave this post as it is, but check the [[https://sqrtminusone.xyz/configs/emacs/#podcast-transcripts][Emacs config]] for the updated version.
In my experience, finding something in a podcast is particularly troublesome. For example, occasionally I want to refer to some line in the podcast to make an [[https://github.com/org-roam/org-roam][org-roam]] node, e.g. I want to check that I got that part right.
And I have no reasonable way to get there because audio files in themselves don't allow for [[https://en.wikipedia.org/wiki/Random_access][random access]], i.e. there are no "landmarks" that point to this or that portion of the file. At least if nothing like a transcript is available.