mirror of
https://github.com/SqrtMinusOne/sqrtminusone.github.io.git
synced 2025-12-10 15:53:03 +03:00
feat(vosk): add link to whisper
This commit is contained in:
parent
e58d4a810c
commit
75f73fe07a
2 changed files with 4 additions and 0 deletions
|
|
@ -6,6 +6,8 @@ tags = ["emacs", "elfeed"]
|
|||
draft = false
|
||||
+++
|
||||
|
||||
**Edit <span class="timestamp-wrapper"><span class="timestamp"><2022-10-13 Thu></span></span>:** Just a couple of days after this post, OpenAI released a speech recognition model called [Whisper](https://openai.com/blog/whisper/), which is so much better than anything I've ever seen before. I've decided to leave this post as it is, but check the [Emacs config](https://sqrtminusone.xyz/configs/emacs/#podcast-transcripts) for the updated version.
|
||||
|
||||
In my experience, finding something in a podcast is particularly troublesome. For example, occasionally I want to refer to some line in the podcast to make an [org-roam](https://github.com/org-roam/org-roam) node, e.g. I want to check that I got that part right.
|
||||
|
||||
And I have no reasonable way to get there because audio files in themselves don't allow for [random access](https://en.wikipedia.org/wiki/Random_access), i.e. there are no "landmarks" that point to this or that portion of the file. At least if nothing like a transcript is available.
|
||||
|
|
|
|||
|
|
@ -6,6 +6,8 @@
|
|||
#+HUGO_TAGS: elfeed
|
||||
#+HUGO_DRAFT: false
|
||||
|
||||
*Edit <2022-10-13 Thu>:* Just a couple of days after this post, OpenAI released a speech recognition model called [[https://openai.com/blog/whisper/][Whisper]], which is so much better than anything I've ever seen before. I've decided to leave this post as it is, but check the [[https://sqrtminusone.xyz/configs/emacs/#podcast-transcripts][Emacs config]] for the updated version.
|
||||
|
||||
In my experience, finding something in a podcast is particularly troublesome. For example, occasionally I want to refer to some line in the podcast to make an [[https://github.com/org-roam/org-roam][org-roam]] node, e.g. I want to check that I got that part right.
|
||||
|
||||
And I have no reasonable way to get there because audio files in themselves don't allow for [[https://en.wikipedia.org/wiki/Random_access][random access]], i.e. there are no "landmarks" that point to this or that portion of the file. At least if nothing like a transcript is available.
|
||||
|
|
|
|||
Loading…
Add table
Reference in a new issue