fix(vosk): a few changes

This commit is contained in:
Pavel Korytov 2022-09-16 21:51:05 +03:00
parent 6358ddfea6
commit e58d4a810c
2 changed files with 4 additions and 4 deletions

View file

@ -6,11 +6,11 @@ tags = ["emacs", "elfeed"]
draft = false
+++
In my experience, finding something in a podcast is particularly troublesome. For example, occasionally I want to refer to some line in the podcast to make an [org-roam](https://github.com/org-roam/org-roam) node, so I want to check that I got that part right.
In my experience, finding something in a podcast is particularly troublesome. For example, occasionally I want to refer to some line in the podcast to make an [org-roam](https://github.com/org-roam/org-roam) node, e.g. I want to check that I got that part right.
And I have no reasonable way to get there because audio files in themselves don't allow for [random access](https://en.wikipedia.org/wiki/Random_access), i.e. there are no "landmarks" that point to this or that portion of the file. At least if nothing like a transcript is available.
For obvious reasons, podcasts rarely ship with transcripts. So in this post, I'll be using a speech recognition engine to make up for that. A generated transcript is not quite as good as a manually written one, but for the purpose of finding a fragment of the known podcast, it works well enough.
For obvious reasons, podcasts rarely ship with transcripts. So in this post, I'll be using a speech recognition engine to make up for that. A generated transcript is not quite as good as a manually written one, but for the purpose of finding a fragment of a known podcast, it works well enough.
{{< figure src="/images/vosk/img.png" >}}

View file

@ -6,11 +6,11 @@
#+HUGO_TAGS: elfeed
#+HUGO_DRAFT: false
In my experience, finding something in a podcast is particularly troublesome. For example, occasionally I want to refer to some line in the podcast to make an [[https://github.com/org-roam/org-roam][org-roam]] node, so I want to check that I got that part right.
In my experience, finding something in a podcast is particularly troublesome. For example, occasionally I want to refer to some line in the podcast to make an [[https://github.com/org-roam/org-roam][org-roam]] node, e.g. I want to check that I got that part right.
And I have no reasonable way to get there because audio files in themselves don't allow for [[https://en.wikipedia.org/wiki/Random_access][random access]], i.e. there are no "landmarks" that point to this or that portion of the file. At least if nothing like a transcript is available.
For obvious reasons, podcasts rarely ship with transcripts. So in this post, I'll be using a speech recognition engine to make up for that. A generated transcript is not quite as good as a manually written one, but for the purpose of finding a fragment of the known podcast, it works well enough.
For obvious reasons, podcasts rarely ship with transcripts. So in this post, I'll be using a speech recognition engine to make up for that. A generated transcript is not quite as good as a manually written one, but for the purpose of finding a fragment of a known podcast, it works well enough.
[[./static/images/vosk/img.png]]