brin_bellway: forget-me-not flowers (Default)
[personal profile] brin_bellway
In which *some* but not *all* podcasts use auto-transcripts, so you can't immediately disregard something for being a podcast.

I was hoping Just Plain Wrong had a text version. :(

---

Listen Notes informs me that Google is willing to run their auto-transcriber on anything played through Google Chrome (not just Youtube videos), but 1: fuck Google Chrome, and 2: it sounds like it only *captions* rather than transcribing per se, so a 36-minute episode would require 36 minutes of hanging around watching for each new word/line to pop up (as opposed to dumping the audio into a processing queue, going off to do other things, and getting all the text at once later).

Date: 2021-06-29 06:58 am (UTC)
From: [personal profile] contrarianarchon
*sighs*.

(I haven't gotten used to any podcasts having transcripts at all and am thus still in the stone age of disregarding podcast sources... So you're ahead of me in that regard.)

Date: 2021-07-01 09:57 am (UTC)
From: [personal profile] contrarianarchon
If you've got the long-term memory needed to keep that workflow organised then that seems like a pretty viable plan, yeah.

... I wonder how much trouble it would be to make a FLOSS auto-transcriber, it feels like (bad) spoken-language models are a well-solved problem these days. Depends how much fidelity you actually need (probably more than a bad model can get you), I guess, plus any bit of software is fairly costly to make in practice because of the need for options and bug-testing and stuff. ... also this kind of thing is easier and easier the narrower the scope, which means the thing that can do specifically "two guys talking podcasts" is probably orders of magnitude simpler than a general-purpose speech parser. You could even train it on podcasts that *do* offer transcriptions!

Date: 2021-06-29 12:43 pm (UTC)
sigmaleph: (Default)
From: [personal profile] sigmaleph
the very notion of having software that will generate captions but won't let you download it as a text file fills me with rage

it's feels like, idk, designing your website so all the text input gets automatically converted to a jpg

Profile

brin_bellway: forget-me-not flowers (Default)
Brin

April 2025

S M T W T F S
   12345
6 7891011 12
13141516171819
20212223242526
27282930   

Tags

Style Credit

Expand Cut Tags

No cut tags
Page generated May. 13th, 2025 07:33 am
Powered by Dreamwidth Studios