It is probably AI with suboptimal training, despite the massive dataset availabl...

sltkr · on Aug 20, 2024

YouTube already generates transcripts of uploaded videos (though not super accurate ones). A simple and straightforward implementation would just pattern match in the transcript text.

No need for fancy AI, on top of whatever technology powers the transcription feature.

hagbard_c · on Aug 20, 2024

It will be the STT they use for their automatic captioning so it should be as accurate as whatever that produces. It would be the height of inefficiency to do a separate STT run over the entire audio just for this gimmick.