Comment by 0xbeefcab

1 day ago

Linking a previous discussion of FFmpeg's inclusion of Whisper in this release: https://news.ycombinator.com/item?id=44886647

This seemed to be interesting to users of this site. tl;dr: they added support for Whisper, an OpenAI model for speech-to-text, which should allow auto-generating captions via ffmpeg.

Heads up: Whisper support depends on how your FFmpeg was built. Some packages will not include it yet. Check with `ffmpeg -buildconf` or `ffmpeg -filters | grep whisper`. If you compile yourself, remember to pass `--enable-whisper` and give the filter a real model path.
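If you want to script that check instead of eyeballing the grep output, here is a minimal Python sketch. It assumes the usual `ffmpeg -filters` row layout (flags, filter name, input/output types, description) and that the filter is listed under the name `whisper`. The helper names `has_filter` and `ffmpeg_has_whisper` are made up for this example and are not part of FFmpeg.

```python
import shutil
import subprocess


def has_filter(filters_output: str, name: str) -> bool:
    """Return True if `name` is listed as a filter in `ffmpeg -filters` output."""
    for line in filters_output.splitlines():
        fields = line.split()
        # Assumed row layout: "<flags> <name> <in->out> <description...>".
        # Matching the second column exactly avoids false hits on descriptions.
        if len(fields) >= 3 and fields[1] == name:
            return True
    return False


def ffmpeg_has_whisper(ffmpeg: str = "ffmpeg") -> bool:
    """Run the given ffmpeg binary and report whether it lists a whisper filter."""
    if shutil.which(ffmpeg) is None:
        # No such binary on PATH, so there is nothing to query.
        return False
    result = subprocess.run(
        [ffmpeg, "-hide_banner", "-filters"],
        capture_output=True,
        text=True,
        check=False,
    )
    return has_filter(result.stdout, "whisper")


if __name__ == "__main__":
    print("whisper filter available:", ffmpeg_has_whisper())
```

This only tells you the filter was compiled in. You still need to download a model file yourself and point the filter at it.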

these days most movies and series already come out with captions, but you know what does not, given the vast amount of it?... ;)

yep, finally the deaf will be able to read what people are saying in a porno!

  • True, but it can also be hard to find captions in languages besides English for some lesser-known movies/shows

  • And also pirated releases are super weird and all over the place with subtitles and video player compatibility

    This could streamline things

    • This is because Blu-rays ship their subtitles as a bunch of images of the text rather than as actual text. So pirates have 3 options:

      1. Just copy them over from the Blu-ray. Most client players don't support this, so you'll either need to download a player that does, or use something like Plex/Jellyfin, which will run FFmpeg to transcode and burn the picture subtitles in before sending the video to the client.

      2. Run OCR on the Blu-ray subtitles. Not perfect.

      3. Steal subtitles from a streaming service release (or multiple) if it exists.

    • There are websites where you can download subtitles. Usually from very obviously pirated releases.