Comment by voxadam
10 days ago
Am I correct in understanding that Whisper is a speech recognition AI model originally created by OpenAI?
https://en.wikipedia.org/wiki/Whisper_(speech_recognition_sy...
10 days ago
Am I correct in understanding that Whisper is a speech recognition AI model originally created by OpenAI?
https://en.wikipedia.org/wiki/Whisper_(speech_recognition_sy...
yep, there's a c++ implementation to run it https://github.com/ggml-org/whisper.cpp
Isn't WhisperX the canonical choice for running Whisper?
While whisper and whisperx is python implementations, the whisper.cpp wins the benchmarks.
Maybe for running locally? whisper.cpp is nice because you can embed it pretty easily in apps for various targets like iOS, OSX, Android, wasm, etc.
Yes.
From the documentation:
> It runs automatic speech recognition using the OpenAI's Whisper model.
Thanks, I was being tripped up by DDOS protection on code.ffmpeg.org for a minute and couldn't read the patch. The combo of Firefox and the fact that Quantum/Lumen/CenturyLink seems to get off by rotating my dynamic IP for no reason occasionally triggers various DDOS protections schemes.
No problem. :) Yeah, it took me 8 seconds to get through. It seems your issue was worse.
Kind of, it's a family of audio transcription models.
https://huggingface.co/search/full-text?q=whisper
I think so, if I remember correctly PotPlayer also supports it for automatic subtitling.
Yes, according to the comments in the patch, you are correct.
yes.