Comment by pzo
6 days ago
you have image with WER on openai blog post here: https://openai.com/index/introducing-our-next-generation-aud...
On their chart they compare also with: gemini 2.0 flash, whisper large v2, whisper large v3, scribe v1, nova 1, nova 2. If you need only english transcription then pretty much all models will be good these days but big difference is depending on input language.
No comments yet
Contribute on Hacker News ↗