Comment by BeetleB
6 days ago
It's not the speech recognition model alone that's fantastic. It's coupling it to an LLM for cleanup that makes all the difference.
See https://blog.nawaz.org/posts/2023/Dec/cleaning-up-speech-rec...
(This is not the best example as I gave it free rein to modify the text - I should post a followup that has an example closer to a typical use of speech recognition).
Without that extra cleanup, Whisper is simply not good enough.
No comments yet
Contribute on Hacker News ↗