Comment by aargh_aargh
8 months ago
Is there a current, updated list (ideally, a ranking) of the best open weights TTS models?
I'm actually more interested in STT (ASR) but the choices there are rather limited.
Yes: https://huggingface.co/models?pipeline_tag=text-to-speech
Generally if a model is trending on that page, there’s enough juice for it to be worth a try. There’s a lot of subjective-opinion-having in this space, so beyond “is it trending on HF” the best eval is your own ears. But if something is not trending on HF it is unlikely to be much good.
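If you'd rather bookmark or script that listing than click through filters, here's a minimal sketch that builds the same kind of URL for both TTS and ASR. The `sort=trending` query parameter and the `automatic-speech-recognition` task tag are my assumptions about the Hub's URL scheme, not something stated above, and `hub_listing_url` is just a hypothetical helper name.

```python
from urllib.parse import urlencode

# Base listing page from the link above.
HF_MODELS_PAGE = "https://huggingface.co/models"


def hub_listing_url(pipeline_tag: str, sort: str = "trending") -> str:
    """Build a Hub model-listing URL filtered by task tag.

    `sort="trending"` is an assumed query parameter; drop it if the Hub
    ignores or renames it.
    """
    query = urlencode({"pipeline_tag": pipeline_tag, "sort": sort})
    return f"{HF_MODELS_PAGE}?{query}"


# TTS listing (same filter as the link above) and the assumed ASR equivalent.
tts_url = hub_listing_url("text-to-speech")
asr_url = hub_listing_url("automatic-speech-recognition")

print(tts_url)
print(asr_url)
```

This only builds links; it doesn't query anything, so the ranking you see is whatever the page shows when you open it.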
Best TTS: VibeVoice, Chatterbox, Dia, Higgs Audio, F5-TTS, Kokoro, CosyVoice, XTTS-v2.
Unmute.sh (from Kyutai, the team behind Moshi) gets slept on, but it's really good.
Click leaderboard in the hamburger menu: https://huggingface.co/spaces/TTS-AGI/TTS-Arena-V2
Is there a way to filter out hosted models? The top three winners currently are all proprietary as far as I can tell.
edit: Ah, there's a lock icon next to the name of each proprietary model.
That's a highly incomplete comparison.
yes the best