Comment by swores
1 month ago
Minor nitpick, but you mean "tts" not "stt" both times.
Is supertonic the best sounding model, or is there a different one you'd recommend that doesn't perform as well but sounds even better?
1 month ago
Minor nitpick, but you mean "tts" not "stt" both times.
Is supertonic the best sounding model, or is there a different one you'd recommend that doesn't perform as well but sounds even better?
yes sorry i mixed these up. supertonic is not the best sounding in my tests. it was by far the fastest, but its audio quality for something so fast was decent. if you wanted something that sounds better AND is also extremely fast pocket tts is the choice. amazing quality and also crazy fast on both gpu and cpu. if you care mainly about quality, chatterbox in my tests was best fit, but its slower then the others. qwen 3 tts was also great but its unisable as any real time agentic voice as its too slow. they havent relesed the code for streaming yet, once they release that this will be my top contender.
Thanks!