Comment by smusamashah
6 months ago
The reddit video is awesome. I don't understand how people are calling it an OK model. Under 25MB and cpu only for this quality is amazing.
6 months ago
The reddit video is awesome. I don't understand how people are calling it an OK model. Under 25MB and cpu only for this quality is amazing.
Just made a TTS tool based on Kitten TTS, fully browser based, no Python server backend: https://quickeditvideo.com/tts/ A tts model of this size should be industry standard!
The people calling it "OK" probably tried it for themselves. Whatever model is being demoed in that video is not the same as the 25MB model they released.
Nope, looks like the default voice is the worst and it's not in the demo. A Reddit user generated these as well https://limewire.com/d/28CRw#UPuRLynIi7
Never thought I'd see the name LimeWire again, wow
1 reply →
It did say this was a preview release, so I'll reserve judgement until that's out the door.
Local quality is very bad
https://vocaroo.com/1njz1UwwVHCF
It doesn't sound so good. Excellent technical achievement and it may just improve more and more! But for now I can't use it for consumer facing applications.
We are still training the model. We expect the quality to go up in the next release. This is just a preview release :)
[flagged]