Comment by thedangler

1 day ago

Kind of a noob, how would I implement this locally? How do I pass it audio to process. I'm assuming its in the API spec?

I wanted to try this locally as well so I have asked AI to write CLI for me: https://github.com/daliusd/qtts

There are some samples. If you have GPU you might want to fork and improve this, but otherwise slow, but usable on CPU as well.