Comment by palmfacehn

8 months ago

Has anyone developed a way to annotate the input to provide emotional context?

In the past I've used different samples from the same speaker for this.

1 comment

palmfacehn

There are models trained for some kind of emotion prompting (or style prompting more generally), whether in-band or out-of-band, but Chatterbox isn’t one of them. The closest you could get is building a system that takes your input, splits it into chunks of text to speak, and assigns each chunk the settings Chatterbox does support (mostly pace and exaggeration). Beyond that, there’s probably no real way to do it with Chatterbox.
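
For what it's worth, the chunking idea could be sketched roughly like this. Everything here is an assumption for illustration: the `[tag]` annotation syntax, the `PRESETS` table and its numbers, and the `parse_annotated` / `render` helpers are all made up, and the code never calls Chatterbox itself. You would pass in your own `synthesize` function that maps `exaggeration` and `pace` onto whatever settings your TTS call actually takes.

```python
import re
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical emotion presets. The numbers are placeholders, not tuned values.
PRESETS = {
    "neutral": {"exaggeration": 0.5, "pace": 0.5},
    "excited": {"exaggeration": 0.9, "pace": 0.3},
    "calm": {"exaggeration": 0.3, "pace": 0.6},
}

# Matches inline annotations such as [excited] or [calm].
TAG = re.compile(r"\[(\w+)\]")


@dataclass
class Chunk:
    text: str
    exaggeration: float
    pace: float


def parse_annotated(script: str) -> List[Chunk]:
    """Split a tagged script into chunks, each carrying its preset settings."""
    chunks = []
    current = "neutral"  # text before the first tag is treated as neutral
    # With one capture group, re.split alternates: text, tag, text, tag, ...
    for i, part in enumerate(TAG.split(script)):
        if i % 2 == 1:
            # Unknown tags fall back to neutral instead of raising.
            current = part if part in PRESETS else "neutral"
            continue
        text = part.strip()
        if text:
            chunks.append(Chunk(text, **PRESETS[current]))
    return chunks


def render(chunks: List[Chunk], synthesize: Callable) -> list:
    """Call the user-supplied synthesize function once per chunk, in order."""
    return [
        synthesize(c.text, exaggeration=c.exaggeration, pace=c.pace)
        for c in chunks
    ]
```

Usage would be something like `render(parse_annotated("Hi. [excited] We won! [calm] Now rest."), my_tts)`, where `my_tts` is your own wrapper, after which you concatenate the per-chunk audio yourself. Note this only varies the settings per chunk; it doesn't give the model any real understanding of the emotion.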