Comment by slim

2 years ago

So how would the process of training a speaking AI go? Would you input the actor's voice samples and subtitles from a movie, then train it until the output is similar enough to the actor's voice from the movie?

2 comments

numpad0 2 years ago

Just a couple of minutes of data run through 10-20 minutes of training with RVC WebUI[1] on the included base model, then loaded into VC Client[2], gets you 90% of the way there. But that's a nearly year-old method, so I'm sure OAI has its own completely novel architecture for an extra 5% fidelity.

1: https://github.com/RVC-Project/Retrieval-based-Voice-Convers...

2: https://github.com/w-okada/voice-changer
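
For a rough sense of what "a couple of minutes of data" means in practice, here is a minimal sketch that adds up the length of the clips in a folder before you train. It is not part of RVC WebUI or VC Client; the function names and the `voice_samples` folder are made up for illustration, and it assumes the clips are plain PCM WAV files, which is all Python's standard `wave` module can read.

```python
import wave
from pathlib import Path


def clip_seconds(path):
    """Length of one PCM WAV file in seconds (frames / sample rate)."""
    with wave.open(str(path), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def total_minutes(folder):
    """Total length, in minutes, of every .wav file directly inside `folder`.

    `folder` is a hypothetical directory of voice clips; subfolders are
    not searched.
    """
    clips = sorted(Path(folder).glob("*.wav"))
    return sum(clip_seconds(clip) for clip in clips) / 60.0
```

Calling `total_minutes("voice_samples")` on your own folder gives a number to compare against the "couple of minutes" figure above. Other formats (MP3, FLAC) would need converting to WAV first or a different library.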

slim 2 years ago

What test data would they use?