Wow. Thanks for posting the direct link to examples. Those sound incredibly good and would be impressive for a frontier lab. For two people over a few months, it's spectacular.
A little overacted; it reminds me of the voice acting in those Flash cartoons you'd see in the early days of YouTube. That's not to say it isn't good work; it still sounds remarkably human. Just silly humans :)
Overacted and silly humans indeed: https://www.youtube.com/watch?v=gO8N3L_aERg
"flash cartoons in the early days of YouTube" Wouldn't those be straight from Newgrounds?
Thank you! I couldn't remember the name Newgrounds for some reason!!
Reminded me of the Fenslerfilm G.I. Joe sketch where the kids have something on the stove burning
Stop all the downloading!
This is an instant classic. Sesame comparison examples all sound like clueless rich people from The White Lotus.
Sounds great. One of the female examples has convincing uptalk. There must be a way to manipulate the latent space to control uptalk, vocal fry, smoker’s voice, lispiness, etc.