← Back to context

Comment by Gathering6678

1 day ago

Emm...I played the sample audio and it was...horrible?

How is it voice cloning if even the sample doesn't sound like any human being...

I should have posted the reference audio used with the examples. Honestly it doesn’t sound so different from them. Voice cloning can be from a cartoon too, doesn’t have to be from a human being

  • A before / after with the reference and output seems useful to me, and maybe a range from more generic to more recognizable / celebrity voice samples so people can kinda see how it tackles different ones?

    (Prominent politician or actor or somebody with a distinct speaking tone?)

Also, I didn’t want to use known voices as the example, so I ended up using generic ones from the datasets