← Back to context

Comment by IshKebab

8 months ago

I agree. For some reason the female voices are waaay more convincing than the male ones too, which sound barely better than speech synthesis from a decade ago.

Results correlate to investment, and there’s more in synthesizing female coded voices. As for the why female coded voices gets more investments, we all know, only difference is in attitude towards that (the correct answer, of course, is “it sucks”)

  • We all know? Female voices have better intelligibility? That's my guess anyway.

    • There's a lot of money and effort spent in satisfying the sexual desires of (predominantly straight) men. There's not typically quite as much interest in doing the same for women.

      For example I've been looking at models and loras for generating images, and the boards are _full_ of ones that will generate women well or in some particular style. Quite often at least a couple of the preview images for each are hidden behind a button because they contain nudity. Clearly the intent is that they are at least able to generate porn containing women. There's a small handful that are focused on men and they're very aware of it, they all have notes lampshading how oddball they are to even exist.

      I would expect that this is not as pronounced an effect in the world generating speech, but it must still exist.

      8 replies →

    • If you don't know, it's on you to learn. If you do know and prefer to make an asshole of yourself, that's also on you.