← Back to context Comment by mixmastamyk 3 years ago Earth Angel, will you be mine…How about a speech synthesis DJ, “that was Foo McBar from 19XX”? 3 comments mixmastamyk Reply follower 3 years ago Since I've been playing around with Piper Text-to-Speech & the associated LibriTTS voice model, I couldn't resist:* https://rancidbacon.gitlab.io/piper-tts-demos/#various_radio...If the six speakers I selected for the demo don't match your taste in DJs, there's around 900 more in that voice model to try... :D(The 3 audio players differ only in file format & whether I ran the output through normalization.)Also, I was pretty impressed/surprised at the quality of their pronunciation of the meta-syntactic variables. :) drivers99 3 years ago They all sound out of breath. Not sure what causes that effect/perception. Listening to it again, it seems like the plosives (like d, or t) have no pop like there was no air pressure behind them. mixmastamyk 3 years ago Ha, that’s awesome. Several of the voices sounded just right for radio. I see the lib is limited to 24khz, maybe why it has an almost AM radio sound?
follower 3 years ago Since I've been playing around with Piper Text-to-Speech & the associated LibriTTS voice model, I couldn't resist:* https://rancidbacon.gitlab.io/piper-tts-demos/#various_radio...If the six speakers I selected for the demo don't match your taste in DJs, there's around 900 more in that voice model to try... :D(The 3 audio players differ only in file format & whether I ran the output through normalization.)Also, I was pretty impressed/surprised at the quality of their pronunciation of the meta-syntactic variables. :) drivers99 3 years ago They all sound out of breath. Not sure what causes that effect/perception. Listening to it again, it seems like the plosives (like d, or t) have no pop like there was no air pressure behind them. mixmastamyk 3 years ago Ha, that’s awesome. Several of the voices sounded just right for radio. I see the lib is limited to 24khz, maybe why it has an almost AM radio sound?
drivers99 3 years ago They all sound out of breath. Not sure what causes that effect/perception. Listening to it again, it seems like the plosives (like d, or t) have no pop like there was no air pressure behind them.
mixmastamyk 3 years ago Ha, that’s awesome. Several of the voices sounded just right for radio. I see the lib is limited to 24khz, maybe why it has an almost AM radio sound?
Since I've been playing around with Piper Text-to-Speech & the associated LibriTTS voice model, I couldn't resist:
* https://rancidbacon.gitlab.io/piper-tts-demos/#various_radio...
If the six speakers I selected for the demo don't match your taste in DJs, there's around 900 more in that voice model to try... :D
(The 3 audio players differ only in file format & whether I ran the output through normalization.)
Also, I was pretty impressed/surprised at the quality of their pronunciation of the meta-syntactic variables. :)
They all sound out of breath. Not sure what causes that effect/perception. Listening to it again, it seems like the plosives (like d, or t) have no pop like there was no air pressure behind them.
Ha, that’s awesome. Several of the voices sounded just right for radio. I see the lib is limited to 24khz, maybe why it has an almost AM radio sound?