Comment by viraptor

17 days ago

I can't quite figure this out: can you save a generated voice for reuse later? The mlx-audio code I looked at seems to take the text itself in every interface and doesn't expose the voice as a separate object. (I can dive deeper, but wanted to check if anyone's done it already.)

3 comments


akadeb 17 days ago

You could pipe the output to an audio file with ffmpeg or pyaudio and save it locally.

viraptor 16 days ago
I don't want to save the audio. I want to save the voice model so I can use it for many different texts, for consistency.
- stuckkeys 16 days ago
  
  Yes, you can. I was just testing it. I made a "My Custom Voices" tab where you can record a small sample of your own voice or upload a sample of whatever voice you like, and then use it. I'm also in the process of training a model of my own voice, using the 1.7B model, to see how it handles it.
  It works surprisingly well on a 4090. I will also try it on a 5090. This is the best one I have seen so far, NGL. 11Labs is cooked lol.
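
For anyone wanting the "save the voice, not the audio" workflow from this thread, here is a minimal sketch of one way it could look. It assumes the voice is defined by a short reference clip (as in the custom-voice approach described above) plus a transcript of that clip, which is an assumption and may not be needed by every model. `VoiceProfile`, `save_voice`, `load_voice`, and `speak_all` are made-up names for illustration, and `synthesize` is a hypothetical stand-in for whatever TTS call you actually use; none of these are mlx-audio functions.

```python
import json
import shutil
from dataclasses import asdict, dataclass
from pathlib import Path
from typing import Any, Callable, Iterable, List


@dataclass
class VoiceProfile:
    """A reusable voice: a reference clip plus the words spoken in it."""
    name: str
    ref_audio: str  # path to the stored copy of the reference clip
    ref_text: str   # transcript of the clip (assumed to be useful to the model)


def save_voice(name: str, clip: Path, transcript: str, store: Path) -> VoiceProfile:
    """Copy the reference clip into `store` and write a JSON sidecar next to it."""
    store.mkdir(parents=True, exist_ok=True)
    clip = Path(clip)
    kept = store / f"{name}{clip.suffix}"
    shutil.copyfile(clip, kept)
    profile = VoiceProfile(name=name, ref_audio=str(kept), ref_text=transcript)
    (store / f"{name}.json").write_text(json.dumps(asdict(profile), indent=2))
    return profile


def load_voice(name: str, store: Path) -> VoiceProfile:
    """Read a previously saved voice profile back from disk."""
    data = json.loads((Path(store) / f"{name}.json").read_text())
    return VoiceProfile(**data)


def speak_all(
    texts: Iterable[str],
    voice: VoiceProfile,
    synthesize: Callable[[str, str, str], Any],
) -> List[Any]:
    """Run every text through the same voice.

    `synthesize(text, ref_audio, ref_text)` is hypothetical: wrap your real
    TTS call in a function with this shape.
    """
    return [synthesize(text, voice.ref_audio, voice.ref_text) for text in texts]
```

The point of the sidecar file is that the same clip and transcript get passed on every call, so the voice stays consistent across texts even though the interface only takes text plus a reference.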