Comment by rahimnathwani
1 month ago
If you want to do custom voice cloning, record a sample wav file with a sentence or two, and then try this:
uv tool install --force git+https://github.com/Blaizzy/mlx-audio.git --prerelease=allow
python -m mlx_audio.tts.generate --model mlx-community/Qwen3-TTS-12Hz-0.6B-Base-bf16 --text "Hello, this is a test." --ref_audio path_to_audio.wav --ref_text "Transcript of the reference audio." --play
No comments yet
Contribute on Hacker News ↗