Comment by ivape
3 days ago
Darn, don't have the appropriate hardware.
The full version of Dia requires around 10GB of VRAM to run.
If you have 16GB of VRAM, I guess you could pair this with a 3B-param model alongside it, or realistically probably only a 1B-param model with a reasonable context window.
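For anyone who wants to sanity-check that, here is a rough back-of-the-envelope sketch. It assumes fp16 weights (2 bytes per parameter), treats the quoted "GB" figures as GiB, and ignores the KV cache, activations, and framework overhead, so real headroom will be smaller. The function names are made up for illustration.

```python
GIB = 1024 ** 3  # bytes per GiB


def weights_gib(params_billion: float, bytes_per_param: float = 2.0) -> float:
    """Approximate memory for the model weights alone, in GiB.

    bytes_per_param=2.0 assumes fp16/bf16 weights (an assumption, not a
    figure from the thread).
    """
    return params_billion * 1e9 * bytes_per_param / GIB


def headroom_gib(total_vram_gib: float, tts_gib: float,
                 params_billion: float, bytes_per_param: float = 2.0) -> float:
    """VRAM left for context (KV cache etc.) after loading both models."""
    return total_vram_gib - tts_gib - weights_gib(params_billion, bytes_per_param)


if __name__ == "__main__":
    # 16 GB card, ~10 GB for Dia (the figure quoted in the thread).
    for size in (1.0, 3.0):
        left = headroom_gib(16.0, 10.0, size)
        print(f"{size:.0f}B params: weights ~{weights_gib(size):.2f} GiB, "
              f"headroom ~{left:.2f} GiB")
```

Under those assumptions a 3B model's weights alone eat nearly all of the remaining ~6GB, which is why 1B looks like the more realistic pairing if you want any context window.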
We will work on a quantized version of the model, so hopefully you will be able to run it soon!
We've seen Bark from Suno go from a 16GB requirement -> a 4GB requirement + running on CPUs. It won't be too hard, we just need some time to work on it.
No doubt, these local TTS models are what I'm looking for because I'm so done typing and reading :)
You can try it now on https://huggingface.co/spaces/nari-labs/Dia-1.6B !!!