Comment by jenny91
1 year ago
As a data point: you can get an RTX 3090 for ~$1.2k, and it runs deepseek-r1:32b perfectly fine via Ollama + Open WebUI at ~35 tok/s, in an OpenAI-like web app, basically as fast as GPT-4o.
You mean Qwen 32B fine-tuned on DeepSeek output :)
There is only one actual DeepSeek-R1 model (671B); all the smaller ones are fine-tunes of other models.
> you can get an RTX 3090 for ~$1.2k
If you're paying that much you're being ripped off. They're $800-900 on eBay and IMO are still overpriced.