Comment by re-thc
5 months ago
> I know the distill models are not at all the same as the full model
It's far worse than that. It's not the model (Deepseek) at all. It's Qwen enhanced with Deepseek. So it's Qwen still.
5 months ago
> I know the distill models are not at all the same as the full model
It's far worse than that. It's not the model (Deepseek) at all. It's Qwen enhanced with Deepseek. So it's Qwen still.
No comments yet
Contribute on Hacker News ↗