Comment by noman-land
3 months ago
Thanks for clarifying this. Can you point to the link to the baseline model that was released? I'm one of the people not seeing censorship locally and it is indeed a distilled model.
The main 671B-parameter model is here [1].
[1] https://huggingface.co/deepseek-ai/DeepSeek-R1