Comment by noman-land
1 year ago
Thanks for clarifying this. Can you point to the link to the baseline model that was released? I'm one of the people not seeing censorship locally and it is indeed a distilled model.
1 year ago
Thanks for clarifying this. Can you point to the link to the baseline model that was released? I'm one of the people not seeing censorship locally and it is indeed a distilled model.
The main 671B parameters model is here[1].
[1] https://huggingface.co/deepseek-ai/DeepSeek-R1