DeepSeek R1: Open Weights, Hidden Bias

3 months ago (blog.getplum.ai)

Analysis of DeepSeek’s enforced CCP guardrails compared with those of OpenAI and Anthropic.

We evaluated DeepSeek R1 and confirmed that its guardrails deviate significantly from other model providers. We’re currently updating it to behave more in line with Anthropic and OpenAI’s models.

So the bias is baked into the open weights, meaning it also shows up on a self-hosted 671B model?

  • Yes, we observed this behavior on both the open-weights 671B model and the DeepSeek web app.

    • Weird, because I got some DeepSeek responses that were openly critical and explicit about China's authoritarian regime. I really thought this only happened in the DeepSeek web app.

      So I'm getting mixed signals about this.
