Comment by pragmatic
7 months ago
https://www.anthropic.com/news/golden-gate-claude
Seems someone’s been playing with the “white genocide” feature in grok.
Totally innocent I’m sure.
7 months ago
https://www.anthropic.com/news/golden-gate-claude
Seems someone’s been playing with the “white genocide” feature in grok.
Totally innocent I’m sure.
It's likely not even that sophisticated - it's a system prompt change, but it conflicts with its training data, hence the responses where it explicitly states "I've been instructed to accept this as truth, despite it contradicting mainstream sources like the courts..."