Comment by bcherny
16 hours ago
I don't think that's accurate. The malware prompt has been around since Sonnet 3.7. We carefully evaled it for each new model release and found no regression to intelligence, alongside improved scores for cyber risk. That said, we have removed the prompt for Opus 4.6 since it no longer needed it.
I started seeing "not a malware, continuing" in almost every reply since around 2 weeks ago. Maybe you just reintroduced it with some regression? Opus 4.6
That's weird. Would you mind running /feedback and sharing the id here next time you see this? I'd love to debug
Sure, I really appreciate you looking at this.
a6edd0d1-a9ed-4545-b237-cff00f5be090 / https://github.com/anthropics/claude-code/issues/47027
I'm happy to provide any other info that can be useful (as long as i'm not sharing any information about the code or tools we use into a public github issue).
4 replies →
I’ve seen this a couple of times recently. Including right after compact. I’ll /feedback it next time I see it
I've been using CC a decent amount the past few weeks and have never seen this malware stanza...?
1. I've never seen this. Is there a config option to unhide it if it's happening? Is this in Claude Code? Does it have to be set to verbose or something?
2. Can we pay more/do more rigorous KYC to disable it if it's active?
This warning is not enabled for modern models. No action needed. I'm digging into the report above as soon as they're able to /feedback.