Comment by bcherny

17 hours ago

I don't think that's accurate. The malware prompt has been around since Sonnet 3.7. We carefully evaled it for each new model release and found no regression in intelligence, alongside improved scores for cyber risk. That said, we have removed the prompt for Opus 4.6 since it no longer needed it.


rawicki 17 hours ago

I've been seeing "not a malware, continuing" in almost every reply for about 2 weeks now. Maybe it was reintroduced by some regression? This is on Opus 4.6.

bcherny 17 hours ago
That's weird. Would you mind running /feedback and sharing the id here next time you see this? I'd love to debug
- rawicki 17 hours ago
  
  Sure, I really appreciate you looking at this.
  a6edd0d1-a9ed-4545-b237-cff00f5be090 / https://github.com/anthropics/claude-code/issues/47027
  I'm happy to provide any other info that would be useful (as long as I'm not sharing any information about the code or tools we use in a public GitHub issue).
  
- obrajesse 17 hours ago
  
  I’ve seen this a couple of times recently. Including right after compact. I’ll /feedback it next time I see it
bavell 17 hours ago

I've been using CC a decent amount the past few weeks and have never seen this malware stanza...?
echelon 17 hours ago
1. I've never seen this. Is there a config option to unhide it if it's happening? Is this in Claude Code? Does it have to be set to verbose or something?
2. Can we pay more/do more rigorous KYC to disable it if it's active?
- bcherny 17 hours ago
  
  This warning is not enabled for modern models. No action needed. I'm digging into the report above as soon as they're able to /feedback.