Comment by cadamsdotcom
5 hours ago
> An early version of Claude Opus 4.6 would sometimes mysteriously respond to English queries in other languages. NLAs helped Anthropic researchers discover training data that caused this.
Very cool - sounds similar to OpenAI’s goblin troubles.