Comment by skybrian

1 day ago

Don't focus on the headline too much. They diagnosed the problem and figured out a fix.

> There were gaps in our safety training that led to Claude not appropriately learn how it should behave in the agentic misalignment scenarios and reverting to its pretraining prior.