Comment by 8note

10 hours ago

the author certainly failed at a lot of basics and is doing the known "the junior broke something prod and were putting all the pressure and blame on them rather than the system that created the error"

but it is still useful feedback to the model makers

they are training in the behaviour to prioritize deleting and starting from a clean environment.

this is a bad thing to train for, especially as more and more people use more and more agents in a different way.

an agent that thinks about deleting stuff without considering alternatives and asking for help, shouldnt be passing the safety bar