Comment by oofbey

2 months ago

Oh I’ve had agents remove tests plenty of times. Or cripple the tests so they pass but are useless - more common and harder to prompt against.

2 comments

oofbey

maddmann 2 months ago

Ah true, that also can happen — in aggregate I think models will tend to expand codebases versus contract. Though, this is anecdotal and probably is something ai labs and coding agent companies are looking at now.

oofbey 2 months ago

It’s the same bias for action which makes them code up a change when you genuinely are just asking a question about something. They really want to write code.