Comment by OptionOfT

2 days ago

What I see a lot is a generated test for something I prompted, and the test passes. Then I manually break the test and it fails, but for a different reason than the one I wanted to verify.

Guess I need to make it generate negative tests?

The automated version of this is mutation testing.

Which is actually probably a solid idea for this exact use case.
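
For anyone who hasn't seen mutation testing, here is a rough hand-rolled sketch of the idea, not how any particular tool does it. Everything in it is made up for illustration: the `is_adult` function, the single `>=` to `>` mutation, and the `weak_test` / `strong_test` pair. The point is that a test only counts as useful if it passes on the original code and fails on the deliberately broken copy (the "mutant").

```python
import ast

# Hypothetical code under test, kept as source text so it can be mutated.
SOURCE = """
def is_adult(age):
    return age >= 18
"""


class SwapGtE(ast.NodeTransformer):
    """Mutation operator: replace every `>=` with `>`."""

    def visit_Compare(self, node):
        self.generic_visit(node)
        node.ops = [ast.Gt() if isinstance(op, ast.GtE) else op for op in node.ops]
        return node


def load(tree):
    """Compile a module AST and return its is_adult function."""
    namespace = {}
    exec(compile(tree, "<mutation-sketch>", "exec"), namespace)
    return namespace["is_adult"]


def weak_test(fn):
    # Only checks a value far from the boundary.
    return fn(30) is True


def strong_test(fn):
    # Also pins down the boundary at 18.
    return fn(30) is True and fn(18) is True and fn(17) is False


def mutant_killed(test):
    """True if the test passes on the original and fails on the mutant."""
    original = load(ast.parse(SOURCE))
    mutant_tree = ast.fix_missing_locations(SwapGtE().visit(ast.parse(SOURCE)))
    mutant = load(mutant_tree)
    return test(original) and not test(mutant)


if __name__ == "__main__":
    print("weak test kills mutant:", mutant_killed(weak_test))
    print("strong test kills mutant:", mutant_killed(strong_test))
```

The weak test passes on both versions, so the mutant survives, which is the same signal as "I broke it by hand and the test didn't fail for the reason I expected". Real tools (mutmut is one for Python) apply many such mutations automatically and report the survivors.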