Comment by az226

19 days ago

You have a different agent write the tests and another run the tests. You tell them each that they aren’t checking their own work, they’re checking someone else’s. You can tell them to be skeptical. Then you can also tell them that don’t fail the code for no reason, because a third agent will be checking your tests and you will be penalized for inaccurate testing.

This approach balances out and maximizes accuracy.

4 comments

az226

otabdeveloper4 19 days ago

> just use psychological tricks on the LLM, bro, you'll cajole it into not hallucinating

Can't help but chuckle at that.

bheadmaster 19 days ago
If it sounds stupid but it works, it's not stupid.
- otabdeveloper4 18 days ago
  
  > but it works
  Proofs?
  
  1 reply →