Comment by az226
19 days ago
You have a different agent write the tests and another run the tests. You tell them each that they aren’t checking their own work, they’re checking someone else’s. You can tell them to be skeptical. Then you can also tell them that don’t fail the code for no reason, because a third agent will be checking your tests and you will be penalized for inaccurate testing.
This approach balances out and maximizes accuracy.
> just use psychological tricks on the LLM, bro, you'll cajole it into not hallucinating
Can't help but chuckle at that.
If it sounds stupid but it works, it's not stupid.
> but it works
Proofs?
1 reply →