Comment by sarchertech

2 days ago

I find that adversarial multi-agent setups eventually fall down because, given enough time, one side always manages to convince the other to give up.

I’ve tried all sorts of things to keep Claude from cheating, but the only one that works is to restrict access to the test files, which obviously isn’t a real solution.

We recently had an “AI week” at work and I spent $1000 in tokens trying out different iterations of this.

josephg 1 day ago

What did you find works best?