Comment by suprfnk

2 months ago

But then, if an agent picks the best response, how would you know that that is reliable?

4 comments

suprfnk

Obviously you have multiple agents justify why they picked a certain response and then create another agent that picks the solution with the best justification.

kkyr 2 months ago

touché
DmitriyBuchilin 2 months ago

[dead]

onion2k 2 months ago

You could get the agents to output something structured and then use a deterministic test if you're worried about that.