← Back to context

Comment by cornholio

6 hours ago

If I understood correctly, the model will get it right because it knows when it isn't right.

Essentially, yes that's right! There's some subtlety in how to let it know it was wrong (returning things as tool errors because it trained on that), but that's the gist of it - sort of a self-correcting architecture.