Comment by tedsanders
2 hours ago
What do you mean by this? We don’t train on evals, and if we did I’d quit on the spot.
(The loose version of this that’s true is that there may exist eval data contamination in pretraining. This is a hard problem to fully solve.)
No comments yet
Contribute on Hacker News ↗