Comment by MatthiasPortzel
8 days ago
The claim is that these models are trained on data that includes the problems and their explanations. The fact that the first model trained after the public release of the questions (and crowdsourced answers) performs best is not a counterexample; it is exactly what the claim predicts.