Comment by upcoming-sesame
5 days ago
I also look for this reply because i like seeing the follow-up reply saying that this is not a benchmark anymore because labs have gotten it in their training data.
that reply never failed to come it's basically a meme at this point
No comments yet
Contribute on Hacker News ↗