Comment by nick238
4 days ago
I guess my major question would be: does the training data include anything from 2025 that might contain information about IMO 2025?
Given that AI companies are constantly trying to slurp up any and all data online, if the model's answers were derived from existing work, the result is maybe less impressive than it seems at first glance. If a present-day model does well at IMO 2026, that would be nice.
Are the human participants in the IMO held to the same standard?