Comment by tshadley
14 hours ago
Yes, OpenAI:
https://x.com/alexwei_/status/1946477754372985146
> 6/N In our evaluation, the model solved 5 of the 6 problems on the 2025 IMO. For each problem, three former IMO medalists independently graded the model’s submitted proof, with scores finalized after unanimous consensus. The model earned 35/42 points in total, enough for gold!
That means Google Deepmind is the first OFFICIAL IMO Gold.
https://x.com/demishassabis/status/1947337620226240803
> We've now been given permission to share our results and are pleased to have been part of the inaugural cohort to have our model results officially graded and certified by IMO coordinators and experts, receiving the first official gold-level performance grading for an AI system!
Do you know if OpenAI used the same grading criteria as official judges?
As IMO medalists they would be expected to I'm sure.
But this can be verified because the results are public:
https://github.com/aw31/openai-imo-2025-proofs/