Comment by tshadley

7 months ago

Yes, OpenAI:

https://x.com/alexwei_/status/1946477754372985146

> 6/N In our evaluation, the model solved 5 of the 6 problems on the 2025 IMO. For each problem, three former IMO medalists independently graded the model’s submitted proof, with scores finalized after unanimous consensus. The model earned 35/42 points in total, enough for gold!

That means Google DeepMind is the first OFFICIAL IMO gold, since OpenAI's score was graded by former medalists rather than by IMO coordinators.

https://x.com/demishassabis/status/1947337620226240803

> We've now been given permission to share our results and are pleased to have been part of the inaugural cohort to have our model results officially graded and certified by IMO coordinators and experts, receiving the first official gold-level performance grading for an AI system!
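
For reference, a quick sketch of the arithmetic behind the 35/42 in the OpenAI quote. It assumes the standard IMO format of 7 points per problem, and that the model got full marks on the 5 solved problems and zero on the sixth. The quote doesn't give a per-problem breakdown, so that split is my assumption.

```python
# Sketch of the score arithmetic; variable names are illustrative only.
POINTS_PER_PROBLEM = 7  # each IMO problem is graded out of 7
NUM_PROBLEMS = 6        # an IMO paper has 6 problems
solved = 5              # "solved 5 of the 6 problems" per the OpenAI quote

max_score = POINTS_PER_PROBLEM * NUM_PROBLEMS
# Assumption: full marks on every solved problem, nothing on the unsolved one.
score = solved * POINTS_PER_PROBLEM

print(f"{score}/{max_score}")  # 35/42
```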


nomad_horse 7 months ago

Do you know if OpenAI used the same grading criteria as official judges?

tshadley 7 months ago

As IMO medalists, they would be expected to, I'm sure.
But this can be verified because the results are public:
https://github.com/aw31/openai-imo-2025-proofs/