Comment by InkCanon

10 months ago

o1 reportedly got 83% on IMO, and 89th percentile on Codeforces.

https://openai.com/index/learning-to-reason-with-llms/

The paper tested it on o1-pro as well. Correct me if I'm getting some versioning mixed up here.

2 comments

InkCanon

alexlikeits1999 10 months ago

I've gone through the link you posted and the o1 system card and can't see any reference to IMO. Are you sure they were referring to IMO or were they referring to AIME?

sanxiyn 10 months ago

AIME is so not IMO.