Comment by Davidzheng

7 months ago

Are you sure this is not specialized to IMO? I do see the twitter thread saying it's "general reasoning" but I'd imagine they RL'd on olympiad math questions? If not I really hope someone from OpenAI says that bc it would be pretty astounding.

9 comments

Davidzheng

stingraycharles 7 months ago

They also said this is not part of GPT-5, and “will be released later”. It’s very, very likely a model specifically fine-tuned for this benchmark, where afterwards they’ll evaluate what actual real-world problems it’s good at (eg like “use o4-mini-high for coding”).

UltraSane 7 months ago
Humans who excel at IMO questions are also "fine tuned" on them in the sense that they practice them for hundreds of hours
- SiempreViernes 7 months ago
  
  Sure, but nobody is using their IMO score to prove they are superintelligent and pulling it off in wider groups.
  
  4 replies →
- Jensson 7 months ago
  
  Their hardware isn't fine tuned to it though, it uses the same general intelligence hardware that all other humans use.
  So its a big difference if you use a general intelligence system and makes it do well in math, or when you create a specialized system that is only good at math and can't be used to get good in other areas.
  
  1 reply →