Comment by Davidzheng
5 days ago
Are you sure this is not specialized to IMO? I do see the twitter thread saying it's "general reasoning" but I'd imagine they RL'd on olympiad math questions? If not I really hope someone from OpenAI says that bc it would be pretty astounding.
They also said this is not part of GPT-5, and “will be released later”. It’s very, very likely a model specifically fine-tuned for this benchmark, where afterwards they’ll evaluate what actual real-world problems it’s good at (eg like “use o4-mini-high for coding”).
Humans who excel at IMO questions are also "fine tuned" on them in the sense that they practice them for hundreds of hours
Sure, but nobody is using their IMO score to prove they are superintelligent and pulling it off in wider groups.
4 replies →
Their hardware isn't fine tuned to it though, it uses the same general intelligence hardware that all other humans use.
So its a big difference if you use a general intelligence system and makes it do well in math, or when you create a specialized system that is only good at math and can't be used to get good in other areas.
1 reply →