← Back to context

Comment by untitled2

6 days ago

Exactly. Whom to believe?

The last time someone claimed a medal in an olympiad like this, turned out they manually translated the problem into Lean and then ran a brute force search algorithm to find a proof. For 60 hours. On a supercomputer.

Meanwhile high schoolers get a piece of paper and 4.5 hours.

Both are true. One spent $400 in compute and the other one spent a lot more.

  • Exactly. And presumably had a more sophisticated harness around the model, longer reasoning chains, best of N, self judging, etc

OpenAI achieved Gold on an unreleased model. GPT-5. Read the tweets and they explain what they did.