Comment by johnecheck

7 months ago

The key bit here is whether the LLM doing the cherry picking had knowledge of the solution. If it didn't, this is a meaningful result. That's why I'd like more info, but I fear OpenAI is going to try to keep things under wraps.

9 comments

johnecheck

diggan 7 months ago

> If it didn't

We kind of have to assume it didn't right? Otherwise bragging about the results makes zero sense and would be outright misleading.

samat 7 months ago

> would be outright misleading
why would not they? what are the incentives not to?
lucianbr 7 months ago

Corporations mislead to make money all the damn time.
Dilettante_ 7 months ago

"You really think someone would do that, just go on the internet and tell lies?"
[https://youtube.com/watch?v=YWdD206eSv0]
blibble 7 months ago
openai have been caught doing exactly this before
- aluminum96 7 months ago
  
  Why do people keep making up controversial claims like this? There is no evidence at all to this effect
  
  2 replies →

aluminum96 7 months ago

Mark Chen posted that the system was locked before the contest. [1] It would obviously be crazy cheating to give verifiers a solution to the problem!

[1] https://x.com/markchen90/status/1946573740986257614?s=46&t=H...