Comment by riku_iki
6 months ago
> OpenAI to have gamed ARC-AGI by seeing the first few examples
not just few examples. o3 was evaluated on "semi-private" test, which was previously already used for evaluating OAI models, so OAI had access to it already for a long time.
No comments yet
Contribute on Hacker News ↗