Comment by SwellJoe
20 hours ago
Interesting that the best performers are all Chinese-made models (DeepSeek and Qwen also perform consistently well). I wonder if there's more focus on vision and illustration in their training, or if something else is leading to their clear lead on this one test.