Comment by culi
25 days ago
Kimi 2 is remarkably consistent at being the best. I wonder if it's somehow been trained specifically on tasks like these. It seems too consistent to be a coincidence.
Also shocking is that the most common runner-up I've seen is DeepSeek.