Comment by ctolsen
6 days ago
I mean, I could say the same about Gemini. 3.1 Pro tops a bunch of benchmarks out there, but in every practical use I've put it to, it's underperformed other models, both proprietary and open-weight. Benchmarks are suspect in general.