← Back to context

Comment by Szpadel

4 hours ago

isn't that weird there are no benchmarks included on this release?

I was thinking the same thing. It's the first release from any major lab in recent memory not to feature benchmarks.

It's probably counterprogramming, Gemini 3.0 will drop soon.

Probably because it’s not that much better than GPT-5 and they want to keep the AI train moving.

For 5.1-thinking, they show that 90th-percentile-length conversations are have 71% longer reasoning and 10th-percentile-length ones are 57% shorter