Comment by gertlabs
1 month ago
The small Qwen 3.6 models handle context a little better than Gemma 4, but Gemma 4 26B in particular produces small, efficient solutions that are really smart for its weight class. I was so impressed with its performance in our benchmark at release that I wrote a blog post about it [0], although its position on the leaderboard later fell a bit as we ran it in more long-context agentic coding environments.
Here's a great explanation of why:
https://www.youtube.com/watch?v=_A367W_qvc8
Google's messing with how the context is handled: LOTS of speed in exchange for slightly worse long-context performance.