Comment by gertlabs

1 month ago

The small Qwen 3.6 models handle context a little better than Gemma 4, but Gemma 4 26B in particular produces compact, efficient solutions that are really smart for its weight class. I was so impressed with its performance in our benchmark upon release that I wrote a blog post about it [0], although its position on the leaderboard later fell a bit as we ran it in more long-context agentic coding environments.

[0] https://gertlabs.com/blog/gemma-4-economics


spwa4 1 month ago

Here's a great explanation why:

https://www.youtube.com/watch?v=_A367W_qvc8

Google's messing with the context: LOTS of speed in exchange for slightly worse long-context performance.