My experience with qwen-3.6:35B-A3B reinforces this, gonna give this a spin when unsloth has quants available
Gemini flash was just as good as pro for most tasks with good prompts, tools, and context. Gemma 4 was nearly as good as flash and Qwen 3.6 appears to be even better.
Plus you can control thinking time a lot more, so when Anthropic lobotomizes Opus on you...
My experience with qwen-3.6:35B-A3B reinforces this, gonna give this a spin when unsloth has quants available
Gemini flash was just as good as pro for most tasks with good prompts, tools, and context. Gemma 4 was nearly as good as flash and Qwen 3.6 appears to be even better.
> when unsloth has quants available
https://huggingface.co/unsloth/Qwen3.6-27B-GGUF
That was quick (compared to the 1T Kimi-2.6, not surprising)
2 replies →
> Size of the model isnt all that matters.
What matters is the motion in the tokens