Comment by trvz
4 hours ago
Since that defaults to the q4 variant, try the q8 one:
ollama launch claude --model gemma4:26b-a4b-it-q8_0
4 hours ago
Since that defaults to the q4 variant, try the q8 one:
ollama launch claude --model gemma4:26b-a4b-it-q8_0
Even tried gemma4:31b and gemma4:31b with 128k context (I have 72GiB VRAM). Nothing. I'm cursed I guess. That's ollama-rocm if that matters (I had weird bugs on Vulkan, maybe gemma misbehaves on radeons somehow?..).
UPD: tried ollama-vulkan. It works, gemma4:31b-it-q8_0 with 64k context!