Comment by rspoerri
5 hours ago
I am very interested in seeing new Qwen models. Qwen3.6 27b is the first one that can do things, doesn't constantly lose "its mind", and can be run on a 3090 with a good context size. But it sometimes gets into a loop.
Look on Hugging Face; there is a chat template that is supposed to fix the updates for the Qwen models.
https://huggingface.co/froggeric/Qwen-Fixed-Chat-Templates
Maybe it will help you?
I sort of thought this about Qwen3.5 35b: finally a local model that isn't a complete waste of electricity. But "upgrading" to 3.6 35b left me disappointed. It seemed more like a downgrade. Honestly, though, I've barely used either. Subjectively they still seem far from the frontier models, but for what they can do, it's great to be able to run them locally.
I've completely replaced GitHub Copilot using Sonnet 3.6 with OpenCode using Qwen3.6 27b, and it's been a great experience.
Is Sonnet 3.6 a typo? Claude Sonnet 3.6 (aka 3.5 New) is an ancient model from 2024.
Pretty sure they meant 4.6
Similar, but I'm using the 35B A3B variant with experimental MTP support.
OpenCode is pretty good too
A3B is especially nice. MoE really shines on memory-bandwidth-constrained platforms like the DGX Spark.
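A rough back-of-envelope sketch of why that is, not a benchmark: during decoding, each generated token has to read roughly all *active* weights from memory once, so memory bandwidth divided by the bytes of active weights gives an upper bound on tokens per second. The numbers below are my assumptions, not measurements: 273 GB/s for the DGX Spark, 4-bit quantization (0.5 bytes per parameter), and the function name is made up for illustration. It ignores KV cache reads, routing overhead, and compute limits.

```python
def decode_tokens_per_second_bound(active_params_billions: float,
                                   bytes_per_param: float,
                                   bandwidth_gb_per_s: float) -> float:
    """Upper bound on decode speed if every active weight is read once per token."""
    bytes_read_per_token = active_params_billions * 1e9 * bytes_per_param
    return (bandwidth_gb_per_s * 1e9) / bytes_read_per_token


# Assumed values (not from the thread): 273 GB/s bandwidth, 4-bit weights.
ASSUMED_BANDWIDTH_GB_PER_S = 273.0
ASSUMED_BYTES_PER_PARAM = 0.5

# Dense 27B model: all 27B parameters are active for every token.
dense_bound = decode_tokens_per_second_bound(
    27, ASSUMED_BYTES_PER_PARAM, ASSUMED_BANDWIDTH_GB_PER_S)

# 35B A3B MoE model: only about 3B parameters are active per token.
moe_bound = decode_tokens_per_second_bound(
    3, ASSUMED_BYTES_PER_PARAM, ASSUMED_BANDWIDTH_GB_PER_S)

print(f"dense 27B bound: {dense_bound:.1f} tok/s")
print(f"MoE A3B bound:   {moe_bound:.1f} tok/s")
```

Under these assumptions the MoE bound is about 9x the dense one, simply because 27B / 3B = 9. Real throughput will be lower for both, but the ratio is why a small active parameter count matters so much when bandwidth is the bottleneck.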
I had some older version of Qwen (I forget which one, to be fair) that was coding along and then lost itself in a loop. I was so confused. It was just a random greenfield "let's see how it does" type of project anyway.