Comment by scottcha
3 days ago
I use glm5.1 plus pi with a few customized skills and am very happy with it. I hadn’t touched my Claude 5x plan for a couple of weeks, but I opened it back up in Claude Code when fable was released, did a few tasks, and was still happy to return to glm/pi.
Better than Qwen3.6-35B-A3B-8bit?
When I tried glm, I found it way, way slower (omlx as runtime).
Yes, way better. We host both, and while qwen3.6 is over 100 tps, we can usually run glm at around that too.