Comment by scottcha

3 days ago

I use glm5.1 plus pi with a few customized skills and am very happy with it. I hadn’t touched my Claude 5x plan for a couple of weeks but opened it back up in Claude code when fable was released and did a few tasks and still was happy to return to glm/pi.

Better than Qwen3.6-35B-A3B-8bit ?

When I tried glm found it way way slower (omlx as runtime)

  • Yes way better. We host both and while qwen3.6 is over 100tps we usually can do glm around that too.