Comment by esperent

5 hours ago

Is GLM-5.1 actually good?

I tested one of the other models that everyone is raving about yesterday (Qwen 3.6 plus) and within minutes found myself arguing with it even over a very simple task. After about 30 minutes (in which token usage never went over 50k because it was just me rewinding to give it more and more explicit instructions which it kept ignoring), I reverted everything and did it with Opus in literally about 4 minutes, after intentionally giving Opus a much more vague prompt.

I've had a good experience so far. Idk if I would attribute that to Pi or the GLM models. However, it feels nice not being constrained by usage.