Comment by razfar

14 hours ago

For anyone who uses GPT-5.5/Codex as their daily driver, how does GLM-5.2/ZCode compare, esp in a codebase already set up for agentic coding?

GLM 5.2 is in an uncanny valley where it's too big to run at home, too expensive and slow in comparison to similarly capable model (a good chart here - https://deepswe.datacurve.ai/), and that's just comparing API prices.

When looking at subscription offering by Anthropic and OpenAI, it's not even comparable, as a Codex $200 subscription can easily use a billion tokens per week on GPT 5.5 high/xhigh.

It's an interesting model from the perspective of being the most capable open weight model. But it doesn't have a solid place in this marketplace right now.

  • Thanks, that DeepSWE comparison is really useful. Yeah the Codex $100 plan with xHigh reasoning is very practical and cost efficient.

I tried it for a couple of hours this morning and yeah, it's a bit slow, and I needed it in peak hours so it also often can't reach the server so that makes it even slower. And I'm not even sure it's just the model, it could very much be the harness. Stalled for 40 minutes on trivial tool calls like `find`, two times...

It shows potential, answer/code quality was solid, but I would need more time with it.

TLDR GLM will take a lot longer to do a task, and maybe spend more tokens depending how complex it is

Its a hell of a lot cheaper though, so for me its worth it. I have more claude experience though, and I would say its almost en par with Opus 4.1

  • It's cheaper if you pay API prices. If you pay a gpt sub then codex is much much cheaper.