Comment by alex7o
6 hours ago
To be honest I am a bit sad, as GLM 5.1 is producing much better TypeScript than Opus or Codex imo, but no matter what, it does sometimes go into schizo mode at some point over longer contexts. Not always though; I have had multiple sessions go over 200k and be fine.
When it works and it's not slow, it can impress. Like yesterday it solved something that Kimi K2.5 could not, and Kimi was the best open source model for me. But it's still slow sometimes. I have z.ai and Kimi subscriptions for when I run out of tokens for Claude (Max) and Codex (Plus).
I have a feeling it's nearing Opus 4.5 level, if they could just fix it going crazy after like 100k tokens.
I just set the context window to 100k and manage it actively (e.g. I compact it regularly or make it write out documentation of its current state and start a new session).
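Roughly the policy I mean, as a sketch; the token estimate, thresholds, and helper names here are just placeholders I made up, not anything from an actual harness or CLI:

```typescript
// Hypothetical sketch of an active context-budget policy.
// All names and numbers are illustrative assumptions, not a real API.

const CONTEXT_BUDGET = 100_000;   // stop trusting the model past ~100k tokens
const COMPACT_THRESHOLD = 0.8;    // act once ~80% of the budget is used

type Action = "continue" | "compact" | "restart";

// Very rough token estimate: ~4 characters per token for English/code.
function estimateTokens(transcript: string): number {
  return Math.ceil(transcript.length / 4);
}

// Decide what to do with the session based on how full the context is.
function nextAction(transcript: string, compactsSoFar: number): Action {
  const used = estimateTokens(transcript) / CONTEXT_BUDGET;
  if (used < COMPACT_THRESHOLD) return "continue";
  // After a couple of compactions the summaries degrade, so instead have the
  // model write out a state document and start a fresh session from it.
  return compactsSoFar < 2 ? "compact" : "restart";
}

// Example usage:
// const action = nextAction(sessionTranscript, 1);
// if (action === "compact") { /* run /compact or an equivalent summarize step */ }
// if (action === "restart") { /* have it write STATE.md, then open a new session */ }
```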
For me, Opus 4.6 isn't working quite right currently, and I often use GLM 5.1 instead. I'd prefer to use peak Opus over GLM 5.1, but GLM 5.1 is an adequate fallback. It's incredible how good open-weight models have gotten.
Why is that sad? A free and open-source model outperforming its closed-source counterparts is always a win for users.
The non-awesome context window is the sad part, but I think a better harness can deal with this.
After the context gets to 100k tokens, you should open a new session or run /compact.
I honestly still hold onto habits from earlier days of Claude & Codex usage and tend to wipe / compact my context frequently. I don't trust the era of big giant contexts, frankly, even on the frontier models.
I also feel like it's helping me on the big models these days, with Claude giving so many issues.
Isn't it the same with Opus nowadays?