Comment by alecco

3 days ago

Isn't GLM-5.2 mostly DeepSeek V3 architecture?

More and more I suspect Z.ai just has deeper pockets and access to Claude traces, while DeepSeek is punching way above its weight.

That understates how difficult it is to reach the level of performance they attained. The fact that it surpasses DeepSeek V4 in most ways shows that they did some great work in this space.

Perhaps, but Z.ai contributed techniques such as IndexShare, which helps reduce computation for larger context windows (1M tokens).