Comment by sankalpmukim
5 hours ago
I think this kind of overthinking is an extremely common pattern in the Chinese models. GLM's models are also very much like this.
5 hours ago
I think this kind of overthinking is an extremely common pattern in the Chinese models. GLM's models are also very much like this.
No comments yet
Contribute on Hacker News ↗