Comment by sankalpmukim
3 hours ago
I think this kind of overthinking is an extremely common pattern in the Chinese models. GLM's models are also very much like this.
3 hours ago
I think this kind of overthinking is an extremely common pattern in the Chinese models. GLM's models are also very much like this.
No comments yet
Contribute on Hacker News ↗