Comment by zozbot234
12 hours ago
> We need open weights models that are big and run on H200s.
We have this class of models already, Kimi 2.5 and GLM-5 are proper SOTA models. Nemotron might also release a larger-sized model at some time in the future. With the new NVMe-based offload being worked on as of late you can even experiment with these models on your own hardware, but of course there's plenty of cheap third-party inference platforms for these too.
No comments yet
Contribute on Hacker News ↗