Comment by jeremyjh
6 hours ago
No one is doing that for a model this size. It would have to be so heavily quantized that it wouldn’t be useful, or you’d need to spend half a million dollars on hardware. People use hosted APIs. Open weight means cloud vendors can host it.
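For a rough sense of the trade-off between quantization and hardware, here is a back-of-envelope sketch of the memory needed just to hold the weights at different precisions. The parameter count is a hypothetical placeholder, not the model's real size (the thread doesn't state it), and the estimate ignores KV cache, activations, and runtime overhead, so real requirements would be higher.

```python
def weight_memory_gib(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GiB."""
    total_bytes = num_params * bits_per_weight / 8
    return total_bytes / 2**30


# Hypothetical parameter count, chosen only for illustration.
HYPOTHETICAL_PARAMS = 700e9

for bits in (16, 8, 4, 2):
    gib = weight_memory_gib(HYPOTHETICAL_PARAMS, bits)
    print(f"{bits:>2}-bit weights: ~{gib:,.0f} GiB")
```

Swap in the actual parameter count to compare the result against the memory of whatever hardware you have in mind.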
Can you recommend any US-based cloud providers?
In HuggingChat (https://huggingface.co/chat) you can test open models for free and even test specific providers.
From there I collected the following US providers currently serving GLM 5.2:
- Together (https://www.together.ai/models)
- Fireworks (https://fireworks.ai/models)
- Featherless (https://featherless.ai/models)
That's great. Thank you!