Comment by Sanzig
7 hours ago
Take a look at Ollama Cloud: https://ollama.com/pricing
You get access to a whole bunch of bleeding edge open models including GLM-5.2, Kimi K2.7, DeepSeek 4 Pro, etc. Inference is run on US/SG/EU cloud providers with zero data retention policies. The $20/mo tier is very generous, in my experience.
They don’t have a statement about where it is run or data retention on the GLM5.2 model. They do state that for others, like MiniMax.
There's a blanket statement at the bottom of the pricing page, which I would hope also applies to GLM-5.2:
> Where are models hosted?
> Ollama hosts models and compute resources primarily in the United States. To serve global demand, we may route to Europe and Singapore for additional capacity.
> Is my prompt or response data trained on?
> Prompt or response data is never logged or trained on.
> Who does Ollama partner with to host models?
> Ollama collaborates with NVIDIA Cloud Providers (NCPs) to host open models.
> When Ollama partners with providers, we require no logging, no training, and zero data retention policies in place.