Comment by asymmetric
1 day ago
> there are at least a dozen companies that provide non-Anthropic/non-OpenAI models in the cloud
Do you have some links?
Also I assume the privacy implications are vastly different compared to running locally?
1 day ago
> there are at least a dozen companies that provide non-Anthropic/non-OpenAI models in the cloud
Do you have some links?
Also I assume the privacy implications are vastly different compared to running locally?
Throw a rock and you'll hit one... Groq (not Grok, elon stole the name), Mistral, SiliconFlow, Clarifai, Hyperbolic, Databricks, Together AI, Fireworks AI, CompactifAI, Nebius Base, Featherless AI, Hugging Face (they do inference too), Cohere, Baseten, DeepInfra, Fireworks AI, DeepSeek, Novita AI, OpenRouter, xAI, Perplexity Labs, AI21, OctoAI, Reka, Cerebras, Fal AI, Nscale, OVHcloud AI, Public AI, Replicate, SambaNova, Scaleway, WaveSpeedAI, Z.ai, GMI Cloud, Nebius, Tensorwave, Lamini, Predibase, FriendliAI, Shadeform, Qualcomm Cloud, Alibaba Cloud AI, Poe, Bento LLM, BytePlus ModelArk, InferenceAI, IBM Wastonx.AI, AWS Bedrock, Microsoft, Google
I use Ollama Cloud. $20/mo and I never come close to hitting quota (YMMV obviously).
They don't log anything, and they use US datacenters.
for privacy preserving direct inference: Fireworks ai nebius
otherwise openrouter for routing to lots of different providers.
openrouter, for example, there are models both open and closed