Comment by hodgehog11

4 days ago

Not surprising; Ollama is set on becoming the standard interface for companies to deploy "open" models. The focus on "local" is incidental, and likely not long term. I'm sure Ollama is going to announce a plan to use "open" models through their own cloud-based API using this app.

3 comments

hodgehog11

grumbelbart2 4 days ago

> The focus on "local" is incidental

Strongly disagree with this. It is the default go-to for companies that cannot use cloud-based services for IP or regulatory reasons (think of defense contractors). Isn't that the main reason to use "open" models, which are still weaker than closed ones?

theshrike79 3 days ago

We are specifically using Ollama, because our stuff CANNOT leave the company internal net.

Any whiff of a cloud service and the lawyers will freak out.

That's why we run models via Ollama on our laptops (M-series is crazy powerful) and a few servers on the intranet for more oomph.

LM Studio changed their license to allow commercial use without "call me" pricing, so we might look into that more too.

diggan 3 days ago

> Ollama is set on becoming the standard interface for companies to deploy "open" models.

That's not what I've been seeing, but obviously my perspective (as anyone's) is limited. What I'm seeing is deployments of vLLM, SGLang, llama.cpp or even HuggingFace's Transformers with their own wrapper, at least for inference with open weight models. Somehow, the only place where I come across recommendations for running Ollama was on HN and before on r/LocalLlama but not even there as of late. The people who used to run Ollama for local inference (+ OpenWebUI) now seem to mostly be running LM Studio, myself included too.