Comment by fy20

2 months ago

It feels like a bit of history is missing... If ollama was founded 3 years before llama.cpp was released, what engine did they use then? When did they transition?

I don't think that is the case. Llama.cpp appeared within weeks after meta released llama to select researchers (which then made it out to the public). 3 years before that nobody knew of the name llama. I'm sure that llama.cpp existed first

  • > within weeks

    One week, really, if we consider the "public" availability.

    Llama announced: February 24, 2023

    Weights leaked: March 3, 2023

    Llama.cpp: March 10, 2023

    (Ollama 0.0.1: Jul 8, 2023)

They spent several years in stealth mode but the initial release was llama.cpp.

Ollama v0.0.1 "Fast inference server written in Go, powered by llama.cpp" https://github.com/ollama/ollama/tree/v0.0.1

  • They spent several years in stealth mode

    doing what?

    trying to build themselves what llama.cpp ended up doing for them?

    • I asked myself the same question. Some other commenter mentioned above they started with some Kubernetes infrastructure thing and they pivoted later.