Comment by Zetaphor
2 months ago
I got tired of repeating the same points and having to dig up sources every time, so here's the timeline (as I know it) in one place with sources.
2 months ago
I got tired of repeating the same points and having to dig up sources every time, so here's the timeline (as I know it) in one place with sources.
Thanks for writing this, I hope people here will actually read this and not assume this is some unfounded hit piece. I was involved a little bit in llama.cpp and knew most of what you wrote and it’s just disgusting how ollama founders behaved! For people looking for alternatives, I would also recommend llama-file, it’s a one file executable for any OS that includes your chosen model: https://github.com/mozilla-ai/llamafile?tab=readme-ov-file
It’s truly open source, backed by Mozilla, openly uses llama.cpp and was created by wizard Justine Tunney of CosmopolitanC fame.
I also thought llamafile deserves a mention. Once you have all model params and tunings done bakes 'em into a single portable binary!
Thank you; it's an educating read for me, as someone who doesn't dwell in this space, but cares about FOSS in its true spirit.
Really nice. I wasn't aware of any of this.
Thanks, did not know any of this.
Great writing, thanks for the summary and timeline.
> Ollama eventually added ollama run hf.co/{repo}:{quant} to pull directly from Hugging Face, which partially addresses the availability problem.
uh actually, _we_ did (generates a Docker-style manifest on the fly)
Hah! I did not know that, I'll update the article!