Comment by Zetaphor

2 months ago

I got tired of repeating the same points and having to dig up sources every time, so here's the timeline (as I know it) in one place with sources.

8 comments

brabel 2 months ago

Thanks for writing this. I hope people here will actually read it and not assume it’s some unfounded hit piece. I was involved a little bit in llama.cpp and knew most of what you wrote, and it’s just disgusting how Ollama’s founders behaved! For people looking for alternatives, I would also recommend llamafile, a single-file executable for any OS that includes your chosen model: https://github.com/mozilla-ai/llamafile?tab=readme-ov-file

It’s truly open source, backed by Mozilla, openly uses llama.cpp, and was created by wizard Justine Tunney of Cosmopolitan Libc fame.

cachius 2 months ago

I also thought llamafile deserves a mention. Once you have all the model params and tunings done, it bakes 'em into a single portable binary!

kashyapc 2 months ago

Thank you; it's an educational read for me, as someone who doesn't dwell in this space but cares about FOSS in its true spirit.

Mario9382 2 months ago

Really nice. I wasn't aware of any of this.

robot-wrangler 2 months ago

Thanks, did not know any of this.

kelsolaar 2 months ago

Great writing, thanks for the summary and timeline.

julien_c 2 months ago

> Ollama eventually added ollama run hf.co/{repo}:{quant} to pull directly from Hugging Face, which partially addresses the availability problem.

uh actually, _we_ did (generates a Docker-style manifest on the fly)

Zetaphor 2 months ago

Hah! I did not know that, I'll update the article!