Comment by rpdillon
6 days ago
llama.cpp is what you want. It offers both a web UI and an API on the same port. I use llama.cpp's webui with gpt-oss-20b, and I also leverage it as an OpenAI-compatible server with gptel for Emacs. Very good product.
No comments yet
Contribute on Hacker News ↗