Comment by BoredomIsFun
17 days ago
I'd stay away from ollana, just use llama.cpp; it is more up date, better performing and more flexible.
17 days ago
I'd stay away from ollana, just use llama.cpp; it is more up date, better performing and more flexible.
But you can't just switch between installed models like in ollama, can you?
llama-swap? https://www.nijho.lt/post/llama-nixos/