Comment by danielhanchen
8 hours ago
We also made some dynamic MLX ones if they help. They might be faster on Macs, but llama-server is definitely improving at a fast pace.
What exactly does the .sh file install? How does it compare to running the same model in, say, omlx?