Comment by antirez
6 days ago
I inspected manually and indeed the 27B is doing worse, but I believe it could be due to the exact GGUF in the ollama repository and/or with the need of adjusting the parameters. I'll try more stuff.
6 days ago
I inspected manually and indeed the 27B is doing worse, but I believe it could be due to the exact GGUF in the ollama repository and/or with the need of adjusting the parameters. I'll try more stuff.
Isn’t llama.cpp’s implementation of Qwen 3.5 better, or am I misinformed?
There was a recent fix by ollama and I used it.