Comment by singpolyma3
6 days ago
Does anyone know what the quantization is with ollama models? They always just list parameter count.
I'm also a bit unsure of the trade offs between smaller quant vs smaller model
6 days ago
Does anyone know what the quantization is with ollama models? They always just list parameter count.
I'm also a bit unsure of the trade offs between smaller quant vs smaller model
run ollama show <name_of_model>:<parameters> and you'll get the info. E.g. ollama show qwen3.5:0.8b Model architecture qwen35 parameters 873.44M context length 262144 embedding length 1024 quantization Q8_0 requires 0.17.1