Comment by easel
3 months ago
Also for me. 8b and 32b are Qwen based, 7b and 70b are Llama based. Trying the same queries against qwen2.5:7b produces markedly different results (sanitized vs. blocked entirely), however, so there must be some interplay between the foundation model and distillation accounting for the difference.
No comments yet
Contribute on Hacker News ↗