Comment by easel

3 months ago

Also for me. 7b and 32b are Qwen-based; 8b and 70b are Llama-based. However, running the same queries against qwen2.5:7b produces markedly different results (sanitized vs. blocked entirely), so there must be some interplay between the foundation model and the distillation that accounts for the difference.