Same here. 7b and 32b are Qwen based; 8b and 70b are Llama based. However, running the same queries against qwen2.5:7b produces markedly different results (sanitized vs. blocked entirely), so there must be some interplay between the foundation model and the distillation that accounts for the difference.