Comment by kouteiheika
2 days ago
> Would also love to better understand what factors into "accuracy" since there might be some nuance there depending on the measure.
It's accuracy across GSM8K, MMLU, IFEVAL and LiveCodeBench.
They detail their methodology here: https://byteshape.com/blogs/Qwen3-4B-I-2507/
No comments yet
Contribute on Hacker News ↗