Comment by alfiedotwtf
2 hours ago
It’s not a general rule, and depends highly on the model and the quantisation used. Don’t guess, Unsloth sometimes publish graphs in their tutorials showing the error rate vs file size… sometimes Q4 is great, other times I go for Q6
No comments yet
Contribute on Hacker News ↗