Comment by bitexploder
1 day ago
It also comes down to inference speed, not "can I run this". 8-bit quant is quite a bit slower on an M5 Pro.
1 day ago
It also comes down to inference speed, not "can I run this". 8-bit quant is quite a bit slower on an M5 Pro.
No comments yet
Contribute on Hacker News ↗