Comment by kamranjon
15 hours ago
I was using the 8 bit quant and no reasoning - it’d make mistakes but then fix them at a speed that was impressive - it also was like incredibly tenacious and would honey badger its way around any issues it hit. My second best was Qwen 3 coder next - I did play with 3.5 and 3.6 (both moe and dense variants) but always seemed to go back to GLM 4.7 8 bit mlx variant. I have 128gb mbp so I’ve migrated to Deepseek v4 flash for everything now and haven’t looked back but if a new GLM flash model came out I’d be very excited.
No comments yet
Contribute on Hacker News ↗