Comment by K0balt
4 hours ago
I’d say when Qwen works, it works like Sonnet; when it fails, it fails like Haiku. So it’s less consistent, but it works pretty well overall. It’s still useful for a lot of stuff, and I can run it directly on my MacBook. Once you get an idea of what it can and can’t bite off, it’s pretty easy to break things into chunks it will handle reliably. I still like to have access to SOTA models for review, though. You can also have a SOTA model write a development plan that is basically a series of prompts, one per part, then have the local model follow the plan.
I should mention: don’t run it at less than q6 quantization. I prefer q8.
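A rough sketch of the plan-then-follow workflow, in case it helps. Everything here is illustrative: `Step`, `run_plan`, and `fake_local_model` are made-up names, and the `generate` callable is a placeholder for whatever you use to call your local model. The plan itself would come from the bigger model; here it is just hard-coded.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Step:
    name: str    # short label for the chunk (hypothetical)
    prompt: str  # prompt written by the planning model


def run_plan(steps: List[Step], generate: Callable[[str], str]) -> Dict[str, str]:
    """Feed each planned prompt to the local model, one chunk at a time."""
    results: Dict[str, str] = {}
    for step in steps:
        # Each step stays small so the local model can handle it reliably.
        results[step.name] = generate(step.prompt)
    return results


# Placeholder for a real local-model call (assumption: swap in your own runner).
def fake_local_model(prompt: str) -> str:
    return f"[output for: {prompt}]"


# Hard-coded example plan; in practice the larger model would write these prompts.
plan = [
    Step("parser", "Write a function that parses the config file."),
    Step("tests", "Write unit tests for the parser."),
]
outputs = run_plan(plan, fake_local_model)
```

The outputs are keyed by step name, so it is easy to hand them back to the bigger model for review afterwards.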