Comment by sermakarevich
10 hours ago
I started using brain -> workers approach for coding.
-- Brain is expensive smart model from claude subscription, Fable 5 when it was available, Opus now.
-- Worker is a local model (qwen3.6:46B), deployed in 36GB GPU, Opencode + Ollama.
Brain is responsible for analysis/design and task creation. Task should be made simple and clear so the worker can handle it. Worker does the coding. Brain validates and create a fix task when required. Atm fix to task ration is ~ 1:20.
If no available GPU at home - qwen3.6 is quite cheap on clouds.
Its rather experimental setup, out of curiosity, but it works better than I would expect it to. This allows me running 3 coding agents non stop for the 4-th day atm. Here I explain how I got there: https://news.ycombinator.com/item?id=48520757
No comments yet
Contribute on Hacker News ↗