Comment by Roark66
4 hours ago
I think there is no reasonably priced machine you could run locally to do serious work with LLMs...
10x rtx6000 Pro in a large workstation is probably the way to go for someone wanting to run GLM5.2.
Other than that it is cloud.
As good as these small models got we are still not "at breakeven" for me.
What is "breakeven" with LLMs? For me it is when I no longer have to read the actual code it wrote. I can trust that if I told it to implement and document a certain architecture it actually did that with no stupid mistakes.
The first model ever that did that for me was the first opus. 4.4 if I remember correctly.
The second model was Gemini 3 Pro preview. For few weeks. Then it was lobotomised. I guess it was too expensive to run and they quantized it too hell.
Only Opus remains. If this GLM model truly rivals even an old opus I'll be very happy when day comes that I'll be able to run it locally.
No comments yet
Contribute on Hacker News ↗