Comment by revolvingthrow

3 hours ago

Amusing that just when the big three AI providers from US raise prices significantly, even for the mini models, you’ve got a Chinese model slashing their already-cheap offer by 75%. Not to mention you can run this model on your own hardware, although admittedly even the flash stretches the meaning of local for individual people.

2 comments

revolvingthrow

skybrian 2 hours ago

My guess is that the popular US providers get a lot more traffic and are supply-limited. No point in lowering prices unless you can serve the traffic that will result.

Lwerewolf 2 hours ago

Given that you can run quantized flash on 128g ram, and there's a heavy focus around it (DS4)... I'd say that it's pretty feasible for a decent amount of devs. Never thought I'd buy an MBP but here we are.

n.b. I can't use nonlocal models for a big chunk of my work, so there's that as well.