Comment by airstrike
1 hour ago
This seems to be an attempt to compete with people running local models on Apple hardware—even though those local Mac Mini setups aren't really powerful.
I expect we'll get there in a few years, so perhaps this is Nvidia taking an early step in that direction.
In that case, this goes against Anthropic and OpenAI's business models. Which is a double whammy after Jensen Huang's recent comment about how agentic coding will only increase demand for software engineers, not reduce it.
So it also feels like a part of a budding shift in the competitive tension between the various parts of the AI supply chain.
Local AI was/is bound to happen, eventually. It'd be smart of Nvidia to get ahead of it.
Non-techy consumers may never do it, but at some point businesses are going to start asking when do they stop paying per token and start running models themselves. Right now the hardware is cost prohibitive, but I doubt that'll always be the case. Eventually the hardware will get cheaper and more available, and Nvidia seems to be betting on that.
They don't care where inference happens, so long as it happens on Nvidia hardware.