Comment by aurareturn
6 days ago
Just like how starting a chip fab was relatively easy back in the 80s and 90s. There were dozens of chip fab companies in the 80s.
It turns out that fabs follow Rock's Law which is that the capital cost to build a new fab doubles every 4 years. This means it will quickly get rid of the less competitive players. This is not dissimilar to the LLM scaling laws where you need a magnitude more compute to get unlock a new tier of intelligence.
Today, Anthropic and OpenAI are clearly in the lead for models and then there is everyone else. Google is a close 3rd. No one else is challenging them anymore in SOTA models. Some models might beat them in one or two benchmarks but none can compete overall. I expect this gap to grow bigger as models cost more and more to train.
Yes, I wrote about Rock’s Law too, but we don’t know that this is how these models will develop
Evidence point to the same type of scaling law. Compute for a training run grows 4-5x every year.[0] I'm sure this will slow down but the premise remains that weaker competitors will not be able to maintain this pace. We already see labs like Cohere, Mistral, Inflection AI, Adept, Character.ai, and others bow out of the frontier race. I'm also skeptical that Meta, xAI can catch up. Even Google has trouble keeping up.
Even if this isn't true, comparing telecom bits to tokens is wrong. Bits are the same no matter what telecom transfers them. Tokens are not all the same. The quality varies.
We're already seeing a massive divide between frontier models and lesser models in growth rates. Anthropic is adding $10b - $15b every month in ARR. This figure likely dwarfs open source labs. This is all because its models are maybe 10-15% better.
The cost to inference a 1T param frontier model is the same as a 1T param open source model. Therefore, if the frontier model is even 10-15% better, it will gobble up the market over time.
Lastly, even though Claude Code and Codex are the biggest revenue drivers for Anthropic and OpenAI today, I don't believe this will be true 2-5 year from now. I believe selling their tokens via API will be their biggest. The sum of applications in the world will dwarf coding in market size. For example, biotech, finance, physics, engineering, robotics, sensor data, etc. This is why I think OpenAI and Anthropic are becoming more like iOS and Android than AT&T and Verizon. Applications will build on top of OpenAI and Anthropic just like iOS and Android.
[0]https://epoch.ai/blog/training-compute-of-frontier-ai-models...
How about the externalized intelligence around the model weights (skills, tools, harness, memory etc)? If the model weights are sufficiently intelligent, the focus might move to the external layers.
I agree with much of what you’ve written but think you are missing the correct alignment of the mobile data timeline — mobile data had standards because it was forced to. It was forced to early because it was not a fundamental innovation, telecom itself was the fundamental innovation, mobile was a constraint relaxation. Intelligence might be forced to have standards as well, we will see what form the regulations take when prices reflect costs and healthy margins and become existential threats for many businesses.
2 replies →
You lay out some good arguments but I agree with both: the models relative to few years back really did become the commodity because today you could take the non-frontier model, maybe self-host it or pay the much less price per M tokens to get the performance of a ~2-year old frontier model. At the same time I do think that we are getting into the monopoly/duopoly/tripoly with the frontier models for all the reasons you already mentioned, and this scares me a little bit.
1 reply →
I doubt it. Compute costs tend to crater over time, and LLMs will almost certainly plateau. So the opposite is almost certainly the case: over time, it becomes cheaper to train.
By that time, there would be a clear monopoly or duopoly that controls the revenue, distribution, and compute.
Possible, but only if there’s regulatory capture. Which will be difficult since any country could produce a SOTA model that everyone can use for free