Comment by Animats
4 hours ago
There are real issues on the money front. The big AI companies have a financial model that assumes a huge increase in demand in the next year or two. Otherwise the bubble pops.
"Anthropic, OpenAI and every other AI company deliberately obfuscated these costs because they knew that the second a user actually had to pay for the fuckups of an AI model they’d scream like they were being stung to death by bees."
So some of the growth was purchased by underpricing, subsidizing the customers with venture capital. Uber did that, and eventually got out of it by raising prices and squeezing the drivers.
The "fuckup" problem is real. LLM-type AI exacts huge costs because it is terrible at reporting "I don't know". When it doesn't know, it generates noise and polishes it. If a "confidence too low for output" signal could be extracted, this whole technology would be a lot more useful. You could use small, inexpensive models on small problems, and only use big models when the small models failed. Most customer service bots fit that model. Needing ever-larger models to fix the noise problem is not cost-effective.
>LLM-type AI exacts huge costs because it is terrible at reporting "I don't know". When it doesn't know, it generates noise and polishes it.
>If a "confidence too low for output" signal could be extracted, this whole technology would be a lot more useful
Anthropic's interpretability research explored this topic a bit in 2025. Apparently, the signal is extractable:
https://www.anthropic.com/research/tracing-thoughts-language...
Isn’t the thing that an LLM never knows (at all)? It just guesses words based on the context and previous words and often gets lucky.
It isn’t thinking or knowing and then expressing the resulting understanding but just spitting out contextual words and hoping it reaches a conclusion or ending of some sort.