Comment by fooster
1 day ago
Where is your evidence of this "massive cost"? Inference is massively profitable for both anthropic and openai. Training is not.
Inference cannot happen without training the model first, so the distinction is quite pointless.
The evidence is that quotas exist, as seen here, and are low enough that people are hitting them regularly. When was the last time you hit your quota of Google searches? When was the last time you hit your quota of StackOverflow questions? When was the last time you hit your quota of YouTube videos? Any service will rate limit abuse, but if abuse is indistinguishable from regular use from the provider's perspective, that's not a good sign.
It's also kind of interesting that they don't think they can do what an economy would normally do in this situation, which is raise prices until supply matches demand. Shortages generally imply mispricing.
There are a lot of angles you can take from that as a starting point, and I'm not confident I fully understand it, so I'll leave it to the reader.
The sales pitch is that you can keep throwing more and more tokens at a problem to solve it.
If prices don't keep going down, the pitch falls apart and you're back to needing a specialist to come in and make it work.
Great point.
The parent's argument is that the marginal cost of inference is minimal. The fundamental flaw is that he's separating inference from the high cost of training frontier models. It's a cross-subsidy that can't be ignored.
Without any insider knowledge on the economics of these companies, I suspect it's that the amount of infrastructure you have to build is determined by peak usage rather than average usage. If peak usage is much higher for a small part of one day a week (say on Monday morning as software developers across the US get back to work) the cost of fulfilling demand at all times can be insane. That's why companies are implementing batch/standard/priority pricing for the API.
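To make the peak-vs-average point concrete, here's a back-of-the-envelope sketch with entirely made-up numbers (the loads, per-node capacity, and cost figures are illustrative assumptions, not real data from any provider):

```python
# Hypothetical numbers illustrating why provisioning for peak demand
# is so expensive when average demand is much lower.
avg_load = 40_000     # average requests/sec (assumed)
peak_load = 120_000   # Monday-morning peak requests/sec (assumed)
node_capacity = 100   # requests/sec one GPU node can serve (assumed)
cost_per_node = 30    # $/hour amortized hardware + energy per node (assumed)

# You must size the fleet for the peak, not the average.
nodes_needed = -(-peak_load // node_capacity)  # ceiling division
hourly_cost = nodes_needed * cost_per_node
avg_utilization = avg_load / (nodes_needed * node_capacity)

print(nodes_needed)              # 1200 nodes provisioned
print(hourly_cost)               # 36000 ($/hour, even off-peak)
print(round(avg_utilization, 2)) # 0.33 — two thirds of the fleet idles on average
```

Under these assumptions, two thirds of the capacity sits idle most of the time but still has to be paid for, which is exactly the gap that batch/standard/priority pricing tries to fill by shifting deferrable work off the peak.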
This article convinced me otherwise https://www.wheresyoured.at/the-subprime-ai-crisis-is-here/
This is a great article, thanks for sharing
good article!
The majority of accounts are free - these are profitable?
IMO they need as many users before their IPO - then the changes will really begin.
Inference for API or subscriptions? There is a massive price difference between the two.
You're assuming they can just stop training. For the entirety of these companies' existence, they have trained models; it's part of their cost structure. They must keep pushing out better and better models. That's like saying Nvidia can just stop designing new GPUs because they're obviously making so much money with their current ones.
source?
After googling https://www.reddit.com/r/singularity/comments/1psesym/openai...
I've seen sources like this before. It's all hearsay and promo. I was asking for any publicly available verifiable information regarding the cost of inference at scale. I haven't seen any such info personally which is why I asked.
I'm dying to see an S-1 filing for Anthropic or OpenAI. I don't actually think inference is as cheap as people say if you consider the total cost (hardware, energy, capex, etc.).
4 replies →
>OpenAI's compute margin, referring to the share of revenue excluding the costs of running its AI models for paying users
Huh?
The reddit summary comment makes no sense. How are they getting revenues without ads or paying customers?
"After" makes more sense.
FTA:
>The company has yet to show a profit and is searching for ways to make money to cover its high computing costs and infrastructure plans.