← Back to context

Comment by wqaatwt

11 hours ago

Why? Well it depends, most evidence is suggesting that Anthropic and OpenAI are making a lot of money on inference so the question is whether its more profitable for them to sell 100X tokens for Y, or 1X tokens for 100Y. In most industries with high fixed costs and low variable costs and unlimited scalability (like LLM providers) the first option ends up being much more profitable

Literally nobody is making money on inference

  • Based on what? There isn’t a lot evidence that’s the case..

    Prices on OpenRouter for GLM and other large open models indicate that Anthropic/OpenAI must have pretty high gross margins even if their models are several times more expensive to serve.

    It wouldn’t make sense for any provider to host large open models and then loss $10 on every $1 they make since they don’t have infinite VC money or any business model that would justify it.

    • If they had high margins they wouldn't be issuing senior debt with a 18.5% coupon payment (and failing to fully subscribe it), nor would they need Elon to give them two months of free compute in order to appear profitable for a single quarter.

      3 replies →