Comment by simonw

6 days ago

"... as you make more your costs come down"

I'd say dropping the price of o3 by 80% due to "engineers optimizing inferencing" is a strong sign that they're doing exactly that.

You trust their PR statements?

  • Seems more likely to me than them deciding to take a sizable loss on inference by dropping prices by 80% for no reason.

    Optimizing serving isn't unlikely: all of the big AI vendors keep finding new efficiencies, and it's been an ongoing trend over the past two years.

    • This is my sense as well. You don't drop prices by 80% on a random Tuesday based on scale; you do it with an explicit goal of gaining market share at the expense of $$.

> "engineers optimizing inferencing"

They finally implemented DeepSeek's open-source methods for fast inference?