Comment by harrouet

12 hours ago

This is Apple commoditizing LLMs while keeping control of the UX.

They are a hardware company and will keep selling the best machine for AI use. Well done.

Benedict Evans may be right after all; frontier models look more and more like telecom companies in the 90s. Billions and billions of investment in infrastructure while others further up the stack captured all the value.

  • There will be frontier models that are non-commoditized, but they'll be kept guarded and hidden away, and you'll only get the final result, so that they can't be distilled and their harness can't be reverse engineered. They'll be billed like employees, rather than like a tool.

    • The non-commodity network services of the early 1990’s and the non-commodity 3d graphics hardware of the mid-1990s made the same argument.

    • I doubt that. What stops the Chinese labs from figuring it out? It’s not like these models are fundamentally different from each other

      6 replies →

    • Isn't that what they are doing already? The model is already guarded and hidden and i only get to send it what i want. Talk with it to clarify my requirements. And i can switch to a different provider for cheaper/better results.

    • I think this will be isolated to highly specialized fields where training data will need to be selectively curated.

  • In spite of their deeper pockets, massive datacenters, colosal amounts of user data, and hundreds of thousands of top developers, even Amazon, Meta, Microsoft, and Google are well behind.

    I think Evans is completely wrong. There are only 2 truly frontier models. (at least for now). And Anthropic seems to be leaving OpenAI behind so there might be only 1 in the near future. (which is scary/dangerous)

    • >I think Evans is completely wrong.

      I wish there was a case where I find Evans is wrong. As far as my memory served me, I failed to record a single one.

      I disagree that Amazon, Meta, Microsoft, and Google are "well" behind. If anything the frontier model advantage seems to be at best 6 - 9 months. And that the Chinese model are all doing well.

      One of Steve Jobs's line, "It is a feature, not a product." Even if Apple were a generation behind or 1 year behind frontier model. The advantage of default is enough to hold a lot of its user.

      To put it simply, even if OpenAI or Anthropic were better, there is zero chances they would topple Apple in hardware sales, user or ecosystem. On the other hand, even if Apple's AI were 6 - 9 months or a generation behind, most user would settle for it and damage OpenAI / Anthropic.

      5 replies →

    • Remember the implicit “pareto” in “frontier models”.

      Anthropic and OpenAI are far behind state of the art for the entire curve except the “extremely expensive for barely measurable improvements” part.

      GLM is probably the third most expensive frontier model (benchmarks and reviews will say for sure), and is apparently ~Opus 4.6 for 10% the inference cost.

      The last I checked, qwen was still owning the 24-32GiB RAM range (it runs reasonably without a GPU!) and somewhere around 3.5-4 generation models.

      Also, even anthropic says Mythos ~= ChatGPT 5.5, so it’s unlikely either one is leaving the other behind. The big problem they both have is they asked for the government to gate keep model releases and use cases, and their wish was granted.

      That’s knocked them back 6 months already. Anthropic’s only frontier offering has been taken down.

    • I use both Claude and Codex and don’t see any meaningful difference between the two. My use case is modeling semi complex physical processes (energy and manufacturing) in code for simulations. I also have to do a good fair of automation via scripting in Python or PowerShell for manipulating data as well as legacy code analysis (C, Fortran, COBOL). Given I provide the models with the information and documentation they need, both perform very similarly. I recently did a full codebase review (for design patterns and vulnerabilities) and both Codex and Fable agreed 100% about the most critical findings. I do very little front end development, although some of my automation scripts have TUIs and again no problem with either Claude or Codex generating them for me. At this point I go with the less expensive, which seems to be Codex. With the $100 plan I rarely hit the limits. With Claude I max out my plan in about 4-6 hours of work.

      5 replies →

    • That's true now, but long-term (maybe just a few years) it doesn't seem feasible for the status quo to continue from a financial point of view.

      Spend for compute seems like it needs to increase to get the next iterations of models, and even if they IPO the money might run out before they can solidify their revenue streams.

      All while Google just needs to survive long enough with their good-enough models and do it without really putting themselves in any existential financial risk.

      And ideally the chinese models are also still there keeping everyone honest.

      The true dystopic worst case is a Google monopoly on cutting edge AI.

    • Is Google behind? The general opinions I read suggest Gemini is very competitive with Anthropic and OpenAI's top models.

    • I think it's highly likely that there will remain one or two companies on the very bleeding edge of AI development for the foreseeable future.

      But what I think a lot of people miss is that the market for the truly bleeding edge (developing bio-tech, building the most sophisticated software stacks (probably with a tilt towards simulation, GPU kernel optimization, etc)) is not the whole market.

      There's a plethora of use-cases for models that are not on the bleeding edge. If I can solve my relatively simple problems with an off-the-shelf model for a minuscule fraction of the cost of the frontier, I'm going to.

      2 replies →

    • > I think Evans is completely wrong. There are only 2 truly frontier models. (at least for now). And Anthropic seems to be leaving OpenAI behind so there might be only 1 in the near future. (which is scary/dangerous)

      Truly fascinating ecosystem and community in general, as experiences differ so wildly. Anthropic's models seems far behind OpenAI to me, especially when you get into "Pro" territory, and there doesn't seem to be any worthy competition to Pro Mode available at all.

      And this is said with someone who use both platforms, and spend a lot of my day interacting with agents and LLMs in various ways. The interesting part is that probably so do you too, and probably your experience and what you share lines up with what you experience! Yet we come away with basically opposite takeaways :) I don't think either of us are wrong either, somehow.

      12 replies →

    • I'm perfectly happy at claude opus 4.6. All improvements since then have not meaningfully improved my day to day. If i can get 4.6 on my laptop for 5-10k, i'd gladly start shifting my ~1k/month Anthropic spend over.

      Some of the harness even let you run a local model for most things, and only pay for the latest frontier models when needed, which cuts down cost drastically.

    • Maybe I’m alone in thinking this but I think the long term victor will be the one that works out pricing best.

      Fable might well be a better model but it’s too expensive for everyday AI use. Definitely if we’re talking about the kind of stuff you’re going to want to do on your phone. Even for coding, I’m not going to reach for Fable (well, when I can…) for 95% of the work I do.

      I don’t believe a mature AI industry is going to have a one size fits all, single winner.

      1 reply →

  • It is much better. Imagine if the whole Manhattan project could have been outsourced and costs you nothing. I expect in a short time that open source models will be almost or almost parity by 2030 and running on consumer devices.

    • Market phenomena like this are a bit like the Manhattan project in that you pay for it, and make use of it, whether you want to or not. It's functionally very similar to the government doing something.

  • Last I checked the telcos made plenty of money in the 90s. Should Verizon be getting a cut of my Claude Pro subscription, since I use FIOS to access it?

    • I haven’t fact checked, but according to Evans big telecom builders didn’t make a lot of money after all the capacity investment. Some actually went bankrupt or got acquired as distressed assets. Big tech was very profitable monetizing that same infrastructure.

      1 reply →

    • This is what everybody is TRYING to do. They built something and will do everything they can to charge outsized rent on it far past the value it provides to take revenue from anyone downstream.

      The fact that telcos couldn't charge rent was a primary reason the Internet was so successful.

      Remember $0.10 per text message? You bet in some alternate timeline AT&T charges $0.10 per webpage visit and we're stuck on 100kbps connections because the monopoly doesn't want to innovate.

> while keeping control of the UX.

Extremely tangential, but this is my favourite upshot of AI. For decades, companies have been walling off their services and forcing us into their fuckass UIs. Now over the course of the last twelve months, suddenly everything has an MCP and I can use it through my command line chat interface.

Any company that doesn't adapt gets so hammered by people's AI-DIY web scrapers that they have no choice but to cave.

It's been clear for years now that eventually ai will be embedded at the os level. Apple even recognized it way back when they first introduced Apple Intelligence. Yes they're commoditizing llms or whatever. But this has been a user facing feature they've been iterating on for years now

Does “the best machine for AI use” apply here considering these models are still server-side?

  • The play here seems pretty evidence, if I may assume. Apple creates an interface that is generalized enough so you can easily swap models, and while Claude is preferred by Apple today, it may be any provider or even local models in the future, and the APIs the developers use remain the same, so "migration" becomes easier.

  • for the on-device model, yes it runs on the Neural Engine (at the moment) so a newer chip means faster, cheaper local inference. For the server side path this Claude package is about your machine is irrelevant since it's a network call. The same API covers both, so "best machine for AI" only bites when the session is actually local.

    But we can imagine that the balance of what's on-device vs what's remote will move continuously towards the former as time, improved HW and improved local models keep progressing

  • I would think so, as “use” doesn’t specify implementation. If you use a word processor it may be running locally or remotely.

    From a user’s perspective, it doesn’t matter.

  • Apple's been trying to make the marketing appeal that "Private Compute Cloud" is also a hardware project. Given it seems to rely on low level details of device Hardware Security Modules, it's maybe even at least a little bit more than just "marketing spin".

    • looks like it is not "Private iCloud Compute" at all.

      Anthropic literally says "Requests go directly from your app to the Claude API; Apple is not in the request path and does not see prompts or responses." — Apple straight up lied

      1 reply →

Now we only need to commoditize the hardware.

  • Check out AMD’s offerings.

    They’re typically a bit better on high TDP stuff, and a bit worse on low TDP. They mostly match in the middle. I have a $500 AMD NUC and a slightly older $2000 MBP. Inference throughput is within 2x.

    The comparison is a little messy: AMD currently maxes out at 128GB of RAM vs Apple’s discontinued 512. Apple has nothing to rival the Steam Deck.

  • This is what originally made Microsoft the most lucrative tech company of its day.

    Android succeeded at this to an extent with phones, but Apple has been able to keep its products differentiated enough in the minds of consumers to maintain their premium pricing. So far.

How is this Apple keeping control of the UX?

  • The betas of the next OS's include a Siri AI chatbot, and the AI features are built into various parts of the OS. A user has no idea what model is powering any of it - Apple controls the UX.

I think there is an opportunity for a new hardware company to enter the market. I know this is just hypothetical but I believe that AI is revolutionary enough where a new approach to hardware and UI/UX will enable far more value to be derived from AI. I think the incumbents like Apple will stick to their familiar platforms and could get beaten out by a new competitor that is AI native to the core. Maybe? ¯\_(ツ)_/¯

Apple’s play was a masterclass - unsure how deliberate it was, or how much of a choice thy actually had, but it’s turning out pretty well IMO.

Now if they can further reinforce their angle on Privacy, they might continue to be what they are (or more)