← Back to context

Comment by fakedang

7 days ago

Yeah honestly. They could just try selling solutions and SLAs combining their TPU hardware with on-prem SOTA models and practically dominate enterprise. From what I understand, that's GCP's gameplay too for most regulated enterprise clients.

Googles bread and butter is advertising, so they have a huge interest in keeping things in house. Data is more valuable to them than money from hardware sales.

Even then, I think that their primary use case is going to be consumer grade good AI on phones. I dunno why Gemma QAT model fly so low on the radar, but you can basically get full scale Llamma 3 like performance from a single 3090 now, at home.

  • https://www.cnbc.com/2025/04/09/google-will-let-companies-ru...

    Google has already started the process of letting companies self-host Gemini, even on NVidia Blackwell GPUs.

    Although imho, they really should bundle it with their TPUs as a turnkey solution for those clients who haven't invested in large scale infra like DCs yet.

    • My guess is that either google want's a high level of physical control over their TPUs, or they have one sort of deal or another with NVidia and don't want to step on their toes.

      And also, Google's track record with hardware.

    • Its the same format as other software - you release the actual software for free but offer managed services that work with that software way better and easier.

      1 reply →

  • It’s my understanding that google makes bulk of ad money from search ads - sure they harvest a ton of data but it isn’t as valuable to them as you’d think. I suspect they know that could change so they’re hoovering up as much as they can to hedge their bets. Meta on the other hand is all about targeted ads.

    • Right so keeping things in house and seeing what people are asking Gemini would be probably better for them?

Relenting hardware like that would be such a cleansing old-school revenue stream for Google... just imagine...