Comment by aurareturn

20 hours ago

More signs that xAI might be giving up on the AGI race. xAI let Cursor train a model on Colossus 2, gave the entire Colossus1 to Anthropic, and is now giving compute in Colossus2 to Anthropic as well.

Bad read on the situation. xAI has too much compute and not enough customers using it. They have around half a million GPUs, some of which are stolen from Tesla, running at 11% utilization. xAI predicted more people would be using Grok, but Grok is not a SOTA model & users primarily want to use SOTA models. They have excess capacity and it makes sense to rent out GPUs to other customers while they improve their models.

  • Grok is also tuned to align with Musk's personal beliefs. I wouldn't touch it with a 10 foot pole.

    • I keep hearing this statement, and it always makes me wonder if people have actually used Grok…

      I have a Claude Max plan I use for coding, but I also have a Grok Lite plan I use for web search type tasks (similar to Perplexity) because I like how the Grok harness handles searches and I don’t need a SOTA model for that use case. I’d never pay $30/mo for a full SuperGrok account but to me it’s worth the $10/mo for Lite as I was hitting limits on the free tier.

      I’ve never noticed it to be particularly biased at least for anything I’ve been searching for on it. And on the other side, I’ve never noticed it to be particularly less censored or anything compared to other models either (also a claim I’ve heard a lot about Grok but I think because it is/was part of their marketing).

  • Why are they selling compute instead of using it to build that SOTA model?

    • They tried and failed. xAi made a mistake building Colossus 1 and ended up with heterogenous cluster of H100/H200/GB200 GPUs. This is a nightmare to train huge models on because each card has different specs, features, and hardware requirements. During gradient synchronization, a heterogeneous cluster would bottleneck on the slowest GPU (H100) so the faster GPUs would end up idling. They also probably ran into unexpected compatibility issues, which are difficult to resolve.

      It makes more sense to use this cluster for inference, since they can segment the cluster by GPU type and avoid GPU mixing. xAI doesn't have enough inference customers so it makes sense to monetize this to companies that need inference compute such as Anthropic or Cursor.

      Apparently xAI will try building SOTA models on Colossus 2, which will be built on Blackwell GPUs only.

      3 replies →

  • It is a race that has a flywheel effect.

    Once xAI training team “fix” their model, where will Anthropic be then?

  • More people should try Grok. I don't use it for coding but it's replaced a lot of my ChatGPT usage. Definitely more perferred model for quick questions or easy answers.

    • One thing I do like about Grok is that it makes it stupid easy to see what its referencing, and gives you the links to those resources. Which most models sometimes either don't bother, or don't do much of a good job of doing. It's not the top model, but it is definitely high up there, people's blind rage for anything Elon Musk is the only reason most people don't realize how capable it is unfortunately. Grok is not exclusively made by Elon Musk, there's definitely other engineers working day and night on it.

      4 replies →

  • It's not stolen if it was taken from Tesla, investors already agreed that Elon can do anything he pleases with their money.

Elon lost his lawsuit with openAI and knows xAI isn't on the same trajectory. Might as well try to win the bet and flip off Sam by supporting the best competition. Also they are getting a head start on AI as a commodity. I'm sure there's plenty of money to be made for those that can leverage their capital to essentially rent capacity right now. If he's not making enough off of grok, might as well cover their expenses.

It was kinda obvious when SpaceX "acquired" it. Elon rewarded xAI investors/prevented lawsuits by giving them SpaceX equity, and that was that.

FWIW, SpaceX (parent company of xAI) has an option to acquire Cursor for $60B that expires 7 days after their imminent IPO.

  • Do people still use Cursor? My company’s leadership has been clear that Cursor was cool for a hot minute but you Should Not be using it anymore

    • It’s a very fractured and heterogeneous landscape where your own perspective will be warped by your personal experience.

      Anthropic has a lot of the market share and dominates the mind share, but each of Codex, Devin, Cursor, Claude, et al have significantly more market usage than they had 6 months ago and each are likely still growing very quickly based on publicly-reported information.

xAI might acquire Cursor. They are in the process of training new coding models and probably a new Grok.

Until they finish training, it makes sense to rent the excess capacity.