Anthropic is expanding to Colossus2. Will use GB200

1 month ago (twitter.com)

https://xcancel.com/nottombrown/status/2057194829986300375

366 comments

aurareturn

Here's a probably stupid question - if someone were unbounded by ethics and conceivably had enough power and connections to power to shield themselves from many consequences of their actions - and that person owned these DCs, could they in theory observe all the streams of tokens coming in and out of these models, and even exfiltrate copies of these models wholesale to have their own teams do what they will with them in the pursuit of building their own competitive models?

Or is there something fundamental in the way these models get deployed (encryption or something or than legal contracts?) at this scale that prohibits the owners of the infra from gaining this level of insight / access?

tristanj 1 month ago
1) The situation you described would be covered under the contract between Anthropic and xAI, and that any violation of that would be subject to financial penalties and legal proceedings. The US has a robust corporate legal system, and disputes do get resolved through the court system, although in a slow and costly manner.
The contract can stipulate a penalty at a high enough amount to discourage this behavior.
2) Output from models & intra-datacenter communications can be encrypted if customers truly cared.
3) There is no reason do this, because there are far better ways to exfiltrate data from Anthropic models. Chinese companies are already doing this at an industrial scale where they are reselling Claude tokens for 10-20% of the cost while retaining the data to train their own models. https://www.chinatalk.media/p/how-to-buy-cheap-claude-tokens...
If we look at Deepseek V4-pro, created by Deepseek who Anthropic formally accused of harvesting Claude tokens at scale, it performs the same as Claude did 6 months prior.
- 47282847 1 month ago
  
  > The US has a robust corporate legal system
  Thanks for the chuckle. ;)
  
  12 replies →
- stingraycharles 1 month ago
  
  > There is no reason do this, because there are far better ways to exfiltrate data from Anthropic models. Chinese companies are already doing this at an industrial scale where they are reselling Claude tokens for 10-20% of the cost while retaining the data to train their own models.
  I think you missed the part where Anthropic stopped displaying their thinking tokens over the past few months, and instead now provides “summarized thinking”, letting Haiku summarize Opus’ thoughts.
  So it is now much more difficult (impossible?) to distill the models.
  I also think you over-estimate how well the legal systems works in the US nowadays, and under-estimate just how much power Elon has in the government.
  
  2 replies →
- viking123 1 month ago
  
  I hope the Chinese keep harvesting Claude etc. since they stole all their data anyway, who cares?
- seydor 1 month ago
  
  I guess there's a reason why those Chinese companies are in china
  
  1 reply →
Traster 1 month ago

It's already in the public domain (thanks to the OpenAI trial) that Grok distilled OpenAIs models. Listening to the data going into the models in the data centre would be very similar thing. There's some downsides (you're passively listening, not controlling the queries), and some upsides (way more data). But it only ever gets you to some percentage of the existing production model. It doesn't get you what Musk wants - an AI company capable of designing and deploying leading edge models. It gets you to fast follower status.
rdgthree 1 month ago

xAI had a lot of negotiating power here because Anthropic had ~0 comparable options and ultimately desperately needed the compute now. So, it wouldn't surprise me if data sharing was an explicit part of the agreement
fragmede 1 month ago
There's accusations that the Chinese labs have done essentially that to OpenAI and Anthropic and exfiltrated their models without having DC access, so if you had DC access, yes, you could do that. If you had DC access though you could just copy the model onto an SSD.
- zozbot234 1 month ago
  
  The Chinese labs have been accused of training on elicited chat logs on a massive scale in violation of ToS. That's possibly a real concern, but it's nowhere close to "exfiltrating" the model or even roughly matching its behavior.
HarHarVeryFunny 1 month ago
Bear in mind these data centers were built for X.ai to use themselves, so there would have been no reason to backdoor them. Also, this is all off-the shelf equipment: Dell & Supermicro servers with NVidia compute modules and ethernet switches.
- killerstorm 1 month ago
  
  The fact that X.ai build it for themselves means that they are in full control. It's not challenging to install a rootkit on hardware under your control. Also if traffic within the datacenter is unencrypted, you can probably get copy of all comms via switches.
  It's generally far easier to install spying software under a computer under your control than to detect it.
giancarlostoro 1 month ago
You could, but Grok is pretty high up there, it might not be "#1" but its definitely up there with the giants, people seem to overlook it. Gemini has a similar problem, it was #1 once, and it seems like Google isn't hell bent on chasing #1 they just want to keep iterating over time, they know they just need it to be "good enough" and they'll keep having repeat customers.
If Elon REALLY wanted to do anything like that he would be better off poaching talent from competitors, less legal hell to go through.
- mike_d 1 month ago
  
  You must have missed the part where he did poach all the top talent from Google and OpenAI, and then they all quit because they couldn't stand working for him.
- jatora 1 month ago
  
  No Grok is not 'up there'. Not by any stretch of the imagination is it anywhere close to anthropic or openai in any single domain whatsoever. Not even close to deepseek. It really isnt a good model. Their research team's talent is just not good, unfortunately.
  
  1 reply →
- petesergeant 1 month ago
  
  If I was training a top-tier model, having a competitors’ weights feels like it would be an excellent refining tool. As a user I find cross-model iteration to be a huge power-tool, and presumably the boffins can zero in on areas of relative strength and weakness and work out which area they want massive amounts of synthetic data from.
ncr100 1 month ago
Could they also, if they owned a 10,300 vehicle fleet of orbiting satellites, add radar or optical surveillance to each, and track every human on planet early in real time?
- fragmede 1 month ago
  
  Why would they need to do that, when the majority of those humans are already carrying a radio transceiver with them at all times.
espeed 1 month ago

What prevents a data center operator from reading your chats? [FEATURE] Provide a way to select your data center #56916 https://github.com/anthropics/claude-code/issues/56916
__MatrixMan__ 1 month ago

Given Musk's previous antics, this strikes me as a relevant line of reasoning.
How do you trust space infra? You can't show up and audit some kind of airgap. Whatever encryption keys you have are likely accessible in memory sometime.
Glohrischi 1 month ago

Yes he could do that and I thought about this too. Very weird tbh

urams 1 month ago

Very possible Elon is doing this to make give Anthropic better chances against OAI while he attempts to reshape xAI.

Also possible he sees infra as the future of xAI if he really believes in the value of space compute.

Hard to see this any of this as anything other than a bearish sign for Grok though.

tristanj 1 month ago
That's very doubtful given recent news that Colossus is running at 11% capacity and has hundreds of thousands of idle GPUs. xAI acquired too many GPUs and currently doesn't have enough customers to use them. That's why they are making compute deals with Anthropic and Cursor.
xAI is bleeding money and this compute deal with Anthropic will pay for all of xAI's capex ($25 billion) in 2 years.
- petesergeant 1 month ago
  
  > will pay for all of xAI's capex ($25 billion) in 2 years
  This is a confusing framing; you pay down capex with profit not revenue, and there is presumably a high opex cost.
  
  3 replies →
- shenberg 1 month ago
  
  11% MFU does not mean 89% of GPUs are idle, it means that they're using the GPUs ineffectively.
  
  5 replies →
- andriy_koval 1 month ago
  
  > xAI is bleeding money and this compute deal with Anthropic will pay for all of xAI's capex ($25 billion) in 2 years.
  minus maintenance and energy cost.
utopiah 1 month ago
> space compute
Heard anybody remotely competent about space talk the topic? It's pretty much a literal laugh every time.
- Forgeties79 1 month ago
  
  It is absolutely baffling to me how frequently I hear people talk about this around me. There is no way this is happening anytime in the near future.
  
  7 replies →
- wongarsu 1 month ago
  
  Some of the proposals are laughable, some of the example calculations are. The idea of running AI training sounds extremely challenging. But the idea of inference in space doesn't seem absurd
  The power budget of a starlink v2 mini satellite is estimated at around 20kW based on the known solar panel size. That also matches what a satellite of that size would roughly dissipate without dedicated radiators, using just some heat pipes to spread the heat evenly over the satellite's surface. There is nothing fundamentally preventing you from taking the same satellite design, remove most of the comms payload and instead put 15kW of GPUs there. Or about 10 GB200 including the CPUs, networking, etc that you need along with the GPUs
  Now, do the economics work out for $300k worth of compute for each satellite, in an environment where maintenance is impossible and degradation will be higher than on the ground? Probably not right now, but in a couple years they might
  
  15 replies →
- sebmellen 1 month ago
  
  While I agree with you on the merit of the idea, rockets that can take off and then land vertically without damage were also laughable pre-SpaceX.
  
  2 replies →
jrflo 1 month ago

Space compute isn't real though, it's just a scheme to pump the value of SpaceX before IPO by associating it with AI. It's really hard to cool things in space, because there's no matter to transfer the heat away. All you've got is radiative cooling, and that's really really slow.
skywhopper 1 month ago
He believes in the value of the idea of “space compute” for attracting investors to SpaceX. But the existence of the idea of “space compute” as a better way to deploy datacenters (along with everything else Musk has claimed in the past decade) should give everyone pause as to the plausibility of literally everything else he says.
- oblio 1 month ago
  
  > should give everyone pause as to the plausibility of literally everything else he says.
  That ship has sailed, we're in the Age of Cults. If you're a believer you're probably also invested in his companies, so your mind is doubly-clouded.
  
  1 reply →
- jpadkins 1 month ago
  
  > along with everything else Musk has claimed in the past decade
  * mass produced EVs * Neural links to brain * reusable rockets * more efficient tunnel digging
  Dude has a pretty good record of taking stuff in R&D and making pretty real products and/or companies. Can you name someone better?
  
  1 reply →
- pyrale 1 month ago
  
  > He believes in [...]
  ...the value of having others buy the idea of space computing. I don't think he himself believes in what he says.
jameson 1 month ago

Right, also Anthropic has been having difficult time getting more GPUs
giancarlostoro 1 month ago

I think its a mix of both of your first two points. He has hinted that he will be producing chips for Tesla and SpaceX here in the US in the future, so there will be a time where Elon will have every critical component we need for Tesla and SpaceX here in the US and not overseas, and fully in-house. Once he does this, nothing stops him from selling components. He's also insanely efficient at standing up data centers, so I would not be surprised if he gets enlisted by someone like Anthropic to setup infrastructure for him.
notfromhere 1 month ago

xAI is a dead company; you don't sell compute if you're growing.
More promising is that cursor is training a model using it.
mitjam 1 month ago

Interesting that nobody mentioned SAR in this thread, yet.
SAR benefits from space compute by reducing the strain on downlinks (pre-process, detect in space). A large-ish (think 6k sat) constellation of much bigger sats (2-2,5 ton) than current Starlink sats can run inference on board, distribute load to nearby sats, and enable full round-the-clock surveillance of the whole world.
The nominal compute capacity is in ballpark range of a modern AI data center, but it's only about 20% used on average due to duty cycles. An indestructible, global eye.
The big Starship launch vehicle is perfect platform to bring them into orbit, can maybe bring 40 at a time into orbit, so 150 launches for 6k. Maybe even fewer sats are needed, depends mostly on electrical efficiency of the components.
Glohrischi 1 month ago
That would be absolutly ludicris.
His own product is competing against anthropic.
- cbm-vic-20 1 month ago
  
  It's a money maker. Maybe not a number one spot, but they don't have to stand up any more datacenters.
  
  1 reply →
fillskills 1 month ago
We could see the first company vertically integrated from etching to chip to data center
- AlotOfReading 1 month ago
  
  That's Intel. Probably IBM too, though they've been doing mass manufacturing with TSMC (and GloFo before) for years instead of their fabs. I wouldn't be surprised if HP did similar things back in the 80s.
- jacobrast 1 month ago
  
  What do you mean by etching? Google does also it's own chip design with TPUs, data centers, and models but afaik only TSMC Intel and Samsung do the actual semiconductor fabrication
  
  4 replies →
- Glohrischi 1 month ago
  
  We wont for anytime soon.
  Intel and TSMC are not what they are today just because they buy a very expensive EUV Machine from ASML but because they have the knowledge and infrastructure to even use these machines.
- sofixa 1 month ago
  
  IBM was doing this since the 50s.

alienreborn 1 month ago

Why is xAI giving up their advantage? Is this a signal that their frontier model improvements are plateauing and decided there is no value in hoarding all their compute?

dktp 1 month ago

I would guess it's purely because Grok isn't nearly in-demand enough to produce meaningful revenue. And they want to juice the numbers for IPO
And I'm sure it's a bonus point for Musk that it goes to OpenAI's most relevant competitor
SoKamil 1 month ago

As weird as it seems I think this is Musk’s best shot at winning over Altman. He has personal vendetta.
ifwinterco 1 month ago

xAI have (with some questionable ethics) managed to actually build a data centre, so they have a ton of compute but not much inference demand for their model which is second-tier.
Everyone else trying to build data centres is really struggling (turns out building physical things is not as easy as writing code, who knew).
So Anthropic have the model but they're compute starved because everyone else they've signed agreements with still mostly have piles of dirt and the GPUs are still in Nvidia's warehouse somewhere.
It's a bit of a win-win: xAI's financial numbers will be massively improved by the revenue using otherwise useless data centre capacity, Anthropic get the compute they desperately need albeit probably paying through the nose for it
SilverElfin 1 month ago
Elon got rid of most of the core xAI researchers recently. It looked like firings and he said something like Grok needs to be rewritten because it was fundamentally built in a bad way. My guess - Grok was not one of the top two models and had no traction beyond users on Twitter - so investing in it did not make sense. Or maybe he needs time to build a new team and rebuild Grok, and while he’s doing that, he needs revenue to make up for all the GPUs they bought.
Either way, I doubt he has any ‘real’ advantage in this game. It’s OpenAI and Anthropic, and then everyone else including the open weight models.
- vessenes 1 month ago
  
  Boy I’d love to have the sort of ‘no real advantage’ that got me $1.9bn in revenue per month from Anthropic.
  Make no mistake - Elon’s charging premium pricing because he has something Anthropic realllly needs. In exchange, he will get significant revenues added to the balance sheet along with the cursor team right around IPO. It’s very good business for Elon and SpaceX.
tw04 1 month ago
What advantage? Has there ever been any indication they’re leading in any segment? Sure Elon has thrown a bunch of money at hardware, but to what end?
And frankly as bad as Altman is from a: if AI is really going to disrupt humanity do I want this guy in charge? Elon is 10x worse. So why would the best and the brightest ever work for him?
- abraxas 1 month ago
  
  No other LLM has made as much child porn as grok so there is that...
- eightysixfour 1 month ago
  
  In a compute starved world, big ass data centers are an advantage.
  
  8 replies →
- Rover222 1 month ago
  
  The datacenter advantage, obviously
guluarte 1 month ago
I think they overestimated the demand for Grok, which is mostly useless, and now they have too much compute on hand.
- bwfan123 1 month ago
  
  I see this as a huge warning sign. If a frontier ai lab is in this position of renting their own capacity, imagine how much overcapacity there is in the system.
  Outside of VC money, and circular financing, the only external money coming into ai are into open-ai, and anthropic via their subscriptions and APIs.
  
  5 replies →
HarHarVeryFunny 1 month ago

Musk's goal with X.ai from the start seems to have been primarily to compete with OpenAI, and given that Grok itself isn't doing that (not even mentioned in competitive benchmark tables), and his data centers are therefore mostly sitting idle, this deal with Anthropic helps him achieve that goal by helping OpenAI's #1 competitor.
Note how the initial deal was only for the older Colossus #1 data center, and now after losing his lawsuit against OpenAI the Anthropic deal has been expanded to Colossus #2 also. Coincidence?
Computer0 1 month ago
When did xAI have an advantage?
- aurareturn 1 month ago
  
  Advantage in training compute.
viking123 1 month ago

xAI advantage is that it's not censored, that's why people use Grok

gaze 1 month ago

I think that's the datacenter with the gas turbine generators that operate without permits because they're "portable." Data centers have tremendous externalities but colossus is a particularly nasty offender, and not just due its size.

Edit: They did it with Colossus and now they're doing the exact same thing with Colossus2. https://www.selc.org/news/xai-built-an-illegal-power-plant-t...

robwwilliams 1 month ago
The newer location is about 3 miles southwest of the Memphis Airport (MEM), one of the world’s largest cargo airports and the center of FedEx operations (500 take-off and landings per day most concentrated in a 6 h FedEx window with lots of engines running on ramps and that produces about 2000 tons of NOx per year).
I live about 18 miles downwind of the new Colossus sites, the airport, and lots of truck logistics sites, and a large refinery.
I definitely will be getting 2x exposure to ozone and particulates from both Colossi when they are running full bore. Plus an extra dose of ultrafine particulate with my morning fresh air.
Yes, wouldn’t it be nice to be in Nashville instead with HCA, Oracle, many insurance and financial institutions, and the joy of country music.
As an avid Opus user I am in an ethical Nimby bind. We do need almost any investments we can get in Shelby County TN. I’ll take Anthropic in preference to Grok NOx. And it will be my NOx.
- bob1029 1 month ago
  
  The biggest issue with the interim onsite generation is the lack of meaningful stack height on the generating units.
  Airplanes by virtue of their mode of operation stay out of the unhappy regime most of the time. Also, engines at/near idle produde orders of magnitude less emissions. Those aeroderivative generators are running at full capacity 24/7.
  Dumping exhaust at ground level continuously is probably much worse than the airport. Even if it's a FedEx world hub.
- superloika 1 month ago
  
  Cattle can't choose how to be slaughtered, though.
- vasco 1 month ago
  
  > I’ll take Anthropic in preference to Grok NOx
  It's the same datacenter? Ran by the same people?
  
  1 reply →
- Melatonic 1 month ago
  
  Hey Siri - can you please order this guy a spare set of lungs ?
  
  1 reply →
- monster_truck 1 month ago
  
  Your math is way wrong. The airport is and will be far, far worse. You didn't even mention all of the lead
  
  2 replies →
Kelteseth 1 month ago
So for a guy that literally has a company that produces batteries and solar panels, choose to use gas turbines. So much for saving the planet.
- renticulous 1 month ago
  
  He is building the skill tree in such a way that he is prioritising speed rather than environment.
- _heimdall 1 month ago
  
  Solar and batteries are a bad choice for a constant 24/7 load.
  That's the exact reason we will never go fully solar (or wind) unless an insanely impressive battery breakthrough makes storage effectively free while using only common, renewable components rather than rare earths.
  Solar, wind, etc are excellent parts of an energy system, but its nearly impossible to cover base load at scale with generation that may only run for 0-5 hours a day.
  edit: typo
  
  6 replies →
- rdtsc 1 month ago
  
  > So for a guy that literally has a company that produces batteries and solar panels, choose to use gas turbines
  “Sorry, we have to shut down your database because it’s been overcast for the last two days”
  Now, they could have a mix, of course, but just running on solar and batteries at that power is not realistic.
  
  1 reply →
hemul3n 1 month ago
I dug into this topic in some detail on my blog and it's both enraging and depressing.
https://poiesic.com/posts/pattern-recognition
- Torn 1 month ago
  
  This is a really great watch from Benn Jordan on the topic https://www.youtube.com/watch?v=_bP80DEAbuo
  Some wild things happening with those, and infrasound. Colossus is shown 4 mins in
  
  5 replies →
- HDThoreaun 1 month ago
  
  I dont know, Id say the enraging thing is that the government is so incompetent and unable to expand electricity supply that datacenters are forced into using loopholes to get power the only way they can.
  
  7 replies →
georgemcbay 1 month ago
Yup, and now Anthropic is complicit in the environmental damage and health problems for local residents that these data centers are causing.
But hey, number must go up, right?
- idle_zealot 1 month ago
  
  Have you considered that the march of progress requires human blood to grease the gears and mulched skulls to pave the (highly efficient) road? Really, when you take into account all of the future lives this will improve and save it's difficult to claim any cost now is too high. Would you stand in their way and delay the day that Mythos cures cancer?
  This is a joke. Read it in a mocking tone.
  
  20 replies →
- broknbottle 1 month ago
  
  Some of you may die, but it’s a sacrifice that I am willing to make.
  
  3 replies →
- shimman 1 month ago
  
  Do you think Boris cares about people getting cancer and dying from these data centers? No, he cares about becoming rich as fuck.
  
  3 replies →
- jquery 1 month ago
  
  Given how much our EPA has been gutted by the current administration, I don’t think relief is very likely.
stronglikedan 1 month ago

Seems like selc's time would be better spent trying to close the loophole that allows for unpermitted turbine generators instead of going after one company for doing what they were allowed to do when they did it.
tootie 1 month ago

And Anthropic can pay money to Musk to absorb all that liability while they keep training models.
SilverElfin 1 month ago
So much for Dario’s ethics. He happily partners with Elon. He seems like just another power hungry monopoly seeking liar.
- chrisldgk 1 month ago
  
  That’s because he is. None of these people are your friends and all of them will fuck you over if it means getting richer and more powerful.
- znpy 1 month ago
  
  > So much for Dario’s ethics. He happily partners with Elon. He seems like just another power hungry monopoly seeking liar.
  Dario has been glorified unnecessarily. He's just like all the other people in the space: not good, not bad.
  And keep in mind that when Dario was opposing AI usage by the US State he wasn't really opposing, he was just saying "not yet".
bvcp 1 month ago
this is why ai in space will win
- _heimdall 1 month ago
  
  The physics of data centers in space will be extremely difficult and expensive to pull off in any meaningful timeline.
  I fully expect "space AI" to be about as realistic as the flying cars and hover boards we've been promised since televisions were black and white.
  
  2 replies →
nelox 1 month ago
xAI’s turbines produce meaningful local/regional pollution (especially NOx in a vulnerable area) but represent a rounding error nationally and globally.
- palmotea 1 month ago
  
  > xAI’s turbines produce meaningful local/regional pollution (especially NOx in a vulnerable area) but represent a rounding error nationally and globally.
  If you shoot someone in the face, it will produce a meaningful increase in local/regional murder, but represent a rounding error nationally and globally.
  
  3 replies →
- cududa 1 month ago
  
  It doesn’t matter if people have to suddenly live by gas turbines that run 24/7 because why again? Can you repeat that last part back to me but say it a little dumber for me?
- glaslong 1 month ago
  
  farting in a crowded elevator because the people outside the elevator won't even notice
- throwaw12 1 month ago
  
  multiply rounding errors by thousands and you get somewhat meaningful impact. you are underestimating the scale of independently small pollutions
redox99 1 month ago
What's the tremendous externality of gas generators? People heat their own homes with natural gas and it's no big deal. How can a datacenter that is miles away be worse than that?
- coliveira 1 month ago
  
  How can something that is a million times bigger than your home be any different, right?
- m4rtink 1 month ago
  
  Its totally inefficient - burning the same gas in a co-generation plant, ideally combined with district heating, would produce the same amount of pollution and basically make use of all the energy.
- jLaForest 1 month ago
  
  The gas furnace in my basement don't have a massive jet turbine emiting high frequency noise
  
  4 replies →

aurareturn 1 month ago

More signs that xAI might be giving up on the AGI race. xAI let Cursor train a model on Colossus 2, gave the entire Colossus1 to Anthropic, and is now giving compute in Colossus2 to Anthropic as well.

tristanj 1 month ago
Bad read on the situation. xAI has too much compute and not enough customers using it. They have around half a million GPUs, some of which are stolen from Tesla, running at 11% utilization. xAI predicted more people would be using Grok, but Grok is not a SOTA model & users primarily want to use SOTA models. They have excess capacity and it makes sense to rent out GPUs to other customers while they improve their models.
- kibibu 1 month ago
  
  Grok is also tuned to align with Musk's personal beliefs. I wouldn't touch it with a 10 foot pole.
  
  15 replies →
- coliveira 1 month ago
  
  It's not stolen if it was taken from Tesla, investors already agreed that Elon can do anything he pleases with their money.
- jstummbillig 1 month ago
  
  Why are they selling compute instead of using it to build that SOTA model?
  
  4 replies →
- aurareturn 1 month ago
  
  It is a race that has a flywheel effect.
  Once xAI training team “fix” their model, where will Anthropic be then?
- CSMastermind 1 month ago
  
  More people should try Grok. I don't use it for coding but it's replaced a lot of my ChatGPT usage. Definitely more perferred model for quick questions or easy answers.
  
  5 replies →
papascrubs 1 month ago

Elon lost his lawsuit with openAI and knows xAI isn't on the same trajectory. Might as well try to win the bet and flip off Sam by supporting the best competition. Also they are getting a head start on AI as a commodity. I'm sure there's plenty of money to be made for those that can leverage their capital to essentially rent capacity right now. If he's not making enough off of grok, might as well cover their expenses.
paxys 1 month ago

It was kinda obvious when SpaceX "acquired" it. Elon rewarded xAI investors/prevented lawsuits by giving them SpaceX equity, and that was that.
gen220 1 month ago
FWIW, SpaceX (parent company of xAI) has an option to acquire Cursor for $60B that expires 7 days after their imminent IPO.
- derwiki 1 month ago
  
  Do people still use Cursor? My company’s leadership has been clear that Cursor was cool for a hot minute but you Should Not be using it anymore
  
  1 reply →
DeathArrow 1 month ago

xAI might acquire Cursor. They are in the process of training new coding models and probably a new Grok.
Until they finish training, it makes sense to rent the excess capacity.

amazingamazing 1 month ago

I don't see a scenario where it really makes sense to be a frontier lab long term. Eventually model quality will plateau then you distill and get 90% for 10% or less cost.

hungryhobbit 1 month ago

General AI is that scenario. The investor dream is that their horse hits general AI first, patents it (or otherwise somehow stops the competition from hitting it), and then reaps the massive benefits.
I'm not saying it's a likely scenario, but I genuinely believe a big percentage of AI investment revolves around that (or similar) scenarios.
baq 1 month ago
Opus and GPT are so different there’s room for both and I wouldn’t be ignoring Gemini even if it isn’t ahem great at coding, since it’s quite obviously very good otherwise.
In the times before you also would rather have very smart people working together instead of one very smart dude alone even if he had an identical twin.
- olmo23 1 month ago
  
  My go-to usecase for Gemini is summarizing Youtube tech-influencers.
  
  5 replies →
twoodfin 1 month ago
Over the past 6 months, Anthropic has made more waves as a product company than a frontier lab.
- amazingamazing 1 month ago
  
  Cant wait to see their books once they IPO
usef- 1 month ago

This might come down to when you expect plateau to happen. You could have said similar about transistor density many decades ago.
(I'm not denying it could happen next year for all we know. But we simply don't know, and from what I hear from researchers the breadth of ideas we've tried are still small)
energy123 1 month ago

Plateau on a saturated benchmark where an asymptote to 100% is mathematically baked in?
Or plateau as in it won't solve any Millennium Prize problems within the next decade?

stevefan1999 1 month ago

Aren't Anthropic afraid of Elon siphoning the model weights out from the network buses?

wongarsu 1 month ago
Is xAI a competitor worth worrying about?
They make good models, at times SotA (at least if you don't need coding, their last good coding model was six months ago), with lower safeguards than either Anthropic or OpenAI, and they still fail to capture meaningful market share or mind share. The name Grok is tainted by the twitter bot of the same name operated by xAI/X. Being owned by Musk lets the company appear unstable and untrustworthy in the minds of many. Their marketing game is just bad all around. They struggle to retain top talent.
Maybe their next model will be great. I doubt it will matter. I doubt xAI siphoning off Anthropic models and distilling that would matter. Model performance is not the main factor dragging down xAI
- sometimelurker 1 month ago
  
  > Is xAI a competitor worth worrying about?
  they have GPUs so yes
lavezzi 1 month ago

Concern maybe elsewhere, but I like how we have gone full circle.
SpaceX expects to complete its $60 billion acquisition of AI coding startup Cursor roughly 30 days after it begins trading publicly, according to Bloomberg.
Cursor's Composer 2 model was built on Kimi K2.5, who were in turn accused of 'distillation' attacks by Anthropic.
Anthropic now relies on SpaceX for compute demands.
nroets 1 month ago

Theft of trade secrets. And so many people will have to be involved that evidence of the crime is bound to leak out.
aurareturn 1 month ago
Pretty sure models are encrypted all the way.
- ACCount37 1 month ago
  
  Can't run inference on encrypted weights and get any kind of performance out of it.
  
  6 replies →
- davedx 1 month ago
  
  Dude, Chinese labs distil attack via the APIs, if Musk wanted to do something like that, technically he could. Legally it would be a giant slam dunk liability though
  
  2 replies →

brightball 1 month ago

https://x.com/nottombrown/status/2057194829986300375?s=20

chinathrow 1 month ago

I use Claude daily but I do not want that my spend is going towards Elon.

giancarlostoro 1 month ago

Might want to check the Anthropic parking lot, this might have already been happening.
jpadkins 1 month ago

I see the programming was effective on you.

try-working 1 month ago

xAI cannot train models. Anthropic cannot do inference. The roles of these two companies have already been decided; one as a pure inference provider, the other as a pure model manufacturer.

Lucasoato 1 month ago

Ok, maybe this question is a bit silly, but could it possibly be that Elon is doing this to steal Anthropic models secrets and using them to improve Grok?

aurareturn 1 month ago
No. Model weights are encrypted. Elon is doing this because Anthropic is desperate for compute, SpaceX needs to juice up its balance sheet before IPO, and xAI model team isn't competitive.
- pona-a 1 month ago
  
  Homomorphic encryption doesn't exit at any meaningful scale. The weights must still be stored plain in the RAM for inference to work
vessenes 1 month ago

Love the curiosity. I recommend you spend that curiosity on figuring out why both companies are in the situation they are that this deal is a win-win: Elon has provided a masterclass in public company launch and management over the last 15 years. (And in recent years done it amidst an incredibly divisive reputational environment).
Put another way - they could maybe try to do this. But it would be worth literally nothing compared to the actual benefits to SX of this deal. And add an immense amount of risk. There’s much more clever and interesting things to do.
alecco 1 month ago

The model weights, sure he can "steal" them. But do what then? Serve it on the side?
Even distillation would be obvious.
And I wouldn't be surprised Anthropic is running the smaller inference models, keeping the base large model in machines they fully control/own.
spacebanana7 1 month ago
SpaceX's data centre business is much more valuable than Grok, so this wouldn't make sense.
- stefan_ 1 month ago
  
  Valuable? If SpaceX is a majority "resell electricity" business, its valuation will be a tiny fraction of what they are trying to push.
  
  1 reply →

ChrisArchitect 1 month ago

SpaceX S-1

https://news.ycombinator.com/item?id=48213933

mrcwinn 1 month ago

Anthropic is gross for this. The grandstanding about principles and values is intolerable.

6thbit 1 month ago

This isn't pointing towards a merger is it?

xAI gets the cashflow and makes spaceX bottom line more appealing. But that guy Musk hardly makes deals that favor other companies over his, so what am i missing?

7e 1 month ago

xAI is bleeding tons of cash, and they don’t have the ability to use the compute for robotaxis, which have been a disaster. Grok monetization is also low.

momo26 1 month ago

Does an expansion of computing power on this scale imply that computing capital is displacing model architecture as the true moat in AI competition?

zombot 1 month ago

When they are doing Colossus (The Forbin Project), I wonder who will be building Samaritan (Person of Interest)?

up2isomorphism 1 month ago

Will this solve it making up numbers it never sees? Unlikely.

ramon156 1 month ago

How come a company like Anthropic has invested in photonic computing? If it's good enough for Boeing, I'd assume they would at least invest in it. qc-LPU100 seems worth it for some matrix mult, if it can be proven its O(n) (at least less than O(n^3).

labrador 1 month ago

Musk said Anthropic Claude was woke DEI until he said it wasn't. It must be hard for Musk fans to keep up.

vessenes 1 month ago
I mean … they’re mostly sitting on their SpaceX stock and rubbing their hands gleefully. What do they have to keep up with?
Disclosure: former (small) SX investor.
- labrador 1 month ago
  
  You just made my sub text the text. Money makes the world go around. Who cares about values when there's money to be made?

itrunsdoomguy 1 month ago

They should set up a Doom server there.

ReptileMan 1 month ago

War makes strange bedfellows.

jgalt212 1 month ago

Too bad Enron is still not around. They'd have some real fun with today's electrical markets.

zitterbewegung 1 month ago

Seems like either Grok is being shut down or it will be "powered by anthropic" soon.

brookst 1 month ago
Grok’s whole thing is being irresponsible and edgy. Can’t s ee Anthropic going for that, can’t see Grok’s customer base accepting an AI that refuses to do nonconsensual porn.
- londons_explore 1 month ago
  
  Model fine-tunes to change the 'edgy' tone are very cheap and easy to do.
  Grok could easily be powered by Claude in just a few weeks engineering time.
  
  1 reply →

ChicagoDave 1 month ago

Sigh.

I’d hoped Anthropic would steer clear of blatantly unethical practices but here they go in bed with that guy and his horribly damaging data center.

sunaookami 1 month ago

Anthropic works with Palantir, they are all but "ethical" lol.
signatoremo 1 month ago

I suppose you're also using Chinese models, who all have to be at the behest of the Communist party. China also have "horribly damaging" coal power plants. Do they acquire training data ethically? Do they censor? Be honest with yourself. Your concerns have nothing to do with ethic.
oezi 1 month ago
Yes, quite a bad reputation hit for Anthropic to get in bed with Musk. Tesla lost half its marketshare in Europe because Elon is meddling in our democracy, is promoting right wing populist parties and supporting Trump in general.
- burnerRhodov3 1 month ago
  
  Europe has a hugely shrinking demographic, and is actively decoupling from the United States. There's almost no innovation and you are making the mistake of getting in bed with China and constantly suing our tech companies.
  Europe needs to focus more on Europe and less US politics.
  
  5 replies →
- MagicMoonlight 1 month ago
  
  No, they’ve lost market share because they haven’t released a new product in years. Meanwhile China is releasing hundreds of excellent models into the European market which are better than Tesla’s.
  
  3 replies →

nelox 1 month ago

I predict SpaceX will subsume Anthropic at some point.

zenai666 1 month ago

[flagged]

varalaakshay 1 month ago

[flagged]

zhangdake 1 month ago

[dead]

poelzi 1 month ago

Nobody uses grok except checking shit on X At least the hardware is not idle

fasbg1 1 month ago

I though Claude is too woke. Musk has posted that at least 50 times in the last year.

But booking outrageous rental fees as fake AI revenue ahead of the SpaceX IPO apparently takes precedence.

energy123 1 month ago

He posted that AI will kill everyone, then stopped when he started his own AI company.
He tried to steal the OpenAI charity, then started complaining about someone else stealing the OpenAI charity after he failed.
He attacks European democracies about free speech, but is a compliant little censor for Turkey, India and other non-European countries.
0xy 1 month ago
Anthropic is paying real cash, how is it fake revenue?
- 123aad 1 month ago
  
  Fake __AI__ revenue. Maybe Hetzner should build Colossus4, rent it out and book it as AI revenue instead of hardware rental revenue and get a P/E of 100.
  
  9 replies →
Rover222 1 month ago
He talked with the Anthropic team, and his concerns lessened. It's actually a good thing to be able to change one's mind.
- SilverElfin 1 month ago
  
  You’re falling for the Musk shell game. He just says whatever is convenient at any given moment. FSD. Funding secured. I talked to the Claude guys.
  He didn’t change his mind about anything. It’s just that he wants to prop up SpaceX’s IPO as much as he can. Plus if he hits various targets, he unlocks more shares for himself, I think.
  
  1 reply →
tmp10423288442 1 month ago

The eternal truth: money talks, bullshit walks
thelastgallon 1 month ago
Musk will buy Anthropic and fix the wokeness. It was done before.
- gdiamos 1 month ago
  
  Don’t put it past Dario to buy spaceX
  
  1 reply →

throwatdem12311 1 month ago

Cool. Can they make Claude not absolute dogwater then?