Comment by ollin

13 hours ago

Specifically it looks like he's planning to extend the ideas from https://github.com/karpathy/autoresearch into a larger effort towards recursive training improvement [1]:

> Excited to welcome Andrej to the Pretraining team! He'll be building a team focused on using Claude to accelerate pretraining research itself. I can’t think of anyone better suited to do it — looking forward to what we build together!

[1] https://x.com/nickevanjoseph/status/2056760504949842219

16 comments

ollin

stingraycharles 7 hours ago

Am I the only one who wasn’t particularly impressed by AutoResearch? If you looked at what the agent was actually doing, it was just tuning parameters mostly, not really trying different novel approaches.

I couldn’t help myself but consider this mostly a very inefficient variant of hyperparameter optimization, but someone correct me if I’m wrong, I may be looking at this too pessimistic.

lacker 6 hours ago

I didn't dig into what the actual repository was doing, but personally, I took some inspiration from the idea after reading about it and realizing that I might have been underestimating the ability of LLMs. I put a bit more work into a performance harness I was using locally and just set some agents to brainstorming and they did seem to find some great stuff. So I don't really have a stance one way or another on this specific repo, but the general idea seems like a really good one.
druub 1 hour ago

Ever since AlphaEvolve - the idea that if you build a harness which can evaluate solutions and give LLMs a database where they can keep storing their work and then sample from it - they do find non-trivial solutions over time leaning from their own past ideas.
It is the ultimate manifestation of test-time scaling. I think karpathy just popularised it.
clbrmbr 6 hours ago
Karpathy embedded within an organization is way more impressive than him out on his own with hot takes and little projects. I hope he does great things for Anthropic.
- stingraycharles 3 hours ago
  
  Absolutely, I wasn’t saying that him being at Anthropic wasn’t going to be effective, I just think his little projects wouldn’t be very interesting if his name wasn’t attached to them.
teravor 5 hours ago
> Am I the only one who wasn’t particularly impressed by AutoResearch?
isn't it just a nerfed AlphaEvolve? https://arxiv.org/abs/2506.13131
DesaiAshu 5 hours ago

Inefficient variants with $100m+ worth of compute will still probably outperform the best team of researchers

triyambakam 8 hours ago

I guess we must expect it at this point. But funny that has model written tokens like ’ instead of '

4ashz 6 hours ago

More like he'll blog and tweet about using Claude and get gullible software engineers to buy Claude subscriptions and work on their own obsolescence while paying for it.

Many people are still deluded and think he is the same person who wrote the informal AI tutorials in plain html. He isn't, he is selling stuff now.

mrandish 2 hours ago

I'm as jaded as can be but I think Anthropic is now beyond the point where they'd place much value on farming Karpathy's name recognition. I'm sure they considered it an extra plus in his hiring package but they wouldn't do the level of comp package he'd want if they didn't believe the odds were decent that he'll contribute serious value.
Sure, it can always not work out but that's no more a risk with him than any high-profile hire who doesn't really need the money and will always have other options.
bonoboTP 6 hours ago
What is he selling? How is this time different compared to when he was at OpenAI or at Tesla? You could say he was shilling those products too. I don't see any shift. He's still posted free in depth YouTube videos recently.
- 23998h 6 hours ago
  
  > What is he selling?
  Is that a serious question? He already promoted vibe coding and AI hype. Now he is literally there to promote Anthropic and its IPO price.
  When he was at OpenAI it wasn't overtly commercial yet. At Tesla he had a way lower profile. Now he is the vibe coding Jesus for deluded software engineers. The impact is much larger.
  
  2 replies →