Comment by Traster

13 hours ago

Karpathy is probably one of the biggest names in AI, I do wonder where he fits now. He's sort of bounced around Tesla back to OpenAI back to independent. He sort of left OpenAI before it really hit the inflection point, and he was at Tesla for a long time and they didn't really deliver what they wanted on the AI side. Now he's bounced around a few places. I understand that the leaders in this market play this silly game of trying to buy up the names like trading cards but I wonder what this turns into.

i wouldn't be surprised if he just becomes a glorified marketer for anthro.

im also going to guess that whatever research he does would be free roam research that primarily serves to market the fact that claude was able to help perform the research.

the visible stuff he's been working on has been mostly agent soft skills. off the top of my head is autoresearch and his the wiki knowledge stuff. nothing particularly groundbreaking, but has helped devs expand their understanding of the utility that these models can provide.

not a diss to andrej i know he's reading this now

  • I think you are underestimating both the value of both projects (autoresearch and personal wiki) just because they are simple. I see both POCs for continuous learning / optmization on the harness layer, which in my opinion is a very interesting direction.

    I think Andrej has the experience (and now ressources) to productionize this research into something very interesting.

    p.s. called it

    > Karpathy will help launch a new team focused on using Claude itself to accelerate pretraining research — an increasingly important frontier as AI companies race to automate parts of AI development. (https://www.axios.com/2026/05/19/anthropic-openai-karpathy-a...)

    • No, these are developed off of the assumed uses of the models (predictive autofiller) rather than their actual, cognitive and potential industrial use (developing large scale frameworks for industrial production, automating systems that normally require human monitoring), and uses that we have not yet discovered, because we have not figured out all the constraints and limitations of these models. If Karpathy was in the game like he used to be, he would be on real product. Right now he’s probably so lost by the very thing he helped create that he is stuck doing these mini projects for his own personal interest, without anyone really critically engaging with his work.

    • Just because something is not ground breaking that doesn't mean that technology path isn't valuable.

    • Those projects are a complete joke. Neither of them were even original, people have been playing around with those ideas for well over a year.

      They just became "famous" because Karpathy is effectively an AI celebrity, so he could throw shit at a wall and post it on X and it would get 10k Github stars.

      But seriously, people have been using the models to tweak hyperparemeters, or using LLMs to help create a second brain using markdown or json files or 100X other combinations of files, for a long time already.

      1 reply →

  • > just becomes a glorified marketer

    That implies Karpathy is either dumb or desperate and he is neither of those by a long shot.

    • I don't think that's the parents implication.

      Generally, when a "good" developer has a huge public presence and reputation, that's quite valuable to a company when they're competing in a tough space. Many a time, more so than the (very high) technical skill of the developer in question.

      I've seen large funded companies gather good popular developers like pokemon cards and just have them go around give talks and write blog posts. It creates an aura around them which makes things like hiring, fund raising etc. much easier.

      So, it's not really a statement about Karpathy himself. It's more about the company hiring him.

      1 reply →

    • > That implies Karpathy is either dumb or desperate

      This kind of thing happens to big names in software all the time. Carmack going to Facebook is a prime example - he joined with the idea of using all those resources to build world-changing tech, and instead he ended up headlining conferences, and fighting a losing battle against the corporate types who were put in charge of Oculus.

      3 replies →

    • No it doesn’t? It matches his skills to the lab’s needs. Karpathy is a media personality, manager, and educator far more than he is a hands-on researcher.

      4 replies →

    • He already stated his motivation a few months ago in an interview with Dwarkesh - basically saying that he might join one of the big labs, for a while, to keep in touch with frontier research.

      Andrej seems like a great guy, but him joining Anthropic feels a bit like a transactional relationship (rich old guy marries hot young chick). Anthropic get a "glorified marketer", and he gets a front row seat at SOTA LLM dev 2026. I don't think they hired him expecting he's going to change the direction/pace of their research.

      2 replies →

    • > That implies Karpathy is either dumb or desperate and he is neither of those by a long shot.

      No it implies that he is more valuable for being famous than the hands-on work he can produce. This is the IC endgame

    • I don’t think it does. I think it’s better phrased that he is marketing rather than a marketer. He can do whatever he wants to do, in return Anthropic gets to say “hey, this guy works with us!”

      1 reply →

    • I don't know anything about this person, but want to point out that renown and validation is something that most (all?) humans crave. That doesn't make them dumb or desperate, it makes them normal.

    • CEOs also get paid millions and billions to do nothing. Sooooo...

      "Improve yourself, no mistakes" in a loop. Woooah sooo revolutionary...

    • Being a singular influencer in this space, at this time, may be more valuable than a lot of successful VC-backed startups over the last few decades.

    • Andrej is a smart guy. You don't get into Stanford for grad school without that.

      But he has always been known for his communication rather than his research. He got famous by putting out a (very well made) course on machine learning that was available to the public. Since graduating he hasn't exactly delivered on revolutionary new stuff at the businesses that employed him but he has continued to be extremely good at communicating thoughts about the current and future state of AI. Businesses want that and he knows that he can deliver that.

    • he's not dumb or desperate compared to the average person, but it's very possible to be dumb and desperate compared to the delusional promises and outsized amounts of money in the industry. Manages to make smart people look extremely stupid every day.

    • Anyone who would voluntarily work for Musk when he went obviously has things going on that aren't great.

  • It's also hard to any hard research on your own without resources. At best a few gpus can only go so far right now.

  • yes stop kidding yourself that he is going in as a tech leader in terms of providing technical innovation..at that stage its your persona that matters not the tech (sure I think Anthropic is going to listen to his advice..but its a transactional marketing win primarly)

    his value to Anthropic is his influence..he has over 2 million followers, and value is that he is the Top influencer for AI right now, like it or not. just like Selena Gomez might be for top for women age 21-29...

    Every AI nerd I know reposts his (very thoughtful posts and projects mind you) like religon

  • > i know he's reading this now

    meanwhile in the real world:

      claude --permission-mode=auto --model=opus -p '/onboard --user=karpathy'

    • expectation: in the real world the CLI will be replaced by an agent prompt and to get to the shell you'll have to ask 'get me bash dammit'

He may not be a brilliant researcher, but he is a brilliant teacher. I am glad he is joining Anthropic so he can stay up to date with the next round of things that he will teach :)

  • It is a pity we don't hear more about the truly brilliant researchers.

    All we hear is Altman, Musk, ...

    • > It is a pity we don't hear more about the truly brilliant researchers.

      Reason? What is the value of that other than entertainment? And it's not in the interest of companies to make celebrities that then are poach targets (if they can avoid they would yes there are exceptions as noted elsewhere in this thread).

      And if you did 'hear' (via articles) to what extent what was said even be correct vs. a writer just fluffing things up to the max.

      Tech is not sports where you can actually see the superlatives and know that the person who praise is being lavished on actually won or threw or caught and so on. (Or even music where you can hear it and see the stadium that is packed with fans..)

The self drive on my Tesla is damn near perfect. I haven’t driven my car in around 6 months.

  • FWIW while Karpathy was at Tesla he was basically working on the vision component. The actual driving component (using vision as an input) was originally all C++. They may have started migrating parts of the driving component from C++ to neural networks while Karpathy was there, but most of it happened after he left in 2022, with the big switch being FSD 12 in 2024. User reports from before/after FSD 12 are like night and day.

    • I was never convinced by the "vision only" approach - I don't see the point of throwing out or refusing to have additional data from other sensors.

      I suppose that with modern ML they can just toss it in the blender and reap the benefits ...

      1 reply →

he'd probably be a great face for developer relations or whatever Antropic calls the role

Tesla self driving works. I don’t know if Karpathy deserves credit for that or not.

  • Tesla self driving kind of works. In a very similar way to how it kind of worked back in 2016. It's better than it was in 2016, don't get me wrong. But even today they haven't solved the problem and Karpathy left in 2022. And other companies notably have actually surpassed Tesla over that time. I don't think anyone could reasonably say he walked away in 2022 because he thought the job was finished.

  • Yes, no driver needed at all. Your Tesla makes money for you while you sleep at home.

> He sort of left OpenAI before it really hit the inflection point

Sorry I'm out of the loop... What inflection point are you referring to?

  • When OpenAI was founded, the mission was to develop AI, but nobody (anywhere) knew how to do AI, so OpenAI did ML research on games instead, which is what DeepMind was doing (with Google's perceived AI/ML dominance being the raison d'etre for OpenAI, and Google having just bought DeepMind). This was the era when Karpathy was at OpenAI.

    Around the time Karpathy left, Ilya Sutskever, another OpenAI founder, started playing with Google's new "Transformer" architecture, which was the beginning of the "GPT" series, GPT-1, GPT-2 and eventually ChatGPT (GPT 3.5 + RLHF). In retrospect OpenAI's early Transformer experiments and GPT-1 was the inflection point that moved OpenAI from a company that wanted to build AI, as soon as anyone else did, to one that was actually doing so, although I think it would be revisionist for anyone to claim they knew what they were doing at the time. The early GPT-1 and GPT-2 papers read more like "wow, this is a bit unexpected, look at all of the things it can do!".

  • Karpathy left OpenAI in 2017 for Tesla, came back from Tesla in 2023 and left again in 2024.

    So pretty sure the original poster is talking about 2017.

  • GPT-1 presumably, which was released a year after he left. Prior to focusing on GPT, OpenAI was pursuing a lot of research directions.

  • Well Karpathy left in 2017, and all the sort of commercial stuff didn't happen till a while later - for example they set up the structure to take external money in 2019 and that's obviously the point at which they'd found the pathway that justified doing massive training runs and all that. So Karpathy was out very early (left at the point that Musk thought OpenAI had basically failed).

Or they collude by hiring each others engineers as a way to create manageable competition and information sharing outside their fiduciary duty to shareholders.

I somehow felt he, along with Andrew Ng, are very few well-known AI experts that are left behind on the money side during the AI-gets-me-super-rich crazy time, unfortunately.

  • Andrew Ng has been investing in AI startups for almost a decade, I would be very surprised if this rising tide left him behind.

  • I can't speak for Andrew Ng - but my take is he did out of pure altruism - love. just in terms of advancing free education e.g coursera & the free machine learning courses etc he brought to the masses.

    not everyone does things to be rich.

Some people are good at developing the sciences. Others are good at developing commercial products.

And tesla is not a good place for science development. Tesla is structured from narcissistic mindset: results driven, cynical, and position-based. This doesn’t bode well for long term sciences.

I dont see how he could be helping anthropic

I read this as a bad sign for Anthropic. Relying yet again on more hype instead of improving products.

OpenAI’s hiring recently has been much stronger, whether through luck or structure. The “no-name” guys have actual taste. I love that. I don’t care that they’re no-names.

I don’t know Karpathy personally, I won’t speak bad about a man I don’t know. I hope he makes CC better. I just read this as hype. My point is that there’s nothing he has that an empowered no-name product manager doesn’t. It’s like Alex Wang at Meta. That acq didn’t redeem Meta. They actually lost LeCun. Where’s Llama today?

Regardless of what Anthropic’s share price is, OpenAI has been fucking killing it recently. I don’t take particular pleasure in saying that, i’ve been a google and gemini guy for years

My lens is meritocratic. My experience is as an extremely heavy user of both company’s full suite of products in the range of 5 digits per month. My interest is better products not hype.

  • >OpenAI’s hiring recently has been much stronger, the guys have actual taste.

    Can you cite specifics? "I won't speak bad about someone, but also won't speak good about others" resulted in a comment that seems to contribute nothing

    • I judge them from a meritocratic lens.

      I’m hoping Karpathy will make Claude Code better, in the meantime I’m super happy seeing a small product manager like Tibo fucking crushing it on Codex

      5 replies →

  • It feels like these companies are constantly going back and forth on who has the best product constantly. It's such a dynamic time with how fast they are both working.

  • OpenAI seems to be dumping a LOT of money into marketing on social media at least.

    • To be fair, Mythos is probably one of the most significant marketing pushes in the industry in both impact and investment.

      I am sure there is an element of reality in it's capabilities, but there's also a significant amount of "We don't have the compute to handle this at scale", and "look look, we have the best model. It's so good that you can't even compare it to other models. That is how good we are."

      4 replies →

    • Anthropic as well. The private equity partnerships for guaranteed users are going to make their numbers look great.

  • Curious what you mean by killing it? Products? Model quality?

    • Dude, both! Codex is going to eat Openclaw… i don’t love saying that.

      What codex is a few steps away from doing is changing fundamentally a lot of workflows.

      Remote codex with their computer use is basically you at your computer doing things, 24/7.

      Then they added gpt images 2.0

      what codex can do, in a few more product iterations, is show you visually side by side “would you prefer this (A) or that (B)” in a series of questions. This is what some open source researchers have been up to. That’s no longer guessing.

      I’m not trying to hype a company i have no stake in, but they’ve been killing it.

      It’s extremely compute intensive, but also very satisfying.

      2 replies →

  • really - what am i missing?

    • It just feels like more hype instead of product focus.

      Example 1, just from top of my mind, Composer 2.5 released today. Go look at their benchmark.

      Composer 2.5 and Opus 4.7 ranked around the same, meanwhile gpt-5.5 was miles ahead.

      You wouldn’t have caught me dead using a gpt model 2 years ago

  • This is true for all the UASanian frontier model owners

    They are all going to get their lunch eaten by the Chinese.

    In the USA with access to most of the world's capital, they've succumbed to the temptation of "bigger, faster, harder"

    Whilst the Chinese, with enough capital only, have had to think.

    The Chinese models are already miles ahead on cost/inference basis and will probably pass all the USAnian companies in five years

    The age of UASnian engineering dominance are coming to an end.

    Let's all hope she goes quietly - not at the moment

I mean, you would think that all those people he killed as the person in charge of deploying knowingly dangerously defective self-driving software for profit would have had a impact. But executives seem to just skate on killing customers to line their own pockets these days. Just "following orders" I guess.

  • He deployed, not just developed?

    • Yes, he was [1] director of AI and Autopilot Vision at Tesla, directly poached and reporting to Elon Musk on the most important headline feature of Tesla directly managed by Elon Musk.

      He had both the technical and executive authority to determine if the product was fit for customer usage. He had direct executive responsibility for the product on the road between 2017-2022.

      If he, the lead architect and executive responsible felt the product was dangerous and then he was overridden, he can not get away with claiming he was “just following orders”, he had a moral duty to not sign-off or quit otherwise he is clearly complicit in deploying a dangerous product for his own self-enrichment.

      When people talk about engineering ethics, this is literally a completely uncontroversial textbook example. The only way you accept this is if you do not want ethics in engineering.

      Furthermore, he was extremely hireable with numerous job opportunitys available to him. He would not be destitute or even particularly worse off if he did quit for ethical reasons. Any self-preservation defense is also invalid.

      [1] https://techcrunch.com/2017/06/20/tesla-hires-deep-learning-...

      5 replies →

His goal could simply be to learn SOTA architectures.

When rumors started that GPT-4 design would be kept secret, he likely wanted to know what architecture it would be. Perhaps he left Tesla, waited out the non-compete clause, and joined OpenAI to learn its details.

When Mythos dropped, there were hints that it had a new architecture. He might similarly want to know how it works.

Either way, there is enough cross-lab hiring that those secrets eventually get known, but only by the labs.