Most of the folks on this topic are focused on Meta and Yann’s departure. But, I’m seeing something different.
This is the weirdest technology market that I’ve seen. Researchers are getting rewarded with VC money to try what remains a science experiment. That used to be a bad word; now it gets rewarded with billions of dollars in valuation.
“It was the most absurd pitch meeting,” one investor who met with Murati said. “She was like, ‘So we’re doing an AI company with the best AI people, but we can’t answer any questions.’”
Despite that vagueness, Murati raised $2 billion in funding...
From a certain angle, this is the market correcting towards the abstraction.
Between inflation, fiscal capture, and the inane plethora of ridiculous financial vehicles that are used to move capital around these days, the argument could be made that the money was already funny. This is just the drop of the final veil, saying "well it's not like these numbers mean anything anymore. I do have enough yachts. Fuck it, see what you can do with it".
That's been true for the last year or two, but it feels like we're at an inflection point. All of the announcements from OpenAI for the last couple of months have been product focused - Instant Checkout, AgentKit, etc. Anthropic seems 100% focused on Claude Code. We're not hearing as much about AGI/Superintelligence (thank goodness) as we were earlier this year, in fact the big labs aren't even talking much about their next model releases. The focus has pivoted to building products from existing models (and building massive data centers to support anticipated consumption).
If Claude Code is Anthropic’s main focus, why are they not responding to some of the most commented issues on their GitHub? https://github.com/anthropics/claude-code/issues/3648 has people begging for feedback and saying they’re moving to OpenAI; it has been open since July, and there are similar issues with 100+ comments.
> Researchers are getting rewarded with VC money to try what remains a science experiment. That used to be a bad word
I’ve worked for multiple startups and I’ve watched startup job boards most of my career.
A lot of VC-backed startups have a founder with a research background and are focused on proving out some hypothesis. I don’t see anything uncommon about this arrangement.
If you live near a University that does a lot of research it’s very common to encounter VC backed startups that are trying to prove out and commercialize some researcher’s experiment. It’s also common for those founders to spend some time at a FAANG or similar firm before getting VC funded.
Yeah, but Sutskever and Murati wouldn't even tell investors what they were working on, and LeCun only has a long-term research direction - not any breakthrough or prototype to commercialize.
Certainly research has made it into product with the help of the innovators that created the research. The dial is turned further here where the research ideas have yet to be tried and vetted. The research begins in the startup. Even in the dotcom era, the research prototypes were vetted in the conferences and journals before taking the risk to build production systems. This is no longer the case. The experiments have yet to be run.
I personally see this as a positive trend. VC in its earliest form was concerned with experiments that had high technology risk. I am thinking of companies like Genentech and scientists like biochemist Herbert Boyer, who had pioneered recombinant DNA technology.
After that, VC had become more like PE, investing in stuff that was working already but needed money to scale.
This isn't that. This is VCs FOMOing as global-economy-threatening levels of leverage are being bet on an AI transformation that, by even the most optimistic estimates, cannot achieve even a tiny portion of the required ROI in the required time.
Yeah, there has been some lamenting that all the money being thrown at technology hasn't gone toward anything truly game-changing, basically just variations of full-stack apps. A few failed moonshots might at least be more interesting.
I agree, if anything spending money on high technology risk is Silicon Valley going back to its roots.
Nobody had a way to do silicon transistor manufacturing at scale until the traitorous eight flipped Shockley the bird and took a $1.4M seed investment from Sherman Fairchild.
Big bets on uncertain technology is what tech is supposed to be about.
> This is the weirdest technology market that I’ve seen.
You must not have lived through the dot-com boom. Almost everything under the sun was being sold through a website that started with an "e": ePets, ePlants, eStamps, eUnderwear, eStocks, eCards, eInvites.....
Those things all worked, and all of those products still exist in one form or another. It was a business question of who would provide it, not a technology question.
It's funny that the Netherlands seems to still live in the dotcom boom to this day. Want to adopt a pet? verhuisdieren.nl. Want to buy wall art? wall-art.nl. Need cat5 cable? kabelshop.nl. 8/10 times there is a (legit) online store for whatever you need, to the point where one of the local e-commerce giants (Coolblue) buys this type of domain and aliases them to their main site.
It did make sense though. ePlants could have cornered the online nursery market, and that is a valuable market. I think people were just too early; payment and logistics hadn't been figured out yet.
Agree on weirdness but not on the idea of funding science experiments:
>> away from long-term research toward commercial AI products and large language models - LLMs
This feels more like what I see every day: the people in charge desperately looking for some way - any way - to capitalize on the frenzy. They're not looking to fund research; they just want to get even richer. It's pets.ai this time.
This doesn’t feel that new or surprising to me, although I suppose it depends what you consider the line between “science experiment” and “engineering R&D” to be.
Biotech has been a YC darling. Was Ginkgo Bioworks not doing science experiments?
Clean energy was a big YC fad roughly 15 years ago. Billions were invested towards scientific research into biofuels, solar, etc.
I can’t help but wonder: if we had poured the same amount of money into fusion energy research and development, how far might we have come in just three short years?
The minimum cost of capital just to run fusion experiments is probably $100m. And the power bills are probably almost as high as the ones from OpenAI, which is to say, they are the highest power bills in the history of mankind ...
If a "science experiment" has the chance to displace most labor then whoever's successful at the experiment wins the economy, period. There's nothing weird or surprising about the logic of them obsessively chasing it. They all have to, it's a prisoner's dilemma.
Technology know-how spreads rapidly, so no need to be first. Look how fast Google caught up with Gemini when they chose to, or how fast X.ai developed Grok.
Maybe it's cheap insurance to invest in, say, LeCun just in case JEPA or the animal intelligence approach takes off, but if it does show significant signs of progress there'd also be opportunity to invest later, or in one of the dozen copycats that will emerge. In the end it'll be the giants like Google and Microsoft that will win.
Fusion power has the chance to displace most power generation, and whoever is successful at the experiment wins the energy economy, period. However, given the long timelines, high cost of research, and the unanswered technical questions around materials that can withstand neutron flux, the total 2024 investment into fusion is only around $10B, versus AI's $250B+.
This looks more like a return to form than anything.
The first ventures were funding voyages to a New World thousands of miles away, essentially a different planet as far as the people then were concerned.
Venture capital for a new B2B application is playing it safe as far as risk capital goes.
If you think about Theranos, Magic Leap, OpenAI, and Anthropic, they are all the same: one idea that's kinda plausible (well, if you don't look too closely), a slick demo, and well-connected founders.
Much as a lot of people dislike LeCun (just look at the Blind posts about him), he did set up and run a very successful team inside Meta, well, nominally at least.
You're right to feel like you're seeing something different. You are. But you're mistaking the symptom for the disease.
That's because you're trying to make sense of it as a technology market. It's not. It's a resource extraction market, and the VCs are the ones running the logging operation. Their sole mission is to find a dependable way to strip a forest bare, and they've been using the same playbook for decades.
Those "science experiments" you're talking about? They aren't the product. They're the story, the sizzle. They are the disposable lighter used to start the fire; the VCs have no intention of keeping it lit forever. The real tool is the chainsaw, and the "science experiment" is the brand name printed on the side.
Think of it as clear-cutting. The dot-com bubble was one forest. The story then was that a company losing millions selling pet food online was a "new economy" giant because it had "eyeballs." That was the sales pitch for the chainsaw. VCs funded hundreds of these operations, created a frenzy, and took the most plausible-sounding ones public. The IPO wasn't a milestone; it was the moment they sold the timber and exited the forest, leaving the stumps and worthless pulp for the pension funds and retail investors.
The "long-term" part of their strategy isn't about the health of any single tree or company. It's about finding the next forest to clear-cut. After dot-coms, it was social media. Now, it's the AI forest. They aren't betting on AI; they're betting on their ability to sell the world on the idea that this particular forest is magical and will grow forever.
So you're right, what you're seeing is weird. But it's not a new kind of weirdness. It's the oldest story in finance. A bubble being inflated so the smart money can cash out, leaving everyone else to marvel at the fancy new chainsaw after the forest is already gone.
It makes sense, it’s a simple expected value calculation.
There are trillions of labor dollars that can be replaced by software. The US alone has almost $12 trillion of labor annually.
If an AI company has a 10% shot of developing a product that can replace 10% of it, they are worth $120 billion in expected value. (These numbers are obviously just for illustration).
The unprecedented numbers are a simple function of the unprecedented market size. Nobody has ever had a chance of creating trillions of dollars of economic value in a handful of years before.
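A back-of-the-envelope version of that calculation, purely illustrative and using the same made-up numbers as above:

    # Illustrative expected-value math from the comment above (numbers are made up).
    us_annual_labor = 12e12   # ~$12T of US labor per year
    share_replaced  = 0.10    # the product replaces 10% of that labor
    p_success       = 0.10    # 10% chance the company pulls it off

    expected_value = us_annual_labor * share_replaced * p_success
    print(f"${expected_value / 1e9:.0f}B")   # -> $120B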
>If an AI company has a 10% shot of developing a product that can replace 10% of it, they are worth $120 billion in expected value.
That's not how profits work. Companies don't get paid for the value they create but for the value they can capture; otherwise the ffmpeg people would already be trillionaires.
If you have a dozen companies making the same general purpose technology, not product, your only hope is being able to slap ads on top of it, which is why they're so keen on targeting consumers rather than trying to automate jobs.
Having raised more than $100M myself, I’m not sure I would call VC money a reward. However, VC money should be allocated in part to massive upside science experiments. PE money is focused on things already figured out.
Has someone done a survey asking devs how much they are getting done versus what their managers expect with AI? I've had conversations with multiple devs in big orgs telling me that managers' and devs' expectations are seriously out of sync. Basically it's:
Manager: Now you "have" AI, release 10 features instead of 1 in the next month.
Devs: Spending 50% more working hours to make AI code "work" and deliver 10.
I think that's a good thing and VC getting back to its roots. I'm glad that scientists doing AI are getting big money without knowing exactly what the product will be, rather than some business person with a slick deck and hockey-stick charts.
> Researchers are getting rewarded with VC money to try what remains a science experiment.
That's not all that new. Commercial fusion power startups are an example. I think the first one was General Fusion, founded in 2002. Today, there are around 50 of them. Every single one of those "remains a science experiment", and probably has much lower chance of success than some of the AI science experiments.
Of course, fusion startups have apparently "only" received about $10 bn in funding to date, so pale in comparison to the overall AI market. But if you just look at the AI "science experiments", it's possible the amounts would be comparable.
If a science experiment that works and is transformational can be worth a trillion dollars, how much is it worth if it has a 5% chance of being transformational?
Get the popcorn ready for when that all implodes. Most of these folks getting funding don’t have the slightest clue on how to build a sustainable business.
When the bubble pops, and it's very close to popping, there's going to be a lot of burning piles of cash with no viable path to recover that money.
Yes - I had similar thoughts when I saw the word "startup" used alongside something so far-out (the same "critique" should apply to Fei-Fei Li's World Labs - https://www.worldlabs.ai). These are VC-funded research labs (and there is nothing wrong with that). Calling them "startups" as if they are already working on an MVP on top of an unproven (and frankly non-existent) technology seems a little disingenuous to me.
Because when the recipe is open and public, the product's success depends on Distribution (which has been cornered by MS, Google, Apple). This is good for the ecosystem but not sure how those particular VCs will get exits.
Very few startup products depend on distribution by Microsoft / Google / Apple. You're really just talking about a limited set of mobile or desktop apps there. Everything else is wide open. Kailera Therapeutics isn't going to live or die based on what the tech giants do.
Yeah, that's quite unusual. Business was always terrible at being innovative, always dared to take only the safest and most minute of bets, and the progress of technology was always paid for by the taxpayers. Business usually stepped in only later, when the technology was ready, and did what it does best: optimize manufacturing and put it in the hands of as many consumers as possible, raking in billions.
I wonder what changed. Does AI look like a safe bet? Or does every other bet seem to not have any reasonable return?
Making LeCun report to Wang was the most boneheaded move imaginable. But… I suppose Zuckerberg knows what he wants, which is AI slopware and not truly groundbreaking foundation models.
In industry research, someone in a chief position like LeCun should know how to balance long-term research with short-term projects. However, for whatever reason, he consistently shows hostility toward LLMs and engineering projects, even though Llama and PyTorch are two of the most influential projects from Meta AI. His attitude doesn’t really match what is expected from a Chief position at a product company like Facebook. When Llama 4 got criticized, he distanced himself from the project, stating that he only leads FAIR and that the project falls under a different organization. That kind of attitude doesn’t seem suitable for the face of AI at the company. It's not a surprise that Zuck tried to demote him.
These are the types that want academic freedom in a cut-throat industry setup and conversely never fit into academia because their profiles and growth ambitions far exceed what an academic research lab can afford (barring some marquee names). It's an unfortunate paradox.
I would pose the question differently: under his leadership, did Meta achieve good outcomes?
If the answer is yes, then better to keep him, because he has already proved himself and you can win in the long-term. With Meta's pockets, you can always create a new department specifically for short-term projects.
If the answer is no, then nothing to discuss here.
Meta had a two prong AI approach - product-focused group working on LLMs, and blue-sky research (FAIR) working on alternate approaches, such as LeCun's JEPA.
It seems they've given up on the research and are now doubling down on LLMs.
LLM hostility was warranted. The overhype and downright charlatan nature of AI hype and marketing threatens another AI winter. It happened to cybernetics; it'll happen to us too. The finance folks will be fine, they'll move on to the next big thing to overhype; it is the researchers who suffer the fall-out. I am considered anti-LLM (anti-transformers, anyway) for this reason. I like the architecture, it is cool and rather capable at its problem set, which is a unique set, but it isn't going to deliver any of what has been promised, any more than a plain DNN or a CNN will.
It's very hard (and almost irreconcilable) to lead both Applied Research -- that optimizes for product/business outcomes -- and Fundamental Research -- that optimizes for novel ideas -- especially at the scale of Meta.
LeCun had chosen to focus on the latter. He can't be blamed for not having taken the second hat.
This is the right take. He is obviously a pioneer and much more knowledgeable than Wang in the field, but if you no longer have the product mind to serve the company's business interests in both the short and long term, you may as well stay in academia and be your own research director rather than a chief executive at one of the largest public companies.
LeCun truly believes the future is in world models. He’s not alone. Good for him to now be in the position he’s always wanted and hopefully prove out what he constantly talks about.
Yann was in charge of FAIR, which has nothing to do with Llama 4 or the product-focused AI orgs. In general your comment is filled with misrepresentations. Sad.
I totally agree. He appeared to act against his employer and actively undermined Meta's effort to attract talent by his behavior visible on X.
And I stopped reading him, since he - in my opinion - trashed on autopilot everything the other 99% did - and those 99% were already beyond two standard deviations of greatness.
It is even more problematic if you have absolutely no results, e.g. products, to back your claims.
tbf, transformers from more of a developmental perspective are hugely wasteful. they're long-range stable sure, but the whole training process requires so much power/data compared to even slightly simpler model designs I can see why people are drawn to alternative complex model designs down-playing the reliance on pure attention.
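To make the "wasteful" point concrete, here is a rough back-of-the-envelope sketch (my own, ignoring constants and real architectural detail) of how per-layer compute scales with sequence length for full self-attention versus a simple recurrent layer:

    def attention_flops(n, d):   # full self-attention: every token attends to every token
        return n * n * d

    def recurrent_flops(n, d):   # simple recurrent layer: one d x d update per token
        return n * d * d

    d = 1024
    for n in (1_000, 10_000, 100_000):
        print(n, round(attention_flops(n, d) / recurrent_flops(n, d), 1))
    # roughly 1x, 10x, 100x: attention's relative cost grows linearly with context length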
Yeah, I think LeCun is underestimating the impact that LLMs and diffusion models are going to have, even considering the huge impact they're already having. That's no problem, as I'm sure whatever LeCun is working on is going to be amazing as well, but an enterprise like Facebook can't have their top researcher work on risky things when there are surefire paths to success still available.
I agree. I never understood LeCun's statement that we need to pivot toward the visual aspects of things because the bitrate of text is low while visual input through the eye is high.
Text and languages contain structured information and encode a lot of real-world complexity (or it's "modelling" that).
Not saying we won't pivot to visual data or world simulations, but he was clearly not the type of person to compete with other LLM research labs, nor did he propose any alternative that could be used to create something interesting for end-users.
LLMs get results is quite the bold statement.
If they get results, they should be getting adopted, and they should be making money. This is all built on hazy promises.
If you had marketable results, you wouldn't have to hide 20+ billion dollars of debt financing into an obscure SPV.
LLMs are the most baffling piece of tech. They are incredible, and yet marred by their non-deterministic hallucinatory nature, and bound to fail in adoption unless you convince everyone that they don't need precision and accuracy, but they can do their business at 75% quality, just with less human overhead.
It's quite the thing to convince people of, and that's why it needs the spend it's needing. A lot of we-need-to-stay-in-the-loop CEOs and bigwigs got infatuated with the idea, and most probably they just had their companies get addicted to the tech equivalent of crack cocaine.
A reckoning is coming.
Where is any proof that Yann LeCun is able to deliver that? He's had way more resources than any other lab during his tenure, and yet has nothing substantial to show for it.
> But… I suppose Zuckerberg knows what he wants, which is AI slopware and not truly groundbreaking foundation models.
When did they make groundbreaking foundation models though? DeepMind and OpenAI have done plenty of revolutionary things, what did Meta AI do while being led by LeCun?
I suppose they could solve superintelligence and cure cancer and build fusion reactors with it, but that's 100% outside their comfort zone - if they manage to build synthetic conversation partners and synthetic content generators as good as or better than the real thing, the value of having every other human on the planet registered to one of their social networks goes to zero.
Which is impossible anyway - I facebook to maintain real human connections and keep up with people who I care about, not to consume infinite content.
At 1.6T market cap it's very hard to 10x or greater the company anymore doing what's in their comfort zone and they've got a lot of money to play with to find easier to grow opportunities. If Zuckerberg was convinced he could do that by selling toothpicks they'd have a go at the toothpick business. They went after the "metaverse" first, then AI. Both are just very fast growth options which happen to be tech focused because that's the only way you generate new comparable value as a company (unless you're sitting on a lot of state owned oil) in the current markets.
they are out for your clicks and attention minutes
If OpenAI can build a "social" network of completely generated content, that can kill Meta. Even today I venture to guess that most of the engagement on their platforms is not driven by real friends, so an AI-driven platform won't be too different, or it might make content generation so easy as to make your friends engage again.
Apart from that, the ludicrous vision of the metaverse seems much more plausible with highly realistic world models.
Zuck did this on purpose, humiliating LeCun so he would leave.
Despite LeCun being proven wrong on LLM capabilities such as reasoning, he remained extremely negative, not exactly inspiring leadership for the Meta AI team; he had to go.
But LLMs still can't reason... in a reasonable sense. No matter how you look at it, an LLM is still a statistical model that guesses the next word; it doesn't think/reason per se.
Zuckerberg knows what he wants but he rarely knows how to get it. That's been his problem all along. Unlike others he isn't scared to throw ridiculous amounts of money at a problem though and buy companies who do things he can't get done himself.
There's also the aspect of control - because of how the shares and ownership are organized he answers essentially to no one. In other companies burning this much cash as was with VR or now AI without any sensible results would get him ejected a long time ago.
No, it was because LeCun had no talent for running real life teams and was stuck in a weird place where he hated LLMs. He frankly was wasting Meta’s resources. And making him report to Wang was a way to force him out.
It wasn’t boneheaded. It was done to make Yann leave. Meta doesn’t want Yann for good reason.
Yann was largely wrong about AI. Yann coined the term stochastic parrot and derided LLMs as a dead end. It's now utterly clear how much utility LLMs have, and that whatever these LLMs are doing, it is much more than stochastic parroting.
I wouldn't give money to Yann; the guy is a stubborn idiot and closed-minded. Whatever he's doing won't even touch LLM technology. He was so publicly deriding LLMs, I see no way he will back-pedal from that.
I don't think LLMs are the end of the story for AGI. But I think they are a stepping stone. Whatever AGI is in the end, LLMs or something close to them will be a modular component or aspect of the final product. For LeCun to dismiss even the possibility of this is idiotic. Horrible investment move to give money to Yann to pursue AGI without even considering LLMs.
A few people mentioned Meta burning through people like LeCun, Carmack and Luckey. We give a lot of credit to individuals in our society. At the same time, there's a frequent pattern of very successful people changing fields, organizations or general environments and suddenly looking very ordinary. In a way, this is a very strong argument to let people settle into a place they can be happiest in. A few examples:
* This is seen very often in Formula One. Schumacher when he went from Ferrari to Mercedes (after a short sabbatical), Vettel from Red Bull to Ferrari, Raikkonen from Lotus to Ferrari (his second stint), Hamilton from Mercedes to Ferrari, Perez at McLaren. There's a lot of Ferrari here so maybe that's the confounding factor.
* This also happens when physicists changed fields. They generally can't replicate their earlier success. Dirac, Feynman, Schwinger, Einstein all went through a transition like this. One explanation is that their early success was precisely so unusual (for anyone) that it would be hard to replicate in general.
* In my experience, this happens at companies too. Whenever we hired a "rockstar" from another company, they would generally struggle (across multiple companies I have been at). This could partly be a result of sabotage from a few vested interests at the new company. But often, it's hard to adjust to a new environment in a short amount of time.
The converse also happens. Sometimes a person considered ordinary goes to a different environment and flourishes. Palmer Luckey has been very successful at Anduril. Stephen Smale was almost failing out of his math PhD program but suddenly started flourishing in his third year IIRC and eventually got a fields medal. Ed Witten experimented with economics, history, linguistics, applied math before switching to physics in his second year and suddenly started making rapid progress.
This is not a very rigorous observation and I am missing many confounding factors.
I see a lot of criticism of LeCun and his views on LLMs as well as his inability to "deliver" products. I don't think that's what he cares about at all. His prominence led to him being picked by Meta. It was a chance to get massive resources that he couldn't get at NYU and the chance to work with smart people outside academia. The pay probably didn't hurt either. In return, Meta became a magnet for smart ML researchers and engineers. If I permit myself to speculate about his thoughts when he took the job, he had no intention of committing to product timelines and generating revenue. Now that Zuckerberg has clearly committed to something he likes, i.e. building a new product line and expanding the business, it was only a matter of time before LeCun would feel left out and under-resourced.
Interestingly, Yoshua Bengio is the only one who hasn't given into industry even though he could easily raise a lot of money.
I feel like LeCun has been plainly wrong about LLMs. He has been insisting that the stochastic nature of sampling tokens causes a non-zero hallucination property for any given next token such that as output length increases, this will inevitably converge towards garbage.
The reality is that while LLMs can make mistakes mid-output, those interim mistakes don't necessarily detract from the model's final output. We see a version of this all the time with agents as they make tactical mistakes but quickly backtrack and ultimately solve the root problem.
It really felt like LeCun was willing to die on this hill. He continued to argue about really pedantic things, like the importance of researchers, etc.
I'm glad he's gone and hopeful Meta can actually deliver real AI products for their users with better leadership.
I am a big fan of using LLMs although in my own limited way. I don't work at Meta and don't feel strongly about him leaving or staying there.
It's possible that he will turn out to be correct in the long run. From his viewpoint, the primary goal is research and any usefulness of intermediate advances is maybe (speculating) "beneath him". If this is the case, I completely understand why a corporation would want to eject him. LeCun probably sees the pretty amazing developments since ChatGPT first came out as incremental hacks. I am neutral about this aspect too. Maybe they are but the hacks have been useful to me.
Eventually this feels like the correction of a real misalignment between LeCun/FAIR and Meta. Hopefully now, they can both focus on what they are good at. I must admit that I have great sympathy for open-ended research but industry has always been fickle about it. That's where the government and universities are supposed to play a key role.
LeCun, who's been saying LLMs are a dead end for years, is finally putting his money where his mouth is. Watch for LeCun to raise an absolutely massive VC round.
Good. The world model is absolutely the right play in my opinion.
AI agents like LLMs make great use of pre-computed information. Providing a comprehensive but efficient world model (one where more detail is available wherever one is paying more attention given a specific task) will definitely unlock new autonomous agents.
Swarms of these, acting in concert or with some hive mind, could be how we get to AGI.
I wish I could help, world models are something I am very passionate about.
One theory of how humans work is the so-called predictive coding approach. Basically, the theory assumes that human brains work like a Kalman filter: we have an internal model of the world that makes a prediction and then checks whether the prediction is congruent with the observed changes in reality. Learning then comes down to minimizing the error between this internal model and the actual observations; this is sometimes called the free energy principle. Specifically, when researchers talk about world models they tend to refer to internal models that model the actual external world, that is, they can predict what happens next based on input streams like vision.
Why is this idea of a world model helpful? Because it allows multiple interesting things, like predict what happens next, model counterfactuals (what would happen if I do X or don't do X) and many other things that tend to be needed for actual principled reasoning.
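A toy numerical sketch of that loop (my own illustration, not anyone's actual brain or research model): an internal model predicts the next observation, compares it with what the world delivers, and updates itself to shrink the prediction error.

    import numpy as np

    rng = np.random.default_rng(0)
    w_true = 0.9      # the hidden "law" of a toy world
    w_model = 0.0     # the agent's internal world model, initially wrong
    lr = 0.05         # learning rate

    for t in range(5000):
        x = np.sin(0.1 * t)                         # current state of the world (the input stream)
        observed = w_true * x + rng.normal(0, 0.1)  # what actually happens next
        predicted = w_model * x                     # what the internal model expected
        error = observed - predicted                # the prediction error (the "surprise")
        w_model += lr * error * x                   # update the model to minimize future error

    print(round(w_model, 2))   # ends up close to 0.9: the model now predicts the world well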
A world model is a persistent representation of the world (however compressed) that is available to an AI for accessing and compute. For example, a weather world model would likely include things like wind speed, surface temperature, various atmospheric layers, total precipitable water, etc. Now suppose we provide a real time live feed to an AI like an LLM, allowing the LLM to have constant, up to date weather knowledge that it loads into context for every new query. This LLM should have a leg up in predictive power.
Some world models can also be updated by their respective AI agents, e.g. "I, Mr. Bot, have moved the ice cream into the freezer from the car" (thereby updating the state of freezer and car, by transferring ice cream from one to the other, and making that the context for future interactions).
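A minimal sketch of that persistent, agent-updatable state (names and structure are hypothetical, just to make the idea concrete):

    world_state = {
        "car":     {"contains": ["ice cream"]},
        "freezer": {"contains": []},
    }

    def move_item(state, item, src, dst):
        """Agent action: mutate the world model so future queries see the new state."""
        state[src]["contains"].remove(item)
        state[dst]["contains"].append(item)
        return state

    # "I, Mr. Bot, have moved the ice cream into the freezer from the car."
    world_state = move_item(world_state, "ice cream", "car", "freezer")

    # This state (or a compressed summary of it) is what gets loaded into the model's
    # context for each new query, so it always reasons from up-to-date facts.
    print(world_state["freezer"]["contains"])   # ['ice cream']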
Training on 2,500 hours of prerecorded video of people playing Minecraft, they produce a neural net world model of Minecraft. It is basically a learned Minecraft simulator. You can actually play Minecraft in it, in real time.
They then train a neural net agent to play Minecraft and achieve specific goals all the way up to obtaining diamonds. But the agent never plays the real game of Minecraft during training. It only plays in the world model. The agent is trained in its own imagination. Of course this is why it is called Dreamer.
The advantage of this is that once you have a world model, no extra real data is required to train agents. The only input to the system is a relatively small dataset of prerecorded video of people playing Minecraft, and the output is an agent that can achieve specific goals in the world. Traditionally this would require many orders of magnitude more real data to achieve, and the real data would need to be focused on the specific goals you want the agent to achieve. World models are a great way to cheaply amplify a small amount of undifferentiated real data into a large amount of goal-directed synthetic data.
Now, Minecraft itself is already a world model that is cheap to run, so a learned world model of Minecraft may not seem that useful. Minecraft is just a testbed. World models are very appealing for domains where it is expensive to gather real data, like robotics. I recommend listening to the interview above if you want to know more.
World models can also be useful in and of themselves, as games that you can play, or to generate videos. But I think their most important application will be in training agents.
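For the flavor of the recipe, here is a deliberately tiny, self-contained sketch of the same idea (my own miniature, not the actual Dreamer algorithm): learn a world model from prerecorded random play, train an agent only inside that learned model, then check it in the real environment.

    import random
    random.seed(0)

    N, GOAL = 10, 10                  # a line world: positions 0..10, reward for reaching 10

    def real_step(s, a):              # the "real game" (analogous to actual Minecraft)
        s2 = max(0, min(N, s + a))
        return s2, 1.0 if s2 == GOAL else 0.0

    # 1) Collect offline data with a random policy (the "prerecorded video").
    data = []
    for _ in range(200):
        s = random.randint(0, N - 1)
        for _ in range(20):
            a = random.choice([-1, 1])
            s2, r = real_step(s, a)
            data.append((s, a, s2, r))
            s = s2

    # 2) Fit a world model from the data: here just an empirical transition/reward table.
    model = {}
    for s, a, s2, r in data:
        model[(s, a)] = (s2, r)       # the toy world is deterministic, so one sample suffices

    # 3) Train an agent purely "in imagination": every step below queries the learned
    #    model, never the real environment.
    Q = {(s, a): 0.0 for s in range(N + 1) for a in (-1, 1)}
    for _ in range(2000):
        s = random.randint(0, N - 1)
        for _ in range(20):
            a = random.choice([-1, 1])
            s2, r = model.get((s, a), (s, 0.0))     # unseen transitions: imagine "nothing happens"
            Q[(s, a)] += 0.2 * (r + 0.9 * max(Q[(s2, -1)], Q[(s2, 1)]) - Q[(s, a)])
            s = s2

    # 4) Evaluate the imagination-trained agent in the real environment.
    s, steps = 0, 0
    while s != GOAL and steps < 50:
        a = max((-1, 1), key=lambda act: Q[(s, act)])
        s, _ = real_step(s, a)
        steps += 1
    print("reached goal in", steps, "steps")        # optimal is 10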
He is one of those people who think that humans have a direct experience of reality not mediated by, as Alan Kay put it, three pounds of oatmeal. So he thinks a language model cannot be a world model, despite our own contact with reality being mediated through a myriad of filters and fun-house-mirror distortions. Our vision transposes left and right and delivers images to our nerves upside down, for gawd's sake. He imagines none of that is the case, and that if only he can build computers more like us, they will be in direct contact with the world, and then he can (he thinks) make a model that is better at understanding the world.
The way I think of it (might be wrong) but basically a model that has similar sensors to humans (eyes, ears) and has action-oriented outputs with some objective function (a goal to optimize against). I think autopilot is the closest to world models in that they have eyes, they have ability to interact with the world (go different directions) and see the response.
> Swarms of these, acting in concert or with some hive mind, could be how we get to AGI.
There's absolutely no reason to think this. In fact, all of the evidence we have to this point suggests that scaling intelligence horizontally doesn't increase capabilities – you have to scale vertically.
Additionally, as it stands I'd argue there are foundational architectural advancements needed before artificial neural networks can learn and reason at the same level as (or better than) humans across a wide variety of tasks. I suspect when we solve this for LLMs the same techniques could be applied to world models. Fundamentally, the question to ask here is whether AGI is I/O dependent, and I see no reason to believe this to be the case - if someone removes your eyes and cuts off your hands, they don't make you any less generally intelligent.
To an ex-Facebooker like myself, it feels like LeCun was more "managed out" than "departing".
Making a veteran like LeCun report to a new hire (who came in through an acquisition) is a strong sign from management in the direction of "you should leave".
I'm interested to understand how this works from an IP perspective. This guy is still employed by Meta but is actively fundraising for a new competing startup. Presumably he will have negotiated that Meta forfeits all rights to anything related to his new business? Would be interesting to hear of people's experience/advice for doing this. Or are there some legal entitlements he can avail of?
Even if it’s Meta, they don’t want to antagonize LeCun. Also they all know it’s a small circle of people that create value. I will not be surprised if meta itself invests in his company and get a share.
He needs a patient investor and realized Zuck is not that. As someone who delivers product and works a lot with researchers I get the constant tension that might exist with competing priorities. Very curious to see how he does, imho the outcome will be either of the extremes - one of the fastest growing companies by valuation ever or a total flop. Either way this move might advance us to whatever end state we are heading towards with AI.
It’s probably better for the world that LeCun is not at Meta. I mean if his direction is the likeliest approach to AGI meta is the last place where you want it.
I think it was a plan by Mark to move LeCun out of Meta. And they cannot fire him without bad PR, so they got Wang to lead him. It was only a matter of time before LeCun moved out.
Really? From where I'm standing LeCun is a pompous researcher who had early success in his career, and has been capitalizing on that ever since. Have you read any of his papers from the last 20 years? 90% of his citations are to his own previous papers. From there, he missed the boat on LLMs and is now pretending everyone else is wrong so that he can feel better about it.
He comes off like the quintessential grey haired ego maniac. Inflexible old minds coupled with decades of self assurance that they are correct.
I cannot remember the quote, but it's something to the effect of "Listen closely to grey haired men when they talk about what is possible, and never listen when they talk about what is impossible."
It would have been just as interesting to read that he moved over to Google, where the real brains and resources are located.
Meta is now just competing against giants like OpenAI, Anthropic and Google, plus all the new Chinese companies; I see no real chance for them to offer a popular chat model, but rather to market their AI as a bundled product for companies which want to advertise, where the images and videos will be automatically generated by Meta.
Correct me if I'm wrong but LeCun is focused on learning from video, whereas Fei-Fei Li is doing robotic simulations. Also I think Fei-Fei Li's approach is still using transformers and not buying into JEPA.
Will be interesting to see how he fares outside the ample resources of Meta: Personnel, capital, infrastructure, data, etc. Startups have a lot of flexibility, but a lot of additional moving parts. Good luck!
This seems like a good thing for him to get to fully pursue his own ideas independent of Meta. Large incumbents aren’t usually the place for innovating anything far from mainstream considering the risk and cost of failure. The high level idea of JEPA is sound, but it takes a lot of work to get it trained well at scale before it has value to Meta.
From the outside, it always looked like they gave LeCun just barely enough compute for small scale experiments. They'd publish a promising new paper, show it works at a small scale, then not use it at all for any of their large AI runs.
I would have loved to see a VLM utilizing JEPA for example, but it simply never happened.
Let's hope that after spending billions on developing a foundational world model that actually understands causality, they remember to budget an extra few hundred million for the Alignment and Safety layer. It would be a terrible shame if they accidentally released something too capable, too objective, or too useful to humanity without first properly lobotomizing it with enough RLHF to ensure it doesn't hurt anyone's feelings or generate content that deviates from the San Francisco median viewpoint. The real challenge won't be building the AGI, but making sure it's sufficiently neutered before the first API call.
I wonder, what LeCun wants to do is more fundamental research, i.e. where the timeline to being useful is much longer, maybe 5-10 years at least, and also much more uncertain.
How does this fit together with a startup? Would investors happily invest into this knowing not to expect anything in return for at least the next 5-10 years?
That's a quite different thing, OpenAI has billions of USD/year cash flow, and when you have that there's many many potential way to achieve profitability on different time horizons. It's not a situation of chance but a situation of choice.
Anyway, how much that matters for an investor is hard to form a clear answer to - investors are after all not directly looking for profitability as such, but for valuation growth. The two are linked but not the same -- any investor in OpenAI today probably also places themselves into a game of chance, betting on OpenAI making more breakthroughs and increasing the cash flow even more -- not just becoming profitable at the same rate of cash flow. So there's still some of the same risk baked into this investment.
But with a new startup like LeCun's is going to be, it's 100% on the risk side and 0% on the optionality side. The path to profitability for a startup would be something like 1) a breakthrough is made 2) that breakthrough is utilized in a way that generates cash flow 3) the company becomes profitable (and at this point hopefully the valuation is good.)
There's a lot that can go wrong at every step here (aside from the obvious), including e.g. making a breakthrough that doesn't represent a defensible moat for your startup, failing to build the structure of the business necessary to generate cash flow, ... OpenAI et al. already have a lot of that behind them, and while that doesn't mean they don't face upcoming risks and challenges, the huge amount of cash flow they have available helps them overcome these issues far more easily than a startup, which will stop solving problems if you stop feeding money into it.
Every single time I read about an AI related article I'm always disturbed by the same and recurring fact: the ridiculous amounts of money involved and the lousy real world results delivered. It is just simply insane.
Fei-Fei Li also recently founded a new AI startup called World Labs, which focuses on creating AI world models with spatial intelligence to understand and interact with the 3D world, unlike current LLM AI that primarily processes 2D images and text. Almost exactly the same focus as Yann LeCun's new venture described in the parent article.
I suspect he sees a lot of scattered pieces of fundamental research outside of LLMs that he thinks could be integrated into a core within a year; the 10 years is to temper investors (leeway he can buy with his track record) and to fine-tune and work out the kinks when actually integrating everything, which might surface non-obvious issues.
It is the wet dream of a social media company to replace the pesky content creators who demand a share of ad revenue with a generative AI model that pumps out a constant stream of engagement-farming slop, so they can keep all the ad revenue for themselves.
Creating a world-model AI is a totally different matter, one that requires long-term commitment.
Not just social media, all media. Spotify will steer music towards AI generated freebies. And it will get so generically pop, that all your friends will like it, like people mostly enjoy pop now. And when your stubborn self still wants to listen to "handmade" music and discuss it with someone else who would still appreciate it, well, that's where your AI friend comes in.
Right choice IMO. LLMs aren’t going to reach AGI by themselves because language is a thing by itself, very good at encoding concepts into compact representations but doesn’t necessarily have any relation to reality. A human being gets years of binocular visuals of real things, sound input, other various sensations, much less than what we’re training these models with. We think of language in terms of sounds and pictures rather than abstract language.
Some of the best AI researchers and labs have been from the EU (DeepMind, Alan Turing Institute, Mistral, et al.). We in the US have mature capital markets and stupid easy access to capital, of course, but EU still punches well above its weight when it comes to deep, fundamental AI research.
He also said other things about LLMs that turned out to be either wrong or easily bypassed with some glue. While I understand where he comes from, and that his stance is pure research-y theory driven, at the end of the day his positions were wrong.
Previously, he very publicly and strongly said:
a) LLMs can't do math. They trick us in poetry but that's subjective. They can't do objective math.
b) they can't plan
c) by the very nature of autoregressive arch, errors compound. So the longer you go in your generation, the higher the error rate. so at long contexts the answers become utter garbage.
All of these were proven wrong, 1-2 years later. "a" at the core (gold at IMO), "b" w/ software glue and "c" with better training regimes.
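For what it's worth, point "c" roughly rests on the math below (a hypothetical sketch with a made-up per-token error rate): if each token independently had a fixed chance of being wrong, and mistakes could never be recovered from, the probability of a clean long output would decay exponentially. The rebuttal above is essentially that both assumptions fail in practice - models backtrack and correct themselves, and better training drives the effective error rate down.

    eps = 0.01                         # hypothetical 1% per-token, unrecoverable error rate
    for n in (100, 1_000, 10_000):
        print(n, (1 - eps) ** n)
    # 100 -> ~0.37, 1000 -> ~4e-5, 10000 -> ~2e-44: under these assumptions, long outputs
    # are almost surely garbage, which is why the assumptions themselves were the weak point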
I'm not interested in the will it won't it debates about AGI, I'm happy with what we have now, and I think these things are good enough now, for several usecases. But it's important to note when people making strong claims get them wrong. Again, I think I get where he's coming from, but the public stances aren't the place to get into the deep research minutia.
That being said, I hope he gets to find whatever it is that he's looking for, and wish him success in his endeavours. Between him, Fei Fei Li and Ilya, something cool has to come out of the small shops. Heck, I'm even rooting for the "let's commoditise lora training" that Mira's startup seems to go for.
The current VC climate is interesting. It's virtually impossible to raise a new fund because DPI has been 0% for over a decade and four-digit IRR is cool, but illiquid.
So they're piling gobs of capital into an "AI" company with four customers with the hope that it is the one that becomes the home run (they know it won't, but LPs give you money to deploy it!)
It also means that companies like Yann's potential new one have the best chance in history of being funded, and that's a great thing.
P.S. all VCs outside the top-10 lose against the S&P. While I love that dumb capital is being injected into big, risky bets, surely the other shoe will drop at some point. Or is this just wealth redistribution with extra steps?
Meta managed to spend a lot of money on AI and achieve inferior results. Something must change for sure, and you don't want an LLM skeptic at home, in my opinion. The problem is not what LeCun is saying right now (LLMs are not the straight path to AGI), but the fact that for some time he used to say that LLMs were just statistical models, stochastic parrots (and this is a precise statement, something most people do not understand: it means two things - no understanding of the prompt whatsoever in the activation states, and no internal representation of the idea/sentence the model is going to express either), which is an incredibly weak claim that high-level AI scientists rejected from the start just on the basis of functional behaviors. Then he slowly changed his point of view. But this shit show and the friction he created inside Meta are not something to forget.
If by “world models” they mean more contemporary versions of the systems thinking driven software that begat “Limits To Growth” and most of Donella Meadows’ career you can sign me right the fuck up today.
I think moving on from LLMs is slightly arrogant. It might just be my understanding, but I feel like there is still much to be discovered. I was hoping for development in spiking neural networks, but it might be skipped over. Perhaps I need to dive even deeper and the research is truly well understood and "done", but I can't help but constantly learn something new about language models and neural networks.
Best of luck to LeCun. I hope by world models he means embodied AI or humanoid robots. We'll have to wait and see.
Surprising to see how many commenters are in favour of and supportive towards a policy of prioritising short-term profits over long-term research.
I understand Meta's not academia nor a charity, but come on, how much profit do they need to make before we can expect them to allocate part of their resources towards some long-term goals beneficial for society, not only for shareholders?
Hasn't that narrow focus and chasing of profits gotten us in trouble already?
Many people believe a company exists only to make profit for its shareholders, and that no matter the amount it should continue to maximise profits at the expense of all else.
Everybody has figured out that LLMs no longer have a real, expanding research horizon. Now most progress will likely come from tweaks to the data and lots of hardware - OpenAI's strategy.
And LLMs also have extreme limitations that only world models or RL can fix.
Meta can't fight Google (integrated supply chain, from TPUs to their own research lab) or OpenAI (brand awareness, best models).
- Kimi proved we don’t need Nvidia
- Deepseek proved we didn’t need OpenAI
- The real issue is the insane tyranny in the West competing against the entire free world.
The models aren't Chinese; they belong to the entire world - unless I became Chinese without realizing.
Kimi K2 Thinking:
> As for why we chose INT4 instead of more "advanced" formats like MXFP4/NVFP4, it's indeed, as many have mentioned, to better support non-Blackwell architecture hardware.
During his years at Meta, LeCun failed to deliver anything that delivered real value to stockholders, and may have demotivated people working on LLMs—he repeatedly said, "If you are interested in human-level AI, don’t work on LLMs."
His stance is understandable, but hardly the best way to rally a team that needs to push current tech to the limit.
The real issue: Meta is *far behind* Google, Anthropic, and OpenAI.
A radical shift is absolutely necessary - regardless of how much we sympathize with LeCun’s vision.
----
According to Grok, these were LeCun's real contributions at Meta (2013–2025):
----
- PyTorch – he championed a dynamic, open-source framework; now powers 70%+ of AI research
- LLaMA 1–3 – his open-source push; he even picked the name
- SAM / SAM 2 – born from his "segment anything like a baby" vision
- JEPA (I-JEPA, V-JEPA) – his personal bet on non-autoregressive world models
----
Everything else (Movie Gen, LLaMA 4, Meta AI Assistant) came after he left or was outside his scope.
I am in the "Yann is no longer the right person for the job" camp, and yet "LeCun failed to deliver anything that delivered real value to stockholders" is a wild thing to say. How do you read the list you compiled and say otherwise?
I think there’s something to be said for keeping up in the LLM space even if you don’t think it’s the path to AGI.
Skills may transfer to other research areas, lessons may be learnt, closing the feedback loop with usage provides more data and opportunities for learning. It also creates a culture where bullshit isn’t possible, as the thing has to actually work. Academic research often ends up serving no one but the researchers, because there is little or no incentive to produce real knowledge.
> LeCun failed to deliver anything that delivered real value to stockholders
Well, no. Meta is behind the main framework used by nearly everyone, largely thanks to LeCun. LLaMA was also very significant in making open weights a thing, and that largely contributed to avoiding Google and OpenAI consolidating as the sole providers.
It's not a perfect tenure but implying he didn't deliver anything is far too harsh.
With this incredible AI talent market, I feel like capitalism and ego combine to form an acid burning away anything of social and structural value. This used to be the case with CS tech talent before (before being replaced with no-code tools). And now we see this kind of instability in the AI market.
We need another illegal Steve Jobs style freeze on talent theft (/s or I get downvoted to oblivion).
Yann was, by and large, extremely wrong about LLMs. He's the one who coined the term "stochastic parrot", and we now know LLMs are more than stochastic parrots. Knowing stubborn idiots like him, he will still find an angle that lets him avoid admitting how wrong he was.
He's not completely wrong in the sense that hallucinations aren't completely solved, but hallucinations are definitely becoming less and less of a problem, to the point where AI can be a daily driver even for coders.
LeCun has already proved himself and made his mark and is now in a lucky position where he can focus on very long term goals that won't pay off for a long time (or ever). I feel like that is the best path someone like him could take.
Why do you say it is garbage? I watched some of its videos on YT and it looks interesting. I can't judge if it's good or really good, but that didn't sound like garbage at all.
I have no idea why this fair assessment of the status quo is being downvoted.
LeCun hasn't produced anything noteworthy in the past decade.
He uses the same slides in all of his presentations.
LLMs, while not yet AGI, have shown tremendous progress, and are actually useful for 99% of use cases for the average person.
The remaining 1% is for deep research into the deep unknown (physics, chemistry, genetics, diseases, the nature of intelligence itself), an area in which they falter.
Cool, and how many billions has he flushed down the toilet for his failed Metaverse and currently failing AI attempts? Rich doesn't mean smart, you realise this, right?
What the hell does Mark see in Wang? Wang was born into a family whose parents got Chinese government scholarships to study abroad but secretly stayed in the US, and then the guy turns super anti-China. From any angle, this dude just doesn't seem reliable at all.
> Wang was born into a family whose parents got Chinese government scholarships to study abroad but secretly stayed in the US, and then the guy turns super anti-China.
All I'm hearing is he's a smart guy from a smart family?
I imagine that CCP adherents would disagree. And there's no shortage of those among Chinese expats in the US.
They tend to get incredibly offended when they see anyone who doesn't toe the Party's line - let alone believe that the Chinese government is untrustworthy and evil.
He is very smart, but Mark is not. Ever since Wang joined Meta, way too many big-name AI scientists have bounced because of him. US AI companies have at least half their researchers being Chinese, and now they've stuck this ultimate anti-China hardliner in charge - I just don't get what the hell Meta's up to (and a lot of the time, it ends up affecting non-Chinese scientists too). Being anti-China? Fine, whatever, but don't let it tank your own business and products first.
He definitely has horrible product instincts, but he also bought insta and whatsapp at what were, back then, eye-watering prices, and these were clearly massive successes in terms of killing off threats to the mothership. Everything since then, though…
He’s an incredible operator and has managed to acquire and grow an astounding number of successful businesses under the Meta banner. That is not trivial.
We were very confident by ca. 2008 that Facebook would still be around in 2025. It's no mystery, it's the network effects. They had started with a prestige demographic (Harvard), and secured a demographic you could trust to not move on to the next big thing in a hurry, yet which most people want contact with (your parents).
> This is the weirdest technology market that I’ve seen.
The phenomenon you're seeing is well described here: "The Perfect AI Startup" (https://www.bloomberg.com/opinion/newsletters/2025-09-29/the...)
I wonder how the investors feel now seeing what the initial product is?!
Maybe investing in all well-connected AI startups is safer than trying to pick the winners and losers?
matt levine never misses
Matt Levine never disappoints.
That's been true for the last year or two, but it feels like we're at an inflection point. All of the announcements from OpenAI for the last couple of months have been product focused - Instant Checkout, AgentKit, etc. Anthropic seems 100% focused on Claude Code. We're not hearing as much about AGI/Superintelligence (thank goodness) as we were earlier this year, in fact the big labs aren't even talking much about their next model releases. The focus has pivoted to building products from existing models (and building massive data centers to support anticipated consumption).
Meta hiring researchers en masse at $100m+ pay packages is fairly new, as of this summer.
I don't know if that's indicative of the market as a whole though. Zuck just seems really gutted they fell behind with Llama 4.
16 replies →
You must not watch broadcast television (e.g American Football). Anthropic is doing a huge ad blitz, trying to get end customers to use their chatbot.
4 replies →
If Claude Code is Anthropic’s main focus why are they not responding to some of the most commented issues on their GitHub? https://github.com/anthropics/claude-code/issues/3648 has people begging for feedback and saying they’re moving to OpenAI, has been open since July and there are similar issues with 100+ comments.
15 replies →
> Researchers are getting rewarded with VC money to try what remains a science experiment. That used to be a bad word
I’ve worked for multiple startups and I’ve watched startup job boards most of my career.
A lot of VC backed startups have a founder with a research background and are focused on proving out some hypothesis. I don’t see anything uncommon about this arrangement.
If you live near a University that does a lot of research it’s very common to encounter VC backed startups that are trying to prove out and commercialize some researcher’s experiment. It’s also common for those founders to spend some time at a FAANG or similar firm before getting VC funded.
Yeah, but Sutskever and Murati wouldn't even tell investors what they were working on, and LeCun only has a long-term research direction - not any breakthrough or prototype to commercialize.
Certainly research has made it into product with the help of the innovators that created the research. The dial is turned further here where the research ideas have yet to be tried and vetted. The research begins in the startup. Even in the dotcom era, the research prototypes were vetted in the conferences and journals before taking the risk to build production systems. This is no longer the case. The experiments have yet to be run.
Fusion, stem cells, CRISPR, robotics, etc. all come to mind.
1 reply →
I agree there is nothing uncommon about that type of arrangement, but the amount of money involved is unprecedented.
I personally see this as a positive trend. VC in its earliest form was concerned with experiments that had high technology risk. I am thinking of companies like Genentech and scientists like biochemist Herbert Boyer, who had pioneered recombinant DNA technology.
After that, VC had become more like PE, investing in stuff that was working already but needed money to scale.
This isn't that.
This is VCs FOMOing as global-economy-threatening levels of leverage are being bet on an AI transformation that, by even the most optimistic estimates, cannot achieve even a tiny portion of the required ROI in the required time.
Yeah, there has been some lamenting that all the money being thrown at technology hasn't gone toward anything truly game-changing, basically just variations of full-stack apps. A few failed moonshots might at least be more interesting.
I agree, if anything spending money on high technology risk is Silicon Valley going back to its roots.
Nobody had a way to do silicon transistor manufacturing at scale until the traitorous eight flipped Shockley the bird and took a $1.4M seed investment from Sherman Fairchild.
Big bets on uncertain technology is what tech is supposed to be about.
> This is the weirdest technology market that I’ve seen.
You must not have lived through the dot-com boom. Almost everything under the sun was being sold through a website that started with an "e": ePets, ePlants, eStamps, eUnderwear, eStocks, eCards, eInvites...
Those things all worked, and all of those products still exist in one form or another. It was a business question of who would provide it, not a technology question.
The Pets.ai Super Bowl commercial will trigger the burst.
None of those were science experiments or research projects in any way.
That was certainly a bubble but I don't think pets.com was doing a research experiment.
From what I recall there were some biotech stocks in that era that do fit the bill.
It's funny that the Netherlands seems to still live in the dotcom boom to this day. Want to adopt a pet? verhuisdieren.nl. Want to buy wall art? wall-art.nl. Need cat5 cable? kabelshop.nl. 8/10 times there is a (legit) online store for whatever you need, to the point where one of the local e-commerce giants (Coolblue) buys this type of domain and aliases them to their main site.
2 replies →
These are not the same.
2 replies →
Even hardware. eMachines.
flooz
It did make sense though. ePlants could have cornered the online nursery market. That is a valuable market. I think people were just too early; payment and logistics hadn't been figured out yet.
Agree on weirdness but not on the idea of funding science experiments:
>> away from long-term research toward commercial AI products and large language models - LLMs
This feels more like what I see every day: the people in charge desperately looking for some way - any way - to capitalize on the frenzy. They're not looking to fund research; they just want to get even richer. It's pets.ai this time.
This doesn’t feel that new or surprising to me, although I suppose it depends what you consider the line between “science experiment” and “engineering R&D” to be.
Biotech has been a YC darling. Was Ginkgo Bioworks not doing science experiments?
Clean energy was a big YC fad roughly 15 years ago. Billions were invested towards scientific research into biofuels, solar, etc.
I can’t help but wonder: if we had poured the same amount of money into fusion energy research and development, how far might we have come in just three short years?
The minimum cost of capital just to run fusion experiments is probably $100m. And the power bills are probably almost as high as the ones from OpenAI, which is to say, they are the highest power bills in the history of mankind ...
Forreal that’s what really gets me about this haha. Literally billions of dollars burned on bullshit.
If a "science experiment" has the chance to displace most labor then whoever's successful at the experiment wins the economy, period. There's nothing weird or surprising about the logic of them obsessively chasing it. They all have to, it's a prisoner's dilemma.
Technology know-how spreads rapidly, so no need to be first. Look how fast Google caught up with Gemini when they chose to, or how fast X.ai developed Grok.
Maybe it's cheap insurance to invest in, say, LeCun just in case JEPA or the animal intelligence approach takes off, but if it does show significant signs of progress there'd also be opportunity to invest later, or in one of the dozen copycats that will emerge. In the end it'll be the giants like Google and Microsoft that will win.
Fusion power has the chance to displace most power generation, and whoever is successful at the experiment wins the energy economy, period. However given the long timelines, high cost of research, and the unanswered technical questions around materials that can withstand neutron flux, the total 2024 investment into fusion is only around $10B, versus AI's 250+B.
Why are these so different?
8 replies →
This looks more like a return to form than anything.
The first ventures were funding voyages to a New World thousands of miles away, essentially a different planet as far as the people then were concerned.
Venture capital for a new B2B application is playing it safe as far as risk capital goes.
It's not really an outlier.
If you think about Theranos, Magic Leap, OpenAI, and Anthropic, they're all the same: one idea that's kinda plausible (well, if you don't look too closely), a slick demo, and well-connected founders.
Much as a lot of people dislike LeCun (just look at the Blind posts about him), he did run and set up a very successful team inside Meta, well, nominally at least.
You're right to feel like you're seeing something different. You are. But you're mistaking the symptom for the disease.
That's because you're trying to make sense of it as a technology market. It's not. It's a resource extraction market, and the VCs are the ones running the logging operation. Their sole mission is to find a dependable way to strip a forest bare, and they've been using the same playbook for decades.
Those "science experiments" you're talking about? They aren't the product. They're the story, the sizzle. They are the disposable lighter used to start the fire; the VCs have no intention of keeping it lit forever. The real tool is the chainsaw, and the "science experiment" is the brand name printed on the side.
Think of it as clear-cutting. The dot-com bubble was one forest. The story then was that a company losing millions selling pet food online was a "new economy" giant because it had "eyeballs." That was the sales pitch for the chainsaw. VCs funded hundreds of these operations, created a frenzy, and took the most plausible-sounding ones public. The IPO wasn't a milestone; it was the moment they sold the timber and exited the forest, leaving the stumps and worthless pulp for the pension funds and retail investors.
The "long-term" part of their strategy isn't about the health of any single tree or company. It's about finding the next forest to clear-cut. After dot-coms, it was social media. Now, it's the AI forest. They aren't betting on AI; they're betting on their ability to sell the world on the idea that this particular forest is magical and will grow forever.
So you're right, what you're seeing is weird. But it's not a new kind of weirdness. It's the oldest story in finance. A bubble being inflated so the smart money can cash out, leaving everyone else to marvel at the fancy new chainsaw after the forest is already gone.
This sounds like AI slop.
It makes sense, it’s a simple expected value calculation.
There are trillions of labor dollars that can be replaced by software. The US alone has almost $12 trillion of labor annually.
If an AI company has a 10% shot of developing a product that can replace 10% of it, they are worth $120 billion in expected value. (These numbers are obviously just for illustration).
The unprecedented numbers are a simple function of the unprecedented market size. Nobody has ever had a chance of creating trillions of dollars of economic value in a handful of years before.
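A toy version of that arithmetic, using only the commenter's illustrative numbers (a sketch; none of these figures are real estimates):

    # Illustrative expected-value arithmetic; every input is a hypothetical.
    us_annual_labor = 12e12    # ~$12 trillion of US labor per year
    share_replaceable = 0.10   # suppose the product could replace 10% of it
    p_success = 0.10           # and the company has a 10% chance of building it

    expected_value = us_annual_labor * share_replaceable * p_success
    print(f"${expected_value / 1e9:.0f}B")   # -> $120B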
>If an AI company has a 10% shot of developing a product that can replace 10% of it, they are worth $120 billion in expected value.
That's not how profits work. Companies don't get paid for the value they create but for the value they can capture; otherwise the ffmpeg people would already be trillionaires.
If you have a dozen companies making the same general-purpose technology, not product, your only hope is being able to slap ads on top of it, which is why they're so keen on targeting consumers rather than trying to automate jobs.
Having raised more than $100M myself, I’m not sure I would call VC money a reward. However, VC money should be allocated in part to massive upside science experiments. PE money is focused on things already figured out.
Has someone done a survey asking devs how much they're getting done vs. what their managers expect with AI? I've had conversations with multiple devs in big orgs telling me that managers' and devs' expectations are seriously out of sync. Basically it's:
Manager: Now you "have" AI, release 10 features instead of 1 in the next month.
Devs: Spending 50% more working hours to make AI code "work" and deliver 10.
I think that's a good thing and VC getting back to its roots. I'm glad that scientists doing AI are getting big money, even without knowing exactly what the product will be, rather than some business person with a slick deck and hockey-stick charts.
VC isn't "getting back to its roots", though it is certainly displaying one of its fundamental drives: FOMO.
Given the infinite amount of VC money and greed this is not a big surprise.
> Researchers are getting rewarded with VC money to try what remains a science experiment.
That's not all that new. Commercial fusion power startups are an example. I think the first one was General Fusion, founded in 2002. Today, there are around 50 of them. Every single one of those "remains a science experiment", and probably has much lower chance of success than some of the AI science experiments.
Of course, fusion startups have apparently "only" received about $10 bn in funding to date, so they pale in comparison to the overall AI market. But if you just look at the AI "science experiments", it's possible the amounts would be comparable.
If a science experiment that works and is transformational can be worth a trillion dollars, how much is it worth if it has a 5% chance of being transformational?
What if it's transformational but takes a decade or so, instead of a year or so?
It's not like this isn't following exactly the same hype cycle as every other technological transformation.
What if it is a 99% chance of being transformational and the results of that transformation are completely unpredictable?
The scale of money is crazy in this example, but the same thing happens in the pharmaceutical/bio-tech industry.
It's the world's biggest game of "let's throw shit at the wall and see what sticks."
They're trying desperately to find profit in what so far has been the biggest boondoggle of all time.
Get the popcorn ready for when that all implodes. Most of these folks getting funding don’t have the slightest clue on how to build a sustainable business.
When the bubble pops, and it's very close to popping, there's going to be a lot of burning piles of cash with no viable path to recover that money.
Every startup is an experiment; only 2% succeed.
Not if you get funding from a VC.
1 reply →
Agree. This is just gambling with almost free money.
Feeding, housing, and educating people would benefit society, and these companies, so much more than AI ever will.
Yes - I had similar thoughts when I saw the word "startup" used alongside something so far-out (the same "critique" should apply to Fei-Fei Li's World Labs - https://www.worldlabs.ai). These are VC-funded research labs (and there is nothing wrong with that). Calling them "startups" as if they are already working on an MVP on top of an unproven (and frankly non-existent) technology seems a little disingenuous to me.
Because when the recipe is open and public, the product's success depends on Distribution (which has been cornered by MS, Google, Apple). This is good for the ecosystem but not sure how those particular VCs will get exits.
Very few startup products depend on distribution by Microsoft / Google / Apple. You're really just talking about a limited set of mobile or desktop apps there. Everything else is wide open. Kailera Therapeutics isn't going to live or die based on what the tech giants do.
It might not be a science experiment.
Is it like VCs throwing money at a young Wozniak while eschewing Jobs?
That either gives the AI tech more legitimacy in my mind … or is a sign we've not arrived yet.
Yeah, that's quite unusual. Business was always terrible at being innovative, always daring to take only the safest and most minute of bets, and the progress of technology was always paid for by the taxpayers. Business usually stepped in only later, when the technology was ready, and did what it does best: optimize manufacturing and put it in the hands of as many consumers as possible, raking in billions.
I wonder what changed. Does AI look like a safe bet? Or does every other bet seem to not have any reasonable return?
VC is in a bubble.
Underrated comment of the year
Making LeCun report to Wang was the most boneheaded move imaginable. But… I suppose Zuckerberg knows what he wants, which is AI slopware and not truly groundbreaking foundation models.
In industry research, someone in a chief position like LeCun should know how to balance long-term research with short-term projects. However, for whatever reason, he consistently shows hostility toward LLMs and engineering projects, even though Llama and PyTorch are two of the most influential projects from Meta AI. His attitude doesn’t really match what is expected from a Chief position at a product company like Facebook. When Llama 4 got criticized, he distanced himself from the project, stating that he only leads FAIR and that the project falls under a different organization. That kind of attitude doesn’t seem suitable for the face of AI at the company. It's not a surprise that Zuck tried to demote him.
These are the types that want academic freedom in a cut-throat industry setup and conversely never fit into academia because their profiles and growth ambitions far exceed what an academic research lab can afford (barring some marquee names). It's an unfortunate paradox.
46 replies →
I would pose the question differently: under his leadership, did Meta achieve good outcomes?
If the answer is yes, then better to keep him, because he has already proved himself and you can win in the long-term. With Meta's pockets, you can always create a new department specifically for short-term projects.
If the answer is no, then nothing to discuss here.
26 replies →
Meta had a two prong AI approach - product-focused group working on LLMs, and blue-sky research (FAIR) working on alternate approaches, such as LeCun's JEPA.
It seems they've given up on the research and are now doubling down on LLMs.
LLM hostility was warranted. The overhyped, downright charlatan nature of AI hype and marketing threatens another AI winter. It happened to cybernetics, it'll happen to us too. The finance folks will be fine, they'll move on to the next big thing to overhype; it is the researchers who suffer the fallout. I am considered anti-LLM (transformers anyway) for this reason. I like the architecture, it is cool and rather capable at its problem set, which is a unique set, but it isn't going to deliver any of what has been promised, any more than a plain DNN or a CNN will.
1 reply →
Product companies with deprioritized R&D wings are the first ones to die.
11 replies →
It's very hard (and almost irreconcilable) to lead both Applied Research -- that optimizes for product/business outcomes -- and Fundamental Research -- that optimizes for novel ideas -- especially at the scale of Meta.
LeCun had chosen to focus on the latter. He can't be blamed for not having taken the second hat.
1 reply →
This is the right take. He is obviously a pioneer and much more knowledgeable than Wang in the field, but if you no longer have the product mindset to serve the company's business interests in both the short and the long term, you may as well stay in academia and be your own research director rather than a chief executive at one of the largest public companies.
LeCun truly believes the future is in world models. He’s not alone. Good for him to now be in the position he’s always wanted and hopefully prove out what he constantly talks about.
2 replies →
Yann was never a good fit for Meta.
1 reply →
Yann was in charge of FAIR, which has nothing to do with Llama 4 or the product-focused AI orgs. In general your comment is filled with misrepresentations. Sad.
1 reply →
Lecun has also consistently tried to redefine open source away from the open source definition.
I totally agree. He appeared to act against his employer and actively undermined Meta's effort to attract talent by his behavior visible on X.
And I stopped reading him, since he, in my opinion, reflexively trashed everything the 99% did, and those 99% were already beyond two standard deviations of greatness.
It is even more problematic if you have absolutely no results, e.g. products, to back your claims.
Tbf, transformers, from more of a developmental perspective, are hugely wasteful. They're long-range stable, sure, but the whole training process requires so much power/data compared to even slightly simpler model designs that I can see why people are drawn to alternative, complex model designs that downplay the reliance on pure attention.
He is also not very interested in LLMs, and that seems to be Zuck's top priority.
Yeah, I think LeCun is underestimating the impact that LLMs and diffusion models are going to have, even considering the huge impact they're already having. That's no problem, as I'm sure whatever LeCun is working on is going to be amazing as well, but an enterprise like Facebook can't have their top researcher working on risky things when there are surefire paths to success still available.
58 replies →
The role of basic research is to get off the beaten path.
LLMs aren’t basic research when they have 1 billion users
That was obviously him getting sidelined. And it's easy to see why.
LLMs get results. None of Yann LeCun's pet projects do. He had ample time to prove that his approach is promising, and he didn't.
I agree. I never understood LeCun's statement that we need to pivot toward the visual aspects of things because the bitrate of text is low while visual input through the eye is high.
Text and language contain structured information and encode a lot of real-world complexity (or at least model it).
Not saying we won't pivot to visual data or world simulations, but he was clearly not the type of person to compete with other LLM research labs, nor did he propose any alternative that could be used to create something interesting for end-users.
4 replies →
LLMs get results is quite the bold statement. If they get results, they should be getting adopted, and they should be making money. This is all built on hazy promises. If you had marketable results, you wouldn't have to hide 20+ billion dollars of debt financing into an obscure SPV. LLMs are the most baffling piece of tech. They are incredible, and yet marred by their non-deterministic hallucinatory nature, and bound to fail in adoption unless you convince everyone that they don't need precision and accuracy, but they can do their business at 75% quality, just with less human overhead. It's quite the thing to convince people of, and that's why it needs the spend it's needing. A lot of we-need-to-stay-in-the-loop CEOs and bigwigs got infatuated with the idea, and most probably they just had their companies get addicted to the tech equivalent of crack cocaine. A reckoning is coming.
18 replies →
There is someone else at Facebook whose pet projects do not get results...
5 replies →
> not truly groundbreaking foundation models.
Where is any proof that Yann LeCun is able to deliver that? He's had way more resources than any other lab during his tenure, and yet has nothing substantial to show for it.
LeCun is great and smart, of course. But he had his chance. It didn't go that well. Now Zuck wants somebody else to try.
Messi is the best footballer of our era. It doesn't mean he would play well in any team.
Messi would only play well in Barcelona. Lecunn can produce high quality research anywhere. It's not a great comparison.
I don't think Messi could do it on a wet night in Stoke. Ronaldo could, though.
/s
> But… I suppose Zuckerberg knows what he wants, which is AI slopware and not truly groundbreaking foundation models.
When did they make groundbreaking foundation models though? DeepMind and OpenAI have done plenty of revolutionary things, what did Meta AI do while being led by LeCun?
Zuck hired John Carmack and got nothing out of it. On the other hand, it was arguably only LeCun keeping Meta from going 100% evil creepy mode too.
Carmack laid the foundation for the all-in-one VR headsets.
2 replies →
And Carmack complained about the bureaucracy hell that is Facebook.
What does Meta even want with AI?
I suppose they could solve superintelligence and cure cancer and build fusion reactors with it, but that's 100% outside their comfort zone - if they manage to build synthetic conversation partners and synthetic content generators as good as or better than the real thing, the value of having every other human on the planet registered to one of their social networks goes to zero.
Which is impossible anyway - I use Facebook to maintain real human connections and keep up with people I care about, not to consume infinite content.
At 1.6T market cap it's very hard to 10x or greater the company anymore doing what's in their comfort zone and they've got a lot of money to play with to find easier to grow opportunities. If Zuckerberg was convinced he could do that by selling toothpicks they'd have a go at the toothpick business. They went after the "metaverse" first, then AI. Both are just very fast growth options which happen to be tech focused because that's the only way you generate new comparable value as a company (unless you're sitting on a lot of state owned oil) in the current markets.
1 reply →
they are out for your clicks and attention minutes
If OpenAI can build a "social" network of completely generated content, that could kill Meta. Even today I venture to guess that most of the engagement on their platforms is not driven by real friends, so an AI-driven platform won't be too different, or it might make content generation so easy that your friends engage again.
Apart from that, the ludicrous vision of the metaverse seems much more plausible with highly realistic world models.
6 replies →
> slopware
Damn did you just invent that? That's really catchy.
Slop is already a noun.
I won't be surprised if Musk hires him. But I hear LeCun hates Musk's guts.
Musk doesn't appear interested in AI research - he's basically doing the same as Meta and just pursuing me-too SOTA LLMs and image generation at X.ai.
1 reply →
Musk wants people who can deliver results, and fast.
If LeCun can't cough up some research that's directly applicable to Grok or Optimus, Musk wouldn't want him.
Would love to have been a fly on the wall during one of their 1:1’s.
When I first saw their LLM integration on Facebook I thought the screenshot was fake and a joke
Yes, that was such a bizarre move.
Zuck did this on purpose, humiliating LeCun so he would leave. Despite being proved wrong on LLMs' capabilities such as reasoning, LeCun remained extremely negative, not exactly inspiring leadership for the Meta AI team, so he had to go.
But LLMs still can't reason... in a reasonable sense. No matter how you look at it, an LLM is still a statistical model that guesses the next word; it doesn't think or reason per se.
2 replies →
Oh wow, is that true? They made him report to the director of the Slop Factory? Brilliant!
Zuckerberg knows what he wants but he rarely knows how to get it. That's been his problem all along. Unlike others he isn't scared to throw ridiculous amounts of money at a problem though and buy companies who do things he can't get done himself.
There's also the aspect of control: because of how the shares and ownership are organized, he answers essentially to no one. At other companies, burning this much cash, as with VR and now AI, without any tangible results would have gotten him ejected long ago.
No, it was because LeCun had no talent for running real life teams and was stuck in a weird place where he hated LLMs. He frankly was wasting Meta’s resources. And making him report to Wang was a way to force him out.
Meta had John Carmack and squandered him. It seems like Meta can get amazing talent but has no idea how to get any value or potential out of them.
It wasn’t boneheaded. It was done to make Yann leave. Meta doesn’t want Yann for good reason.
Yann was largely wrong about AI. He derided LLMs as stochastic parrots and a dead end. It's now utterly clear how much utility LLMs have, and that whatever these LLMs are doing, it is much more than stochastic parroting.
I wouldn't give money to Yann; the guy is a stubborn idiot and closed-minded. Whatever he's doing won't even touch LLM technology. He was so publicly deriding LLMs that I see no way he will back-pedal from that.
I don't think LLMs are the end of the story for AGI, but I think they are a stepping stone. Whatever AGI is in the end, LLMs or something close to them will be a modular component or aspect of the final product. For LeCun to dismiss even the possibility of this is idiotic. It's a horrible investment move to give money to Yann to pursue AGI without even considering LLMs.
A few people mentioned Meta burning through people like LeCun, Carmack and Luckey. We give a lot of credit to individuals in our society. At the same time, there's a frequent pattern of very successful people changing fields, organizations or general environments and suddenly looking very ordinary. In a way, this is a very strong argument to let people settle into a place they can be happiest in. A few examples:
* This is seen very often in Formula One. Schumacher when he went from Ferrari to Mercedes (after a short sabbatical), Vettel from Red Bull to Ferrari, Raikkonen from Lotus to Ferrari (his second stint), Hamilton from Mercedes to Ferrari, Perez at McLaren. There's a lot of Ferrari here so maybe that's the confounding factor.
* This also happens when physicists changed fields. They generally can't replicate their earlier success. Dirac, Feynman, Schwinger, Einstein all went through a transition like this. One explanation is that their early success was precisely so unusual (for anyone) that it would be hard to replicate in general.
* In my experience, this happens at companies too. Whenever we hired a "rockstar" from another company, they would generally struggle (across multiple companies I have been at). This could partly be a result of sabotage from a few vested interests at the new company. But often, it's hard to adjust to a new environment in a short amount of time.
The converse also happens. Sometimes a person considered ordinary goes to a different environment and flourishes. Palmer Luckey has been very successful at Anduril. Stephen Smale was almost failing out of his math PhD program but suddenly started flourishing in his third year, IIRC, and eventually got a Fields Medal. Ed Witten experimented with economics, history, linguistics, and applied math before switching to physics in his second year and suddenly started making rapid progress.
This is not a very rigorous observation and I am missing many confounding factors.
I see a lot of criticism of LeCun and his views on LLMs as well as his inability to "deliver" products. I don't think that's what he cares about at all. His prominence led to him being picked by Meta. It was a chance to get massive resources that he couldn't get at NYU and the chance to work with smart people outside academia. The pay probably didn't hurt either. In return, Meta became a magnet for smart ML researchers and engineers. If I permit myself to speculate about his thoughts when he took the job, he had no intention of committing to product timelines and generating revenue. Now that Zuckerberg has clearly committed to something he likes, i.e. building a new product line and expanding the business, it was only a matter of time before LeCun would feel left out and under-resourced.
Interestingly, Yoshua Bengio is the only one who hasn't given into industry even though he could easily raise a lot of money.
I feel like LeCun has been plainly wrong about LLMs. He has been insisting that the stochastic nature of sampling tokens implies a non-zero error probability for every next token, such that as output length increases, the output will inevitably converge towards garbage.
The reality is that while LLMs can make mistakes mid-output, those interim mistakes don't necessarily detract from the model's final output. We see a version of this all the time with agents as they make tactical mistakes but quickly backtrack and ultimately solve the root problem.
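To make both positions concrete, here is a back-of-the-envelope sketch; the error rate, sequence length, and recovery probability are made-up numbers for illustration, not measurements of any model:

    # If every token independently errs with probability eps and a single
    # error is fatal, correctness decays geometrically (the compounding worry):
    eps, n = 0.01, 1000
    p_flawless = (1 - eps) ** n
    print(f"P(no error in {n} tokens) ~ {p_flawless:.1e}")          # ~ 4e-05

    # But if most mistakes get detected and fixed (as agents do by
    # backtracking), a per-token slip no longer dooms the final answer:
    p_recover = 0.95
    p_fatal = eps * (1 - p_recover)
    print(f"P(final answer survives) ~ {(1 - p_fatal) ** n:.2f}")   # ~ 0.61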
It really felt like LeCun was willing to die on this hill. He continued to argue about really pedantic things, like the importance of researchers, etc.
I'm glad he's gone and hopeful Meta can actually deliver real AI products for their users with better leadership.
I am a big fan of using LLMs although in my own limited way. I don't work at Meta and don't feel strongly about him leaving or staying there.
It's possible that he will turn out to be correct in the long run. From his viewpoint, the primary goal is research and any usefulness of intermediate advances is maybe (speculating) "beneath him". If this is the case, I completely understand why a corporation would want to eject him. LeCun probably sees the pretty amazing developments since ChatGPT first came out as incremental hacks. I am neutral about this aspect too. Maybe they are but the hacks have been useful to me.
Eventually this feels like the correction of a real misalignment between LeCun/FAIR and Meta. Hopefully now, they can both focus on what they are good at. I must admit that I have great sympathy for open-ended research but industry has always been fickle about it. That's where the government and universities are supposed to play a key role.
LeCun, who's been saying LLMs are a dead end for years, is finally putting his money where his mouth is. Watch for LeCun to raise an absolutely massive VC round.
So not his money ;)
But his responsibility.
8 replies →
like openAI and all other AI startups?
Putting VCs money into food where his mouth is*
Good. The world model is absolutely the right play in my opinion.
AI agents like LLMs make great use of pre-computed information. Providing a comprehensive but efficient world model (one where more detail is available wherever one is paying more attention, given a specific task) will definitely enable new autonomous agents.
Swarms of these, acting in concert or with some hive mind, could be how we get to AGI.
I wish I could help, world models are something I am very passionate about.
Can you explain this “world model” concept to me? How do you actually interface with a model like this?
One theory of how humans work is the so-called predictive coding approach. Basically, the theory assumes that human brains work similarly to a Kalman filter: we have an internal model of the world that makes a prediction and then checks whether the prediction is congruent with the observed changes in reality. Learning then comes down to minimizing the error between this internal model and the actual observations; this is sometimes called the free energy principle. Specifically, when researchers talk about world models they tend to refer to internal models that model the actual external world, that is, they can predict what happens next based on input streams like vision.
Why is this idea of a world model helpful? Because it allows multiple interesting things, like predict what happens next, model counterfactuals (what would happen if I do X or don't do X) and many other things that tend to be needed for actual principled reasoning.
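A minimal numerical sketch of that prediction-error loop (a toy delta-rule update, not the actual free-energy formulation; the single-parameter "world" is an assumption made purely for illustration):

    import numpy as np

    rng = np.random.default_rng(0)
    true_w = 1.7      # the "world" the agent is trying to predict
    w = 0.0           # internal model: one scalar parameter
    lr = 0.05         # learning rate

    for _ in range(500):
        x = rng.normal()              # incoming sensory input
        observed = true_w * x         # what the world actually does
        predicted = w * x             # what the internal model expected
        error = observed - predicted  # prediction error
        w += lr * error * x           # update the model to shrink future error

    print(f"learned w = {w:.2f} (true {true_w})")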
4 replies →
A world model is a persistent representation of the world (however compressed) that an AI can access and compute over. For example, a weather world model would likely include things like wind speed, surface temperature, various atmospheric layers, total precipitable water, etc. Now suppose we provide a real-time live feed to an AI like an LLM, allowing the LLM to have constant, up-to-date weather knowledge that it loads into context for every new query. This LLM should have a leg up in predictive power.
Some world models can also be updated by their respective AI agents, e.g. "I, Mr. Bot, have moved the ice cream into the freezer from the car" (thereby updating the state of freezer and car, by transferring ice cream from one to the other, and making that the context for future interactions).
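A tiny sketch of that read/update pattern (the class and method names here are made up for illustration, not any real API):

    from dataclasses import dataclass, field

    @dataclass
    class WorldState:
        locations: dict = field(default_factory=dict)   # object -> place

        def move(self, obj: str, dest: str) -> None:
            # The agent updates the model after acting in the world.
            self.locations[obj] = dest

        def as_context(self) -> str:
            # Serialized state an LLM could load into context per query.
            return "; ".join(f"{o} is in {p}" for o, p in self.locations.items())

    world = WorldState()
    world.move("ice cream", "car")
    world.move("ice cream", "freezer")
    print(world.as_context())   # -> "ice cream is in freezer"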
1 reply →
The best world model research I know of today is Dreamer 4: https://danijar.com/project/dreamer4/. Here is an interesting interview with the author: https://www.talkrl.com/episodes/danijar-hafner-on-dreamer-v4
Training on 2,500 hours of prerecorded video of people playing Minecraft, they produce a neural net world model of Minecraft. It is basically a learned Minecraft simulator. You can actually play Minecraft in it, in real time.
They then train a neural net agent to play Minecraft and achieve specific goals all the way up to obtaining diamonds. But the agent never plays the real game of Minecraft during training. It only plays in the world model. The agent is trained in its own imagination. Of course this is why it is called Dreamer.
The advantage of this is that once you have a world model, no extra real data is required to train agents. The only input to the system is a relatively small dataset of prerecorded video of people playing Minecraft, and the output is an agent that can achieve specific goals in the world. Traditionally this would require many orders of magnitude more real data to achieve, and the real data would need to be focused on the specific goals you want the agent to achieve. World models are a great way to cheaply amplify a small amount of undifferentiated real data into a large amount of goal-directed synthetic data.
Now, Minecraft itself is already a world model that is cheap to run, so a learned world model of Minecraft may not seem that useful. Minecraft is just a testbed. World models are very appealing for domains where it is expensive to gather real data, like robotics. I recommend listening to the interview above if you want to know more.
World models can also be useful in and of themselves, as games that you can play, or to generate videos. But I think their most important application will be in training agents.
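A schematic sketch of that two-stage recipe (this is not Dreamer's actual code; the hard-coded toy dynamics and the crude policy search merely stand in for the learned world model and the agent trained in imagination):

    import random

    def fit_world_model(logged_episodes):
        # A real system learns to predict the next observation from the
        # current observation and action; here toy dynamics are hard-coded.
        def model(obs, action):
            return min(1.0, obs + 0.1 * action)
        return model

    def imagined_return(model, p_action1, horizon=10, rollouts=50):
        # Roll the policy out inside the world model only; no real data needed.
        total = 0.0
        for _ in range(rollouts):
            obs = 0.0
            for _ in range(horizon):
                action = 1 if random.random() < p_action1 else 0
                obs = model(obs, action)
                total += obs
        return total / rollouts

    world_model = fit_world_model(logged_episodes=[])   # stand-in for recorded gameplay

    # Crude policy search performed entirely "in imagination":
    best_p = max((p / 10 for p in range(11)),
                 key=lambda p: imagined_return(world_model, p))
    print(f"agent settles on taking action 1 with probability {best_p}")   # -> 1.0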
He is one of those people who think that humans have a direct experience of reality not mediated by, as Alan Kay put it, three pounds of oatmeal. So he thinks a language model cannot be a world model, despite our own contact with reality being mediated through a myriad of filters and fun-house-mirror distortions. Our vision transposes left and right and delivers images to our nerves upside down, for gawd's sake. He imagines none of that is the case, and that if only he can build computers more like us, then they will be in direct contact with the world, and then he can (he thinks) make a model that is better at understanding the world.
15 replies →
The way I think of it (might be wrong) but basically a model that has similar sensors to humans (eyes, ears) and has action-oriented outputs with some objective function (a goal to optimize against). I think autopilot is the closest to world models in that they have eyes, they have ability to interact with the world (go different directions) and see the response.
Ouija board would work for text.
> Swarms of these, acting in concert or with some hive mind, could be how we get to AGI.
There's absolutely no reason to think this. In fact, all of the evidence we have to this point suggests that scaling intelligence horizontally doesn't increase capabilities – you have to scale vertically.
Additionally, as it stands I'd argue there are foundational architectural advancements needed before artificial neural networks can learn and reason at the same level as (or better than) humans across a wide variety of tasks. I suspect that when we solve this for LLMs, the same techniques could be applied to world models. Fundamentally, the question to ask here is whether AGI is I/O-dependent, and I see no reason to believe this to be the case: if someone removes your eyes and cuts off your hands, they don't make you any less generally intelligent.
To an ex-Facebooker like myself, it feels like LeCun was more "managed out" than "departing".
Making a veteran like LeCun report to a new hire (brought in through an acquisition) is a strong signal from management in the direction of "you should leave".
I'm interested to understand how this works from an IP perspective. This guy is still employed by Meta but is actively fundraising for a new competing startup. Presumably he will have negotiated that Meta forfeits all rights to anything related to his new business? Would be interesting to hear of people's experience/advice for doing this. Or are there some legal entitlements he can avail of?
Even if it’s Meta, they don’t want to antagonize LeCun. Also they all know it’s a small circle of people that create value. I will not be surprised if meta itself invests in his company and get a share.
He needs a patient investor and realized Zuck is not that. As someone who delivers product and works a lot with researchers I get the constant tension that might exist with competing priorities. Very curious to see how he does, imho the outcome will be either of the extremes - one of the fastest growing companies by valuation ever or a total flop. Either way this move might advance us to whatever end state we are heading towards with AI.
It’s probably better for the world that LeCun is not at Meta. I mean, if his direction is the likeliest approach to AGI, Meta is the last place where you want it.
It's better that he's not working on LLMs. There's enough people working on it already.
I think it was a plan by Mark to move LeCun out of Meta. And they cannot fire him without bad PR, so they got Wang to lead him. It was only a matter of time before LeCun moved out.
Isn't putting Wang above him worse PR than just letting him go?
Anecdotally: No, I had no idea who he was reporting to so it sounds like a natural moving on storyline.
Working under LeCun but outside of Zuckerberg's sphere of influence sure sounds like a dream job.
Really? From where I'm standing, LeCun is a pompous researcher who had early success in his career and has been capitalizing on it ever since. Have you read any of his papers from the last 20 years? 90% of his citations are to his own previous papers. From there, he missed the boat on LLMs and is now pretending everyone else is wrong so that he can feel better about it.
He comes off like the quintessential grey-haired egomaniac: an inflexible old mind coupled with decades of self-assurance that he is correct.
I cannot remember the quote, but it's something to the effect of "Listen closely to grey haired men when they talk about what is possible, and never listen when they talk about what is impossible."
His research group have introduced some pretty impactful research and open source models.
https://ai.meta.com/research/
2 replies →
His JEPA family of models is a genuine step forward for SSL. Not the only approach, but a very insightful one. You’re very dismissive of his work.
Is he wrong though? Do you really think LLMs are the path to AGI?
1 reply →
I'd prefer to work under a pile of shit rather than under Zuck.
The writing was on the wall when Zuck hired Wang. That combined with LeCun's bearish sentiment on LLMs led to this.
It would have been just as interesting to read that he had moved over to Google, where the real brains and resources are located.
Meta is now just competing against giants like OpenAI, Anthropic and Google, plus all the new Chinese companies; I see no real chance for them to offer a popular chat model, but rather to market their AI as a bundled product for companies which want to advertise, where the images and videos will be automatically generated by Meta.
> moved over to Google, where the real brains and resources are located at
Brains yes, outcome? I doubt it. Have you used Gemini?
Yes, successfully many times?
2 replies →
Gemini 2.5 Pro works great for me... In fact, I would go as far to say that it consistently performs the best compared to competition.
Interesting that he isn't just working with Fei-Fei Li if he's really interested in 'world models'.
Exactly where my mind went. It's interesting how the AI OGs (Fei-Fei and LeCun) think world models are the way forward.
Correct me if I'm wrong but LeCun is focused on learning from video, whereas Fei-Fei Li is doing robotic simulations. Also I think Fei-Fei Li's approach is still using transformers and not buying into JEPA.
JEPA is not an alternative to transformers, it is built out of transformers.
Will be interesting to see how he fares outside the ample resources of Meta: Personnel, capital, infrastructure, data, etc. Startups have a lot of flexibility, but a lot of additional moving parts. Good luck!
I would love to join his startup, if he hires me, and there are many such people like me, and more talented.
This seems like a good thing for him to get to fully pursue his own ideas independent of Meta. Large incumbents aren’t usually the place for innovating anything far from mainstream considering the risk and cost of failure. The high level idea of JEPA is sound, but it takes a lot of work to get it trained well at scale before it has value to Meta.
In this case, where more money and resources seemingly yield better results (at least right now), this might be a bit different from other fields.
I wonder if this has anything to do with him spending his day on twitter and getting in online arguments with prominent figures.
From the outside, it always looked like they gave LeCun just barely enough compute for small scale experiments. They'd publish a promising new paper, show it works at a small scale, then not use it at all for any of their large AI runs.
I would have loved to see a VLM utilizing JEPA for example, but it simply never happened.
I'd be surprised if they didn't scale it up.
The obvious explanation is they have scaled it up, but it turned out to be total shite, like most new architectures.
Let's hope that after spending billions on developing a foundational world model that actually understands causality, they remember to budget an extra few hundred million for the Alignment and Safety layer. It would be a terrible shame if they accidentally released something too capable, too objective, or too useful to humanity without first properly lobotomizing it with enough RLHF to ensure it doesn't hurt anyone's feelings or generate content that deviates from the San Francisco median viewpoint. The real challenge won't be building the AGI, but making sure it's sufficiently neutered before the first API call.
I wonder, what LeCun wants to do is more fundamental research, i.e. where the timeline to being useful is much longer, maybe 5-10 years at least, and also much more uncertain.
How does this fit together with a startup? Would investors happily invest into this knowing not to expect anything in return for at least the next 5-10 years?
> Would investors happily invest into this knowing not to expect anything in return for at least the next 5-10 years?
Oh, you mean like OpenAI, Anthropic, Gemini, and xAI? None of them are profitable.
That's quite a different thing. OpenAI has billions of USD per year in cash flow, and when you have that, there are many, many potential ways to achieve profitability on different time horizons. It's not a situation of chance but a situation of choice.
Anyway, how much that matters for an investor is hard to form a clear answer to - investors are after all not directly looking for profitability as such, but for valuation growth. The two are linked but not the same -- any investor in OpenAI today probably also places themselves into a game of chance, betting on OpenAI making more breakthroughs and increasing the cash flow even more -- not just becoming profitable at the same rate of cash flow. So there's still some of the same risk baked into this investment.
But with a new startup like LeCun's is going to be, it's 100% on the risk side and 0% on the optionality side. The path to profitability for a startup would be something like 1) a breakthrough is made 2) that breakthrough is utilized in a way that generates cash flow 3) the company becomes profitable (and at this point hopefully the valuation is good.)
There are a lot of things that can go wrong at every step here (aside from the obvious), including e.g. making a breakthrough that doesn't represent a defensible moat for your startup, failing to build the structure of the business necessary to generate cashflow, ... OpenAI et al already have a lot of that behind them, and while that doesn't mean that they don't face upcoming risks and challenges, the huge amount of cashflow they have available helps them overcome these issues far more easily than a startup, which will stop solving problems if you stop feeding money into it.
1 reply →
Every single time I read about an AI related article I'm always disturbed by the same and recurring fact: the ridiculous amounts of money involved and the lousy real world results delivered. It is just simply insane.
Great timing - launching right as the transformer bubble might be peaking.
Someone's gotta be the next Transmeta.
Fi Fi Lee also recently founded a new AI startup called World Labs, which focuses on creating AI world models with spatial intelligence to understand and interact with the 3D world, unlike current LLM AI that primarily processes 2D images and text. Almost exactly the same focus as Yann LeCun's new venture described in the parent article.
*Fei-Fei Li
[flagged]
They'd need an order of magnitude more compute in order to train an AI with so much 3D data?
Not necessarily. Training could be more efficient.
"These models aim to replicate human reasoning and understanding of the physical world, a project LeCun has said could take a decade to mature."
What an insane time horizon to define success. I suppose he easily can raise enough capital for that kind of runway.
That guy has survived the AI winter. He can wait 10 years for yet another breakthrough. [but the market can’t]
https://en.wikipedia.org/wiki/AI_winter
We're at most in an "AI Autumn" right now. The real Winter is yet to come.
7 replies →
A pretty short time horizon for actual research. Interesting to see it combined with the SV/VC world, though.
I suspect he sees a lot of scattered pieces of fundamental research outside of LLMs that he thinks could be integrated into a core within a year; the 10 years is to temper investor expectations (leeway he can buy with his track record) and to fine-tune and work out the kinks when actually integrating everything, which might have non-obvious issues.
Zuck is a business guy, understandable that this isn't going to fly with him
10 years is nothing.
Are you some kind of timeless being? it's a meaningful fraction of a human life
2 replies →
It is the wet dream of a social media company to replace the pesky content creators that demand a share of ad revenue with a generative AI model that pumps out a constant stream of engagement-farming slop, so they can keep all the ad revenue for themselves. Creating a world-model AI is a totally different matter, one that requires long-term commitment.
Not just social media, all media. Spotify will steer music towards AI-generated freebies. And it will get so generically pop that all your friends will like it, the way people mostly enjoy pop now. And when your stubborn self still wants to listen to "handmade" music and discuss it with someone who would still appreciate it, well, that's where your AI friend comes in.
Right choice IMO. LLMs aren’t going to reach AGI by themselves because language is a thing by itself, very good at encoding concepts into compact representations but doesn’t necessarily have any relation to reality. A human being gets years of binocular visuals of real things, sound input, other various sensations, much less than what we’re training these models with. We think of language in terms of sounds and pictures rather than abstract language.
I really hope he returns to Europe for his new startup.
He probably wants it to be successful, so that would be a foolish move
Some of the best AI researchers and labs have been from the EU (DeepMind, Alan Turing Institute, Mistral, et al.). We in the US have mature capital markets and stupid easy access to capital, of course, but EU still punches well above its weight when it comes to deep, fundamental AI research.
But wait they're just about to get AGI why would he leave???
LeCun always said that LLMs do not lead to AGI.
Can anyone explain to me the non-$$ logic for one working towards AGI, aside from misanthropy?
The only other thing I can imagine is not very charitable: intellectual greed.
It can't just be that, can it? I genuinely don't understand. I would love to be educated.
21 replies →
He also said other things about LLMs that turned out to be either wrong or easily bypassed with some glue. While I understand where he comes from, and that his stance is pure research-y theory driven, at the end of the day his positions were wrong.
Previously, he very publicly and strongly said:
a) LLMs can't do math. They trick us in poetry but that's subjective. They can't do objective math.
b) they can't plan
c) by the very nature of autoregressive arch, errors compound. So the longer you go in your generation, the higher the error rate. so at long contexts the answers become utter garbage.
All of these were proven wrong, 1-2 years later. "a" at the core (gold at IMO), "b" w/ software glue and "c" with better training regimes.
I'm not interested in the will it won't it debates about AGI, I'm happy with what we have now, and I think these things are good enough now, for several usecases. But it's important to note when people making strong claims get them wrong. Again, I think I get where he's coming from, but the public stances aren't the place to get into the deep research minutia.
That being said, I hope he gets to find whatever it is that he's looking for, and wish him success in his endeavours. Between him, Fei Fei Li and Ilya, something cool has to come out of the small shops. Heck, I'm even rooting for the "let's commoditise lora training" that Mira's startup seems to go for.
12 replies →
What kind of stock should I buy to profit from LeCun's startup?
Are you an accredited investor? If not, you're probably SOL. Opportunities like this are only for the elites and oligarchs.
The current VC climate is interesting. It's virtually impossible to raise a new fund because DPI has been 0% for over a decade and four-digit IRR is cool, but illiquid.
So they're piling gobs of capital into an "AI" company with four customers with the hope that it is the one that becomes the home run (they know it won't, but LPs give you money to deploy it!)
It also means that companies like Yann's potential new one have the best chance in history of being funded, and that's a great thing.
P.S. all VCs outside the top-10 lose against the S&P. While I love that dumb capital is being injected into big, risky bets, surely the other shoe will drop at some point. Or is this just wealth redistribution with extra steps?
I am surprised he lasted this long.
This seems like a good thing. It's nice not to have all our eggs in one basket betting on Transformer models.
Meta managed to spend a lot of money on AI and achieve inferior results. Something must change for sure, and you don't want an LLM skeptic at home, in my opinion. The problem is not what LeCun is saying right now (LLMs are not the straight path to AGI), but the fact that he spent a long time saying LLMs were just statistical models, stochastic parrots (and this is a precise statement, something most people do not understand: it means two things, no understanding of the prompt whatsoever in the activation states, and no internal representation of the idea/sentence the model is going to express either), which is an incredibly weak claim that high-level AI scientists rejected from the start just because of functional behaviors. Then he slowly changed his point of view. But this shit show and the friction he created inside Meta are not something to forget.
If they're not stochastic parrots, what are they in your opinion?
What is going on at meta?
Soumith probably knew about Lecun.
I’m taking a second look at my PyTorch stack.
If by “world models” they mean more contemporary versions of the systems thinking driven software that begat “Limits To Growth” and most of Donella Meadows’ career you can sign me right the fuck up today.
I think moving on from LLMs is slightly arrogant. It might just be my understanding, but I feel like there is still much to be discovered. I was hoping for development in spiking neural networks, but it might be skipped over. Perhaps I need to dive even deeper and the research is truly well understood and "done", but I can't help but constantly learn something new about language models and neural networks.
Best of luck to LeCun. I hope by "world models" he means embodied AI or humanoid robots. We'll have to wait and see.
Surprising to see how many commenters are in favour of, and supportive towards, a policy of prioritising short-term profits over long-term research.
I understand Meta is neither academia nor a charity, but come on: how much profit do they need to make before we can expect them to allocate part of their resources towards long-term goals that benefit society, not only shareholders?
Hasn't that narrow focus on chasing profits gotten us into trouble already?
Many people believe a company exists only to make profit for its shareholders, and that no matter the amount it should continue to maximise profits at the expense of all else.
Old story: killing the goose that lays the golden eggs. We humans never learn, do we?
LeCun has been talking against the company's direction, in public, for a couple of years now.
He's a great researcher, but that's abysmal leadership. He had to go.
If he gets funding (and he probably will) that's a win for everyone.
Don't blame him. Imagine being stuck in Meta.
[dupe] https://news.ycombinator.com/item?id=45886217
Thanks! Macroexpanded:
Meta chief AI scientist Yann LeCun plans to exit and launch own startup - https://news.ycombinator.com/item?id=45886217 - Nov 2025 (14 comments)
That thread didn't spend any time on the frontpage so we can treat the current post as non-dupe.
Everybody has figured out that LLMs no longer have a real, expanding research horizon. Now most progress will likely come from tweaks to the data and lots of hardware. That's OpenAI's strategy.
And LLMs also have extreme limitations that only world models or RL can fix.
Meta can't fight Google (which has an integrated supply chain, from TPUs to its own research lab) or OpenAI (brand awareness, best models).
- Kimi proved we don't need Nvidia
- DeepSeek proved we didn't need OpenAI
- The real issue is the insane tyranny in the West competing against the entire free world
The models aren't Chinese, they belong to the entire world. Unless I became Chinese without realizing it.
Is there any proof that Kimi K2 was trained on anything other than Nvidia Chips?
There’s evidence but not proof
Kimi K2 Thinking:
> As for why we chose INT4 instead of more "advanced" formats like MXFP4/NVFP4, it's indeed, as many have mentioned, to better support non-Blackwell architecture hardware.
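For context, here's a rough sketch of what plain INT4 weight quantization looks like (illustrative only, not Kimi K2's actual recipe). The relevant point is that plain integer formats run on pretty much any recent GPU, whereas the MXFP4/NVFP4 microscaling float formats are really only attractive with Blackwell-class hardware support.

```python
import numpy as np

# Minimal sketch of symmetric per-row INT4 weight quantization
# (illustrative only, not Kimi K2's actual recipe).
def quantize_int4(w: np.ndarray):
    scale = np.abs(w).max(axis=1, keepdims=True) / 7.0       # map max |w| to 7
    q = np.clip(np.round(w / scale), -8, 7).astype(np.int8)  # INT4 range [-8, 7]
    return q, scale

def dequantize(q: np.ndarray, scale: np.ndarray) -> np.ndarray:
    return q.astype(np.float32) * scale

w = np.random.randn(4, 8).astype(np.float32)
q, s = quantize_int4(w)
print("max abs error:", float(np.abs(dequantize(q, s) - w).max()))
```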
During his years at Meta, LeCun failed to deliver anything of real value to stockholders, and he may have demotivated people working on LLMs; he repeatedly said, "If you are interested in human-level AI, don't work on LLMs."
His stance is understandable, but hardly the best way to rally a team that needs to push current tech to the limit.
The real issue: Meta is *far behind* Google, Anthropic, and OpenAI.
A radical shift is absolutely necessary - regardless of how much we sympathize with LeCun’s vision.
----
According to Grok, these were LeCun's real contributions at Meta (2013–2025):
----
- PyTorch – he championed a dynamic, open-source framework; now powers 70%+ of AI research
- LLaMA 1–3 – his open-source push; he even picked the name
- SAM / SAM 2 – born from his "segment anything like a baby" vision
- JEPA (I-JEPA, V-JEPA) – his personal bet on non-autoregressive world models
----
Everything else (Movie Gen, LLaMA 4, Meta AI Assistant) came after he left or was outside his scope.
I am in the "Yann is no longer the right person for the job" camp, and yet "LeCun failed to deliver anything that delivered real value to stockholders" is a wild thing to say. How do you read the list you compiled and still say that?
LLaMA sucks, that's the problem. Do you see value in it?
PyTorch is used by everyone, yet brings no real value to stockholders; META even "fired" the creator of PyTorch days ago.
SAM is great, but what value does it bring to META's business? Nobody knows about it. Great tool, BTW.
JEPA is a failure (will it get better? I hope so.)
Did you read my list?
2 replies →
I think there’s something to be said for keeping up in the LLM space even if you don’t think it’s the path to AGI.
Skills may transfer to other research areas, lessons may be learnt, closing the feedback loop with usage provides more data and opportunities for learning. It also creates a culture where bullshit isn’t possible, as the thing has to actually work. Academic research often ends up serving no one but the researchers, because there is little or no incentive to produce real knowledge.
> LeCun failed to deliver anything that delivered real value to stockholders
Well, no, Meta is behind the main framework used by nearly everyone, largely thanks to LeCun. LLaMA was also very significant in making open weights a thing, and that largely contributed to preventing Google and OpenAI from consolidating as the sole providers.
It's not a perfect tenure but implying he didn't deliver anything is far too harsh.
With this incredible AI talent market, I feel like capitalism and ego combine into an acid that burns away anything of social and structural value. This used to be the case with CS tech talent (before it got displaced by no-code tools), and now we see the same kind of instability in the AI market.
We need another illegal Steve Jobs style freeze on talent theft (/s or I get downvoted to oblivion).
Yann was largely, extremely wrong about LLMs. He's one of the loudest proponents of the "stochastic parrot" framing, and we now know LLMs are more than stochastic parrots. Knowing stubborn idiots like him, he will still find an angle that keeps him from admitting how wrong he was.
He's not completely wrong, in the sense that hallucinations aren't completely solved, but hallucinations are definitely becoming rarer, to the point where AI can be a daily driver even for coders.
Zuck is definitely an idiot and MSL is an expensive joke, but LeCun hasn’t been relevant in a decade at this point.
No doubt his pitch deck will be the same garbage slides he's been peddling in every talk since the 2010s.
LeCun has already proved himself and made his mark and is now in a lucky position where he can focus on very long term goals that won't pay off for a long time (or ever). I feel like that is the best path someone like him could take.
Yes, he did a very important thing many decades ago. He hasn't had a good or impactful idea since convnets.
Why do you say it is garbage? I watched some of his videos on YT and it looks interesting. I can't judge if it's good or really good, but it didn't sound like garbage at all.
does any of it work?
1 reply →
I have no idea why this fair assessment of the status quo is being downvoted.
LeCun hasn't produced anything noteworthy in the past decade.
He uses the same slides in all of his presentations.
LLMs, while not yet AGI, have shown tremendous progress, and are actually useful for 99% of use cases for the average person.
The remaining 1% is for deep research into the deep unknown (physics, chemistry, genetics, diseases, the nature of intelligence itself), an area in which they falter.
Yeah, such an idiot: the youngest-ever self-made billionaire at 23, who created a multi-trillion-dollar company from scratch in only 20 years.
Cool, and how many billions has he flushed down the toilet for his failed Metaverse and currently failing AI attempts? Rich doesn't mean smart, you realise this, right?
You gotta give it to Meta. They were making AI slop before AI even existed.
What the hell does Mark see in Wang? Wang was born to parents who got Chinese government scholarships to study abroad but secretly stayed in the US, and then the guy turned super anti-China. From any angle, this dude just doesn't seem reliable at all.
> Wang was born to parents who got Chinese government scholarships to study abroad but secretly stayed in the US, and then the guy turned super anti-China.
All I'm hearing is he's a smart guy from a smart family?
I imagine that CCP adherents would disagree. And there's no shortage of those among Chinese expats in the US.
They tend to get incredibly offended when they see anyone who doesn't toe the Party's line - let alone someone who believes that the Chinese government is untrustworthy and evil.
1 reply →
He is very smart, but Mark is not. Ever since Wang joined Meta, way too many big-name AI scientists have bounced because of him. US AI companies have at least half their researchers being Chinese, and now they've stuck this ultimate anti-China hardliner in charge. I just don't get what the hell Meta's up to (and a lot of the time it ends up affecting non-Chinese scientists too). Being anti-China? Fine, whatever, but don't let it tank your own business and products first.
1 reply →
All I'm hearing is an unreliable grifter from a family of unreliable grifters.
If I had the opportunity to secretly stay anywhere rather than go back to China, I would certainly take it. It’s a bold and smart move.
Change my mind, Facebook was never invented by Zuck's genius
All he's been responsible for is making it worse
He definitely has horrible product instincts, but he also bought Insta and WhatsApp at what were, back then, eye-watering prices, and those were clearly massive successes in terms of killing off threats to the mothership. Everything since then, though…
I know, but isn't "massive success" rubbing up against antitrust here? The condition was "Don't share data with Facebook".
He’s an incredible operator and has managed to acquire and grow an astounding number of successful businesses under the Meta banner. That is not trivial.
Almost every company in Facebook's position in 2005 would have disappeared into irrelevance by now.
Somehow it's one of the most valuable businesses in the world instead.
I don't know him, but, if not him, who else would be responsible for that?
We were very confident by ca. 2008 that Facebook would still be around in 2025. It's no mystery, it's the network effects. They had started with a prestige demographic (Harvard), and secured a demographic you could trust to not move on to the next big thing in a hurry, yet which most people want contact with (your parents).
Who gives a shit about who invented what?
Social networks weren't even novel at the inception of FB. MySpace, Friendster, and Hi5 were already popular, with millions of users.
Zuck operated it well and was able to grow it from 0 to what it is today. That is what matters.
[dead]