← Back to context

Comment by gpm

3 days ago

> Do you think that Chinese labs will continue to release open models forever

Yes.

I think the Chinese government either already has, or will soon, grasp that if they train the models that people use they dictate what people believe (at least around the margins where that's malleable), and they will happily throw resources at that.

And simultaneously that the only way they can actually get everyone to use their models is if it's possible for us to run them on our own hardware.

(This isn't exactly a utopian view of the future)

This is going to age very poorly when the best Chinese labs ALREADY just started not open sourcing their models.

Qwen 3.7 is not open source; previous Qwen versions would have open source releases, but Qwen 3.7 plus does not. The second best Chinese model, Minimax M3, is testing the waters by taking longer and longer between “model release” and open sourcing it. This time, they spent 2 weeks after release before open sourcing it. There’s also a lot of rumors of GLM and Deepseek not open sourcing future models.

It’s pretty obvious that you cannot take Chinese models as open source for granted, they’ll be closed source soon.

  • If we're measuring progress in hours and days then yes. But if we're measuring progress in months then OSS models are doing fine. You can get a state-of-the art performance in an open model if you pretend it is January 2026 instead of June.

    There is no evidence here that the cutting edge labs have any durable advantage. Extrapolating current trends it seems likely that even the Europeans will be capable of meeting any given performance measure with enough time. In fact the evidence suggests that the capital required to run the models is where a moat will develop. Knowing the weights won't help much.

  • The best chinese models are deepseek (general purpose) and glm (coding) and they are both open weight and share lots of their tooling.

    There are lots of AI companies and it doesn’t seem that they all have the same funding fountain or share monetization goals. I wouldn’t read much into what each one of them is doing.

    • Even if the models by the Chinese labs are open source or open weights even after they get to mythos level intelligence lets say, still inference and the optimization of those models to be accessed at speeds of 1000 tokens/sec in not in the hands of general public as these models have parameters more than a trillion and they can't be run on some publicly available hardware, So even after being open source it does'nt fix the problem as the general public will still pay the company for inference.

      3 replies →

    • > The best chinese models are deepseek (general purpose)

      DeepSeek is developed by the largest Chinese hedge fund, their models used to make them $ on the share market are very profitable, they've never ever released anything on those models.

      Somehow you are claiming that those same group of people are going to totally change their very consistent long term behaviour and start promoting openness when they are in the global leading position in AI?

      2 replies →

  • The main reason the Chinese labs are releasing models as open weights is because they don't have the compute necessary to provide all of the inference. For the US frontier models something like 80-90% of the lifetime compute required for the model is inference rather than training. China wants to shepherd as much of their limited compute as possible towards training to keep up in the race.

    • I think the main reason is to minimize the market for closed-source models from US companies.

      China knows that doing what Anthropic/OpenAI/Google/... are doing is impossible for them. No one outside of China in any sane condition will send their data to compute farms IN CHINA like people currently do with US-based frontier models. Even if they could muster the inference power.

      Hence they do the second-best thing possible to attack the dominance of the US-based corporations: reduce their moat by open-sourcing models that are not fully equal, but practically useful and good enough for easily 90% of typical tasks people use agents for in their daily lives. But way cheaper to run.

      As long as this arms race in AI continues, China as "number two" will have some incentive to continue open-sourcing models. But of course the US government might force a change if they continue to enforce limited public access to new frontier models - there is no market to minimize if a model is not allowed to be publicly available.

      22 replies →

    • With nearly everyone using inference accelerators, the pool of hardware is no longer shared between training and use.

  • Kimi 2.7 and GLM 5.2 released today and are open source.

    • Minimax M3 too, and huawei claims to be releasing non-nvidia dependent training software too. openPangu 2.0 could be a shake-up if it holds up as a good model

      China may not care about open source, but they know they will personally fund AI through government investments while US relies on private investments, best way to scare private investments is a free capable alternative for everyone

      Add on the fact that they actually invested in energy infrastructure and can offer AI very cheap to their citizens and you can get a population well versed in AI to reduce menial tasks and focus on more productive things (if we're to believe the claims of the technology)

The US administration restricting the use of US-trained models is one of the best gifts it could make to the Chinese LLM producers, and to the PRC government.

  • This entire administration is a gift to everybody but the US. It’s either in service of Russia, China or whoever is willing to pay Trump the most.

    • It's funny how the acceleration of the downfall of the US (due to trump) is a gift to everyone else. It's almost as if US didn't have as postitive impact on the world as they thought.

      5 replies →

    • Chinese have a nickname for Trump. 川建国. Trump the nation builder(meaning China). But Biden actually continued most of Trumps policies.

      4 replies →

There's also the Meta motivation, that even if you don't get the control you would like from releasing a model, it may still be worth it to at least deny others that control. I'm sure that matters even more to China vs. the US than it mattered to Facebook vs. Google.

There is no moat in the model and by making the them open, it’s hard for one to be established when the free models are “good enough”.

OpenAI and Anthropic are both hamstrung by this. Anthropic does have the better chance of surviving.

You don’t need the cutting edge to influence people’s opinion. “Export LLMs” to the rescue.

> I think the Chinese government either already has, or will soon, grasp that if they train the models that people use they dictate what people believe (at least around the margins where that's malleable), and they will happily throw resources at that.

that doesn't require the model to be SOTA, it can be just a compact model capable of running on some inexpensive hardware. that is vastly different from SOTA models like Mythos which can potentially disrupt lots of things.

> > Do you think that Chinese labs will continue to release open models forever

> Yes.

holy shit the naivete of HN nowadays.