Comment by cubefox

10 hours ago

Someone recently made a graph showing that the gap between US American frontier LLMs and Chinese open weight LLMs (including DeepSeek v4) is widening. Unfortunately I can't find it anymore.

Update: GPT-5.5 found it.

Article: https://www.nist.gov/news-events/news/2026/05/caisi-evaluati...

Graph: https://www.nist.gov/sites/default/files/images/2026/05/01/1...

6 comments

cubefox

mordae 7 hours ago

Give it time. It's inevitably a logistic curve.

cubefox 7 hours ago

I believe logistic curves make no sense when you have Elo scores.

tirpen 7 hours ago

This is propaganda, not data.

If the Chinese government published a graph that said the opposite, would you consider that a serious and objective source?

cubefox 7 hours ago

If the methodology in the accompanying write-up did look credible, yes. Though I have significantly more trust in US agencies, like NIST in this case.

lugu 8 hours ago

Someone is an official website of the united states gouvernement. I would prefer another source.

cubefox 8 hours ago

I think no other source exists.