Comment by cubefox
10 hours ago
Someone recently made a graph showing that the gap between US American frontier LLMs and Chinese open weight LLMs (including DeepSeek v4) is widening. Unfortunately I can't find it anymore.
Update: GPT-5.5 found it.
Article: https://www.nist.gov/news-events/news/2026/05/caisi-evaluati...
Graph: https://www.nist.gov/sites/default/files/images/2026/05/01/1...
Give it time. It's inevitably a logistic curve.
I believe logistic curves make no sense when you have Elo scores.
This is propaganda, not data.
If the Chinese government published a graph that said the opposite, would you consider that a serious and objective source?
If the methodology in the accompanying write-up did look credible, yes. Though I have significantly more trust in US agencies, like NIST in this case.
Someone is an official website of the united states gouvernement. I would prefer another source.
I think no other source exists.