Comment by internet2000

1 day ago

Attribution isn't the relevant part. Lying about your lab's capabilities is.

That's also something all the AI companies have been doing.

  • Lying about model capability is right now the lingua franca of the cloud AI business model, almost; they yes-and each other's lies because they are in a position of needing to generate interest, including going as far as needing to trigger regulatory capture.

    (It's not news to anyone who has worked in sales-led businesses that salespeople are prone to believing the claims of other salespeople, I guess).

    • > Lying about model capability is right now the lingua franca of the cloud AI business model

      Lying about your lab's capabilities != Lying about model capability

      Exaggerating the capabilities of a new model that you've actually trained in press bulletins can be called marketing. Merging two models and claiming that you trained a new model is plain lazy.

      1 reply →

It seems to me like the lies are both for the same reason. To capture attention and profits that are not deserved.

I do not see anyone lying.

The model card says:

> Post-trained from Qwen 3.5 397B

The model card also says that they use an inference framework based on "SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs" by Shi et al.:

https://arxiv.org/abs/2510.05069

So the sources seem properly attributed.

They only claim that what they did to "Qwen 3.5 397B" has improved the LLM, including, as expected, with "strong performance in Portuguese".

  • That's attribution to Qwen team.

    There (is/was) no attribution to Nex team (they've released a model based on Qwen 3.5 397B as well).

    As per OP link Nex claims that what Rio team released (so far) is just linear interpolation of weights between Nex and OG Qwen model. With no attribution to Nex and zero signs of Rio doing any training of their own.