← Back to context

Comment by unrvl22

1 day ago

The municipality of Rio de Janeiro (via its IT company IplanRIO) released Rio-3.5-Open-397B, presented as a homegrown Qwen3.5 fine-tune that beats comparable open models on benchmarks. The linked issue argues it's actually a weighted merge of ~60% Nex-N2 Pro + ~40% Qwen3.5-397B-A17B - Nex-N2 having been released about a week earlier.

I didn't know model merging like that was possible. (Obviously possible from a pure software standpoint but I'm surprised it's effective)

So the problem isn’t in the missing attribution to Qwen, but with the fact that they didn’t mention Nex-N2 Pro right?

  • The problem is that they claimed to have made a big achievement with their home grown post training, and they expected to receive a lot of praise for it.

    Then researchers looked at the weights and there is no post training at all.

    They are now attributing both models they merged, but their excuse for the lack of post training is to claim they accidentally uploaded the wrong files.

    • I’d believe they accidentally uploaded the wrong files if they uploaded the correct ones. To state that they accidentally uploaded something else and then not upload the correct version means they probably do not have anything and either hope people forget about this or they are scrambling to have something that is at least close to their original claim.

      1 reply →

Rio better have the best IT infrastructure and software in the world if they are spending time on LLMs. What a waste of tax payer money.

  • Piaui state it's also doing a LLM it seems. But indeed it would make more sense if it was a national thing rather than local...