← Back to context

Comment by noelsusman

10 hours ago

The Artificial Analysis benchmark results are pretty underwhelming. Roughly the same "intelligence" as MiMo-V2.5-Pro for over 3x the cost. We'll have to see how that translates to actual usage but it's not a great sign.

That really depends on whether they have similar parameter counts, doesn't it? Unless you know that, the comparison is just strange

  • Bad look to tell people they're not allowed to compare things just because we need to respect Google's privacy

    • I didn't take the price into consideration when writing that. I meant to point out that even if they have similar scores, the Flash model might be smaller than MiMo or Kimi, which would by itself be a win

      That said, haste makes waste as the price point completely invalidates that