Comment by irthomasthomas

9 days ago

Insane. 3 points behind opus on the artificialanalysis index.

Mimo cost ~$400 at the old price, so about $40 today. Opus cost ~$5000

That's over 100x cheaper, and just 3 points behind.

I can't wait to experiment with an llm consortium of 100 deepseek and mimo models. Crazy times.

Shut up and take my m̶o̶n̶e̶y̶ data!

Edit: Gemini on google search told me I could write strikethrough text on hn using <s>. Mimo told me it was unsupported and then went on to list some tags that are supported, like <b>bold</b>. I tried copy pasting the word in strikethrough from a word processor but it lost the format. I ended up using mimo in an agent shell wrapper to produce it, and copy pasting from the terminal worked for some reason.

11 comments

irthomasthomas

_davide_ 9 days ago

I had a subscription before the price was cut down; the model kept randomly looping the with same character (burning 30% of the budget in one shot), and the overall performance for agentic purposes is, simply put, terrible. It finds non-existing bugs and randomly removes chunks of code to fix them, then even presents it as an "extra fix". Maybe it's a good generalistic model; I haven't tested it in that regard.

MiniMax (currently 2.7) which is a ~270B model tuned exclusively for agentic purposes, performs so MUCH better; it's more reliable and cheaper. Both are still far away from Opus 4.7 that I'm using at work. IMO benchmarks are just a very rough estimation; everyone cheats as much as they can get away with. Test the model yourself; do not make any assumptions based on the benchmarks.

I would love to see specialized, cheaper, bleeding-edge models like MiniMax for other non-agentic purposes as well. Why pay $1 for a general model when, for example, you can pay $0.1 for a content-moderator model that you actually need?

zarify 9 days ago

Funny, I had the opposite experience with MiniMax and Mimo when using OpenCode. MiniMax got stuck with looping through broken tool calls all the time and MiMo just powered through things and for the most part just worked.
shanoaice 6 days ago

similarly for me, MiniMax is kind of horrible that it somewhat regularly fall into loops that I had to save it from. DeepSeek & MiMO rarely got stuck. wonder how you get completely reversed experience.

megous 9 days ago

So I tried the $16/mo token plan. Burned through 31% of monthly budget in one 1-2h session of a small C project refactoring, saw some not great behavior (hey subagent, read me back these 6 files exactly - which probably burned a lot of output tokens) and will cancel, obviously.

This is waaaaay more constrained than even Claude Pro plan, let alone Deepseek V4 or Kimi K2.6 pricing.

maxdo 9 days ago

benchmarks we deserve: google search quick ai answers vs full llm model :)

irthomasthomas 9 days ago
search answers use Flash 3.5
- maxdo 9 days ago
  
  they use a "low" flavor of it to scale it on billions of users

noman-land 9 days ago

What did MiMo say?

irthomasthomas 9 days ago
Says its not supported and lists a few tags that are, like <b>bold</b>
Does this work: s̶t̶r̶i̶k̶e̶t̶h̶r̶o̶u̶g̶h̶
- noman-land 9 days ago
  
  Well done. Unicode wins again. 𓂺