Comment by thot_experiment

7 days ago

Not true! Mistral is really really good, but I agree that there isn't a single decent open model from the USA.

21 comments

thot_experiment

Mistral is cool and I wish them success but it consistently ranks extremely low on benchmarks while still being expensive. Chinese models like DeepSeek might rank almost as low as Mistral but they are significantly cheaper. And Kimi is the best of both worlds with incredible benchmark results while still being incredibly cheap

I know things change rapidly so I'm not counting them out quite yet but I don't see them as a serious contender currently

thot_experiment 7 days ago

Sure, benchmarks are fake and I use Mistral over equivalently sized models most of the time because it's better in real life. It runs plenty fast for me, I don't pay for inference.
BoredomIsFun 7 days ago
> it consistently ranks extremely low on benchmarks
As general purpose chatbots small Mistral models are better than comparably sized Chiniese models, as they have better SimpleQA scores and general knowledge of Western culture.
- seanmcdirmid 6 days ago
  
  It’s really hard to beat qwen coder, especially for role play where the instruction following is really useful. I don’t think their corpus is lacking in western knowledge, although I wonder if Chinese users get even better results from it?
  
  8 replies →
Eupolemos 7 days ago
Why are you talking price when we are talking local AI?
That doesn't make any sense to me. Am I missing something?
- dirasieb 7 days ago
  
  15 missed calls from your local power company
- culi 7 days ago
  
  Your electricity is free?
  
  4 replies →

ac29 5 days ago

Arcee is working on that, see a blog post about their newest in progress model here: https://www.arcee.ai/blog/trinity-large

Its still not fully post trained and its a non-reasoning model, but its worth keeping an eye on if you dont want to use the Chinese models that currently are the best open-weight options.

CamperBob2 7 days ago

To be fair there are lots of worse models than OpenAI's GPT-OSS-120b. It's not a standout when positioned next to the latest releases from China, but prior to the current wave it was considered one of the stronger local models you can reasonably run.