Comment by culi

7 days ago

Mistral is cool and I wish them success, but it consistently ranks extremely low on benchmarks while still being expensive. Chinese models like DeepSeek may rank almost as low as Mistral, but they are significantly cheaper. And Kimi is the best of both worlds: incredible benchmark results while still being incredibly cheap.

I know things change rapidly, so I'm not counting them out quite yet, but I don't see them as a serious contender currently.

Sure, benchmarks are fake, and I use Mistral over equivalently sized models most of the time because it's better in real life. It runs plenty fast for me, and I don't pay for inference.

> it consistently ranks extremely low on benchmarks

As general-purpose chatbots, small Mistral models are better than comparably sized Chinese models: they have better SimpleQA scores and more general knowledge of Western culture.

  • It’s really hard to beat qwen coder, especially for role play where the instruction following is really useful. I don’t think their corpus is lacking in western knowledge, although I wonder if Chinese users get even better results from it?

    • > It’s really hard to beat qwen coder, for role play

      I am not sure you have actually tried that. Mistral models are the widely accepted go-to for roleplay and creative writing. No Qwen is good at prose, except their latest big Qwen 3.5.

      > I don’t think their corpus is lacking in western knowledge,

      It absolutely is, especially pop culture knowledge.


Why are you talking price when we are talking local AI?

That doesn't make any sense to me. Am I missing something?

  • Your electricity is free?

    • Apple silicon is crazy efficient, as well as being comparable to GPUs in performance for the Max and Ultra chips.

    • If you have the hardware to run expensive models, is the cost of electricity much of a factor? According to Google, the average price in the Silicon Valley area is $0.448 per kWh. An RTX 5090 costs about $4,000 and has a peak power consumption of 1000 W. Maxing out that GPU for a whole year would cost $3,925 at that rate. That's comparable to the cost of the hardware itself.
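The arithmetic above can be sanity-checked (assuming the GPU runs 24/7 at its full 1000 W draw, and using the $0.448/kWh figure from the comment):

```python
# Yearly electricity cost for a GPU at sustained peak power.
# Figures are the ones quoted above: 1000 W draw, $0.448 per kWh.
watts = 1000
hours_per_year = 24 * 365            # 8760 hours
kwh_per_year = watts / 1000 * hours_per_year  # 8760 kWh
cost = kwh_per_year * 0.448
print(f"${cost:,.2f}")  # → $3,924.48, i.e. about $3,925
```

In practice the number would be lower, since real workloads rarely hold a card at peak draw around the clock.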
