Comment by DeathArrow
9 hours ago
Those are some impressive benchmark results. I wonder how well it does in real life.
Maybe we can get away with something cheaper than Claude for coding.
9 hours ago
Those are some impressive benchmark results. I wonder how well it does in real life.
Maybe we can get away with something cheaper than Claude for coding.
I'm curious about the "cheaper" claim -- I checked Kimi pricing, and it's a $200/mo subscription too?
On openrouter 2.5 is at 0.60/3$ per Mtok. That's haiku pricing.
The unit economics seem tough at that price for a 1T parameter model. Even with MoE sparsity you are still VRAM bound just keeping the weights resident, which is a much higher baseline cost than serving a smaller model like Haiku.
They also have a $20 and $40 tier.
https://www.kimi.com/code
If you bargain with their bot Kimmmmy (not joking), you can even get lower pricing.
2 replies →