Comment by simonw 7 months ago Big release - https://huggingface.co/moonshotai/Kimi-K2-Instruct model weights are 958.52 GB
c4pt0r 7 months ago Paired with programming tools like Claude Code, it could be a low-cost, open-source replacement for Sonnet.
scottyeager 7 months ago Here's a neat-looking project that allows for using other models with Claude Code: https://github.com/musistudio/claude-code-router
I found that while looking for reports of the best agents to use with K2. The usual suspects like Cline and its forks, Aider, and Zed should be interesting to test with K2 as well.
martin_ 7 months ago How do you run a 1T param model at low cost?
maven29 7 months ago 32B active parameters with a single shared expert.
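For a rough sense of what those figures imply, here is a back-of-the-envelope sketch in Python. It uses only numbers quoted in this thread (roughly 1T total parameters, 32B active, 958.52 GB of weights). The function names are made up for illustration, and two things are assumptions: that "GB" means decimal gigabytes and that all weights are stored at the same precision.

```python
# Back-of-the-envelope arithmetic using only figures quoted in the thread.
# All inputs are approximate; the helper names are hypothetical.

TOTAL_PARAMS = 1.0e12     # "1T param model" (rounded figure from the thread)
ACTIVE_PARAMS = 32e9      # "32B active parameters" (from the thread)
WEIGHTS_BYTES = 958.52e9  # "958.52 GB", ASSUMING decimal gigabytes


def active_fraction(active: float, total: float) -> float:
    """Share of the parameters used for any single token."""
    return active / total


def bytes_per_param(weight_bytes: float, total: float) -> float:
    """Average storage per parameter, ASSUMING uniform precision."""
    return weight_bytes / total


def active_weight_bytes(weight_bytes: float, active: float, total: float) -> float:
    """Approximate bytes of weights read per token under the same assumption."""
    return weight_bytes * active / total


if __name__ == "__main__":
    frac = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
    per_param = bytes_per_param(WEIGHTS_BYTES, TOTAL_PARAMS)
    touched = active_weight_bytes(WEIGHTS_BYTES, ACTIVE_PARAMS, TOTAL_PARAMS)
    print(f"active fraction:        {frac:.1%}")
    print(f"bytes per parameter:    {per_param:.2f}")
    print(f"weights read per token: {touched / 1e9:.1f} GB")
```

Keep in mind that sparse activation reduces per-token compute and memory traffic, not storage: every expert still has to be held in memory or on fast storage, so the full download size remains the hardware floor.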
kkzz99 7 months ago According to the benchmarks it's closer to Opus, but I'd venture primarily for English and Chinese.