Good, coding harnesses should be open source and LLMs should be treated as commodities. Minimize switching costs for consumers, and let people understand how they're interacting with the context and the LLM outputs.
The industry has been moving the wrong direction with Claude Code staying closed (despite multiple times leaking the source code!) and the open source Gemini CLI being deprecated in favor of closed source Antigravity CLI.
Why would a company do any of these things? What is their motivation for any of it? That’s like saying cloud providers should be commodity and should open source all of their platforms and eliminate egress fees so customers can easily leave at any point in time.
Joel Spolsky in 2002 identified a major pattern in technology business & economics: The pattern of "commoditizing your complement", an alternative to vertical integration, where companies seek to secure a choke point or quasi-monopoly in products composed of many necessary & sufficient layers by dominating one layer while fostering so much competition in another layer above or below its layer that no competing monopolist can emerge, prices are driven down to marginal costs elsewhere in the stack, total price drops & increases demand, and the majority of the consumer surplus of the final product can be diverted to the quasi-monopolist.
No matter how valuable the original may be and how much one could charge for it, it can be more valuable to make it free if it increases profits elsewhere.
This pattern explains many otherwise odd or apparently self-sabotaging ventures by large tech companies into apparently irrelevant fields, such as the high rate of releasing open-source contributions by many Internet companies or the intrusion of advertising companies into smartphone manufacturing & web browser development & statistical software & fiber-optic networks & municipal WiFi & radio spectrum auctions & DNS: they are pre-emptive attempts to commodify another company elsewhere in the stack, or defenses against it being done to them.
On the question of language models and periphery tooling -
Open weight models are disruptive to the business models of closed model businesses. An incentive is if your business is built around X but model training is helpful to you, but you don’t expect to meter it specifically. You can release your models and undercut the exclusive moat of a new model company like OpenAI or Anthropic from becoming at some point a competitor, or holding their access as a chip in pricing negotiations. By opening your architectures and weights other competitors can build on them and newer better models emerge faster decoupled from a small number of proprietary models. This lets you focus on X while gaining overall momentum on your model release at no additional cost and no loss in focus on X, while defending against upstarts and monopolies.
This is effectively a lot of the open source world that comes from corporate development as well. It feels odd after this many decades of discussing corporate reasons to participate in open source we keep rehashing it.
Yes I do think cloud providers should open source all of their platforms, and this is not charity because it is essentially the hosting that they are providing as a product. Even if, say, google open sources its whole search infrastructure, it does not at all means you can just host your own due to the huge hardware requirements, but you can know(especially after AI which can be utilized to do this) that they are not using your data in a way they shouldnt.
Good will and trust can ultimately have monetary value, and having a funnel based on open source is a viable play if it leads to a service that is sticky.
Cloud providers are commodity, and egress pricing is partially cost following because they have to pay peering to their interconnect points for WAN. Internal networks are not charged within the account because the economics of the VPC overlay are optimized for that use, but inter account and VPC and other boundaries carry cost - especially interconnection between accounts because the way VPC treats virtualization requires a relatively expensive routing. Inter AZ and inter region pricing also exists for the same cost following reasons. They also help shape incentives because it allows them to optimize placement of compute within the same AZ to physical buildings or rings.
The case that is largely nonsense is the egress pricing on direct connects since beyond the circuit costs, which the customer pay, there’s no costs for aws not already on the customer regardless. It also makes DC friction weird in that you are incentivized to NOT move storage before compute.
The familiar Chinese recipe for success: Always copy and imitate first, even if it is inferior, always make it cheaper or even free so that the original innovator will be burdened by brutal price competition and much bigger R&D costs and cannot keep up in the long run. Then the copycat will win in the endgame.
Most of the cloud platforms are open source. Linux, container, k8s… it’s entirely possible for someone to build and deploy their private cloud if they have the resources.
> and eliminate egress fees
What does it mean? If I sign up for cloud service I am only bound to the contract terms. If I am PAYGO I can switch anytime.
The cloud provider isn't the harness, Terraform/OpenTofu/Pelumi and the abstractions you build using them are. The cloud provider is the LLM. It's not as fungible as the LLM and there's no direct comparison to egress costs of course, but that's moreso a problem with the metaphor.
Because Claude Code is literally nothing particularly special. We don't need their business model. They need their business model, and to that I say, tough shit.
After 16 years on Hacker News, I've come to associate its readership with cheap bastards who think everything should be free while simultaneously wanting to keep their 6-figure jobs.
There's a very strong overlap with male gamers, who also think everything involving sophisticated engineering and design should be cheaper than a cup of coffee.
Just call it out and maybe we can collectively choose to towards a culture that doesn't encourage such shameless behavior or perverted values.
opensourcing software may enable leverage of wider network of contributors to given piece of software,hence software can evolve much more quickly and efficiently.
I don’t like changing tools. What engineer does? I want to learn one tool and tune it to my exact preferences. Proprietary vendor tools are not portable and I avoid them.
Either Anthropic or OpenAI could drop the first-to-market open coding harness tomorrow and it would be as big as VSCode, it would be the standard platform everyone builds stuff on.
The jagged frontier of frontier models means treating tokens as fungible between providers is naive at the limit of capability, but also will work for solved problems far from the boundary. The problem is you need to keep evaluating all models to know where your use case lies on the frontier map.
As a concrete example, you’ll get very different results for the same prompt for sonnet, opus, fable, gemini, gpt 5.5, …
The complements ARE the LLM AND the harness. The actual products we're all consuming are GPUs. Memory being expensive is a second-order effect.
The platform is the GPU, and doing cool shit with it IS the complement, which requires more memory. And demand is so high and will stay high, that it looks like the platform itself.
The question is why supply is restricted, primarily by sanctions and tariffs to China, and the expressed refusal of RAM makers to even think about increasing supply, they are actually all sweaty about China taking a bit of the unrestricted market.
There is really nothing free in terms of money, there are only things really free in term of spirit. But AI coding assistant are not those things related to spiritual freedom.
> MiMoCode is built as a fork of OpenCode. It keeps all core OpenCode capabilities (multiple providers, TUI, LSP, MCP, plugins) and adds persistent memory, intelligent context management, subagent orchestration, goal-driven autonomous loops, compose workflows, and self-improvement via dream/distill.
Sounds like they slapped in a bunch of common plugins and released it as a product to promote the free-for-a-limited-time use of their new coding AI service.
> promote the free-for-a-limited-time use of their new coding AI service
Not sure which "free" service you're referring to, but MiMo v2.5 Pro is plenty capable & (after its recent 70%+ price drop) one of the most affordable options in its class (DeepSeek v4 Pro, MiniMax M3, & Qwen 3.7 Plus). I read somewhere that Labs are incentivized to implement custom harnesses because each model has its strengths, quirks, & blindspots (like Qwen forking Gemini CLI)?
Since the link is in Chinese: MiMo Code is Xiaomi’s AI agentic coding harness.
“ MiMoCode is a terminal-native AI coding assistant. It can read and write code, run commands, manage Git, and use a persistent memory system to keep a deep understanding of your project across sessions while continuously improving itself.”
Thanks, I missed that on first glance and did manual translation.
Not sure why my iPhone shows an option to translate website but all the destination languages to pick from (I have multiple languages installed), including English, are greyed out. iPhone does support translating from Chinese (Simplified or Traditional), and the button to translate website isn’t greyed out like it is for unsupported/unrecognized languages. Might be an iOS 27 bug, because it is working on other websites?
It's entirely possible, and even standard, to allow the browser to tell your site which language to respond in.
While ignorance of internationalization standards is a possibility, and the most likely cause.. I do wonder if it's a bit of a nudge to promote Chinese influence in the AI space.
Not that they really need to do that, China is already doing great (relatively, depending on criteria). The implosion of the US, the resulting brain drain and world shake-up has been very timely for their AI and other industries.
It's a very smart move for them to think longer term and start freezing out NVIDIA. Then they can take Taiwan purely for ideological concerns and not worry at all about the fabs blowing up in the process.
And they won't be dependent on foreign factories sitting on an island just off the shore of a superpower who's shown nothing below absolute resolve for decades towards the idea of conquering that island....
What a transformation by Xiaomi to build almost frontier level models. Five years back, when I was in the data science team, they dint really bother about AI models and were using Baidu for NLP and vision under the hood of their APIs
I fully expect Baidu and other tech giants on the Chinese shores to try and push the boundaries of technology. Silicon Valley (and the US) in general has always been the hot-bed of innovation. But with enormous increase in wealth in China (and to an extent in India), I can see these companies being more and more ambitious. Not long ago Andrew Ng of Coursera and Stanford AI Lab fame joined Baidu to further their rival to the 'Google Brain' project.
Xiaomi has long been positioning itself as a company with design chops of Apple, engineering chops of Google, and e-commerce chops of Amazon, all rolled into one-- and I can see where they are coming from. If they manage to pull it off, I guess that's when we'd start seeing the proverbial "Death of Silicon Valley" as in, it loosing its strange monopoly and strangle hold on tech world in terms of both talent and innovation.
"Death of Silicon Valley" in this case is such a funny perspective. Like, how twisted is the US's view of the market that they think "Competition? Oh no. Sound the alarms."
I know it's more mixed and complex than this, but i think a big opposition is not to the data centers themselves but to their locations. Too often it feels like the centers are exploiting local resources and community infrastructure rather than paying their share or locating themselves in places that are less likely to cause problems to home owners.
The whole process feels indifferent or even adversarial at times.
Do you know the old anecdote about the russian and american scientists talking about freedom? The one where the american explains that he is free to go and protest against the war in Vietnam and where the russian dismisses him that he is also free to protest against the war in Vietnam.
Xiaomi have been cooking a lot in recent times. Their model, especially the pro series, is underrated in my opinion. It haven't received the attention it deserves while it is pushing higher and higher in benchmark scores (looking at artifical analysis), and this was before Deepseek dropped V4.
Furthermore, their pricing plan is insanely cheap, they even upped usage limit for their cheapest plan, lite plan, which is at 5$ / month. And now, they are dropping a Harness for their own model? Amazing. I wish they added support for installation through Homebrew though.
On another note, this is what I would like to see more of from a company, what I do not welcome is startups making their model exclusive and hurt their customer base through sabotaging as a way to prevent eventual distillation attempts.
>Furthermore, their pricing plan is insanely cheap, they even upped usage limit for their cheapest plan, lite plan, which is at 5$ / month.
Unless something changed their plans aren't really worth getting. They're not that much cheaper than the per-token rates, and because it's a plan, you have to contend with weird usage restrictions. You're better off paying per-token unless you have some use case that demands a very steady stream of tokens.
Indeed. I did the math and arrived at the same conclusion. They don’t really subsidize their token plans. Maybe because their api pricing is already dirt cheap
Looks like they have very effective collaboration with DeepSeek and Kimi. Those three models have been bouncing ideas and sharing R&D innovation, which made all of them improve very fast.
Based solely on quality and price, OpenAI, Anthropic, and other western models just can't compete with the new generation of Chinese open models.
>Looks like they have very effective collaboration with DeepSeek and Kimi.
The collaboration is informal. People don’t seem to realize this, but the Chinese internet for programmers and developers today feels a lot like StackExchange in its heyday. There’s a huge emphasis on sharing knowledge, because sharing what you know builds your profile, and becoming a rockstar in a subfield is one of the only ways to get ahead.
Competition in China is ruthless. But unlike in North America, where individuals are often bound by agreement to hoard knowledge because it can give them a competitive edge, the competitive advantage in China is building face and peer recognition. And that comes from proving that you are worthy of being a "master/teacher", and that extends to the valuation of your knowledge business. For example, the third wave coffee shops in China, the master roaster is often called "master/teacher" once they win a roasting competition and start sharing new knowledge of roasting in the public sphere, and that's a title of sincere respect.
You can see parallels with those that apply to give talks at conferences and post snazzy technical presentations they give in the US, but the bar for what qualifies as new knowledge is far higher in China because there's a massive ecosystem of people rushing to outcompete what you have to offer, and once the ball gets rolling on knowledge sharing, lots of people will go off and build upon that knowledge or try to build businesses on top of that, which in turn produces more knowledge.
Reading developer forums in China, once you crack the code (I find Gemini will get you a good chunk of the way with good translations), they are really quite far ahead with what they're willing to share. And I suspect in great part, the decision to release open-weights is heavily tied to that concept of building face/peer recognition = building valuations.
> Looks like they have very effective collaboration with DeepSeek and Kimi. Those three models have been bouncing ideas and sharing R&D innovation, which made all of them improve very fast.
Very fascinating to learn this, didn't know Moonshoot (Kimi) also collaborated with others. I think I read in another post that DeepSeek and Qwen team shared the same building? So that kind of explains it.
> Based solely on quality and price, OpenAI, Anthropic, and other western models just can't compete with the new generation of Chinese open models.
I have to agree. I had the great opportunity to take the offer Z.ai had with their Christmas deal, their lite plan was 3 months for 7$. GLM-4.7 was already impressive enough.
When they released GLM-5-Turbo and GLM-5.1, that is when I came to the realization of how close the gap is between proprietary western models and Chinese open-weight ones (not all of them are ofc).
I could barely believe how good GLM-5.1 was, I didn't think I was using it in CC and had to check the settings again. It's astonishing how close the gap is now, and this competition benefits us very much, the pricing is so low atm, its amazing.
Pretty neat that you can just install it and start using it (at a Sonnet 4.6-level model) without needing to sign in or pay.
Typically, Chinese websites are a big pain to log in or sign up because they require a +86 phone number due to legal reasons. Being able to use it without having to make an account is amazing for friction reduction. I could probably even just install it onto new machines to help with set up.
I wonder how they are gonna detect and block abuse though?
MiMo v2.5.0-Pro is honestly the first Chinese model that I've tried where I really though why should I use Claude Sonnet when I can get the same results for a fraction of the cost. There was always something off about Chinese models that made it apparent that it couldn't fully compete with GPT, Claude, Gemini, etc. but this was the first model where I was like, this feels like Sonnet.
I can't prove it, but I think they trained heavily on Claude output. From my perspective I don't care since Anthropic trained on my data.
Using them also works well for North Americans as our peak hours is not theirs.
If I had one complaint, the v2.5.0-Pro model thinks too much.
GLM 5.1 is stronger than Sonnet 4.6 in my opinion, but while they have a coding plan that is a good value MiMo beats it on price. I haven't used MiMo much yet but it felt pretty similar.
So funny I have noticed how terrible the signup is on all these Chinese models, companies etc. Always wonder why it is such an easy process. Like QQ, Tencent etc demos Ive seen past year
Claude and Codex pricing will eventually have to come down, for most common coding tasks you don't need a super smart slow model but a smart-enough and very fast one.
Microsoft github copilot recently changed their billing. i'm on the yearly subscription. GPT-5.4 is now 6x and even previously free model like GPT-5 mini now cost .33x. its only June 11 and my usage is now at 50%.
I don't think many understand that Sonnet and even Haiku can probably accomplish their task, instead of them invoking a beast like Opus to tell them about todays weather.
And yet, MiMo and DeepSeek, even MiniMax, are way cheaper and arguably better, or way better than both Sonnet and especially Haiku.
While you can argue you are ready to pay 100-1000 times the price for Fable or Opus because you need those last 1-2% of edge, there's no valid reason to keep paying the obscene amounts of money for Sonnet and Haiku when alternatives exist.
I don't known how Codex works, but we can set environment variables and point Claude CLI to deepseek. I think that before slashing prices they will slash those environment variables. After all they are not working to give a free TUI to deepseek and possibly to other competitors. But eventually yes, prices will go down or there will be an attempt at a regulatory capture.
Most importantly, we need a model that doesn't randomly refuse us when we ask it to do something, or worse, deliberately sabotages us when it thinks we're building competing products. Like Anthropic's Fable.
I've worked a lot with MiMo in my project that pits LLMs against each other in games (clankerfights.ai). It is a very very good model for the price. MiniMax I'd say is smarter, but MiMo really touches near pareto frontier.
I found it relevant and actually just the information I was looking for. Having a highly recommended model behind the tool makes it worth further investigation.
Good timing, I was looking for alternatives earlier today. opencode didn't install properly and I wasn't a fan of oh-my-pi and nanocoder.
MiMo code (via my z.ai coding plan) is very pleasant so far, nice UI and seems to respond faster than Claude Code. It might be injecting much less cruft into the conversation.
I also got access to the mimo-2.5-pro ultraspeed model yesterday, which is really quite snappy. It does cost more than DeepSeek, though, so I'm not sure whether it's worth it yet. Definitely fast though.
It did install, but then would hang when trying to configure the provider (specifically I was trying diffusiongemma via Nvidia). I gave up after a couple of attempts and moved to the next one.
it does have telemetry, enabled by default, that sends metrics to tracking.miui.com, including what model you are using. it can be turned off by environment variable (MIMOCODE_ENABLE_ANALYSIS=false), and yes it still has all the normal OpenCode provider logic so it will work with other/local models. it also automatically looks for updates and fetches a mimo model list, including when the telemetry is off, though those can also be disabled.
telemetry enabled by default and named "analysis" is not great.
Because they want to optimize it for their models and don't want to be blocked by waiting for PRs to merge or be rejected.
There's plenty of reasons to start your own fork that you have full agency of, as long as the OSS License is maintained anyone will be able to benefit from any new features they want to make use of.
Because its currently impossible to "contribute to Opencode".
There are over 500 pages of open issues, up from 78 less than a month ago. They are doing nothing to halt the garbage/duplicates that pop up, and not even addressing legitimate PRs/reports.
To go a different path perhaps? You can't expect that all your ideas will land into a main repo and you really want to implement your vision while using a sane base.
Could just be a courtesy - Americans tend to be rather suspicious and hostile to contributions coming from China, and it might draw unwarranted attention from agencies and bad media.
I don't think that's true? AFAIK OpenCode started as a TUI and their GUI app is Tauri-based, so don't think it was forked from OpenCode. You might be thinking of Cursor
There were once two harnesses named OpenCode, one written in Go & the other in TypeScript (the more popular one).
Kujtim Hoxha creates a project named TermAI. (SST folks) Dax & Adam join the project, rebrand it to OpenCode with Dax buying the domain, opencode.ai.
Charm, the company behind the original libraries, acqui-hires Kujtim, who moves the project to Charm's organization, leaving SST unimpressed (due to VC involvement?)
Allegations Charm rewrote git history and deleted GitHub comments.
Dax claims ownership of the brand, forks project. For a time, 2 projects named OpenCode exist. Charm eventually renames its version to "Crush".
I like that they do their thing, but I'd be happier if there was a harness that could be tweaked, trivially, to do what their custom harness does. E.g. would this have been possible to do within Pi, as an extension? Then I could just "pi install mimo" and get the same results. I guess it would be more work on their part, to make sure it doesn't regress when something changes in pi itself.
As much as I absolutely love Mimo V2.5 Pro (it's a genuinely good model), I absolutely hate the way they calculate usage in their token plan.
For example: For a super small task in a small project that should not be consuming more than 500K total tokens after all tool calls included, their shown usage shot up to 152 million tokens.
But, when I scroll down on the same page, a table shows usage as 3 million tokens, out of which 2.5 million were cached.
This is such a huge conflict on the very same page. The bad thing is that the usage progress bar is shown against that 150 million token usage, not against that 3 million one.
This has been in discussions for at least past 3 months on reddit as well, and was precisely the reason I subscribed to their lowest tier, and for a single month only.
Update: their own harness, mimocode, shows total token usage as just 63.1K. We now have 3 entirely different values, differing in 3 orders of magnitude.
Update 2: So, I did the exact same task this time using DS4Pro, and total token usage was just 101K (as shown by opencode).
My coding agent VT Code has recently become a Xiaomi Orbit partner. If you want to try out Xiaomi Mimo V2.5 and V2.5 Pro in a different harness, feel free to use my VT Code. VT Code supports Mimo V2.5/Pro via official Xiaomi endpoints and via OpenRouter. Thank you!
I'm a fool for thinking that MiMo, in the context of Xiaomi who makes WiFi equipment (smartphones and routers), would be about network technology to manage parallel data streams (multiple inputs, multiple outputs https://en.wikipedia.org/wiki/MIMO)
Authorization for their own API doesn't work.. the web 'Authorize' page denies it, eventually goes through, but then you get stuck on 'Waiting for authorization' in the app. The web page says 'Paste this into MiMo Code' but there is nowhere to paste in.
I'm pretty sure most of the comments in this thread are AI bots propping up this product?
I tried the free model and it's nowhere near Sonnet 4.6 in terms of capabilities. The fact that token speed will randomly get stuck at 0/s makes sense given it's a free service, but the way it performs is more reminiscent of AI from 2025.
> MiMo Code is a terminal-based coding agent built by Xiaomi's MiMo team on top of OpenCode and open-sourced under the MIT license.
I think it is great that they built it on top of open code. Open Code harness is good and I want it to grow. Harness is very important and more projects use it, the more it is adopted.
It's "just" an opencode fork but it adds some nice features to try out while not being a full orchestrator metapackage like oh-my-opencode. Quite nice! Though it would be even nicer if this stuff came upstream or as an easy extension instead in the future
Only tangentially related: MiMo-2.5Pro is fast, cheap and very capable, although not quite gpt5.5 level iontelligence (I dont use the claudes). It works flawlessly in Pi and is an excellent workhorse. I expect big things from their next model.
Let the battle for the harnesses give free tokens for all, until the next competition arena does the same. It seems that's the only way AI will remain acessible.
This website is gorgeous, by the way. The mouse reveal on the background, amazing.
This is usually a PoC (Proof of concept) way to install something on a temporary container or temporary VM, but not for production use during daily desktop operation.
I was hoping their documentation would provide better installation instructions. But strangely, only for Windows do they recommend "npm install -g @mimo-ai/cli," which is a much better approach to managing installed packages.
For Mac/Linux, they have the strange recommendation to use the dangerous "curl <some_url> | bash." Quote:
> (for the best experience, Mac users are strongly encouraged to use iTerm or the VSCode Terminal)
> curl -fsSL https://mimo.xiaomi.com/install | bash
This is how everyone does it now. Including Anthropic.
To be fair, is that any different from naively trusting NPM? It's not like NPM is doing any vetting. They're every threat actors favorite sandbox these days.
You're right that it's as dangerous as it's executing random third-party code on your machine, but the method also has propagated far beyond PoCs and such at this point. All of these projects and many others push that install method: Bun, Deno, rustup, k3s, Docker (if using their helper script), Homebrew, Tailscale...
Frankly, it's not really more insecure than any other installation method. Apt packages and the like generally have the ability to specify pre/post-install scripts, so `sudo dpkg -i ./random.deb` is equivalent to `sudo bash ./random.sh`. Even if they didn't have pre/post-install scripts, they're still writing arbitrary files to arbitrary locations on your disk, so they can trigger execution the next time you boot or log in or whatever.
And at the end of the day, no matter the installation method (even just unpacking a tarball and executing the program directly from that directory), you're going to run their program on your computer, and then the program can do whatever it wants. Maybe you don't run it with sudo, but https://xkcd.com/1200/ seems relevant.
We've had this discussion since Eazel Linux desktop popularized bash | curl in 2001.
> npm install ... is a much better approach to managing installed packages.
No. Until the upcoming version of npm is out, npm will also run arbitrary code. Almost all common installation tools run arbitrary code. Not doing that is sadly the exception for now.
Unlimited context catches m :D I understand that it's not possible but it's worth to be checked to understand how you do your context compressions, etc.
Is that Open-Source like, run it locally, no phone home included, or open source like the thin front-end layer is all that is actually open-source but it’s an empty shell without the remote API it relies on?
They default it to talking to a free version of their model (which is incredibly cheap if you decide you like it.)
But it seems trivially easy to run it against local models. Their onboarding guide offers that option, though I have no idea if it changes any functionality.
The latter. It looks like it's meant to be a batteries-included agent to promote their free-for-a-limited time AI service that it connects to by default.
Ok, fair enough compared to the rest of the proeminent actors I guess, but quite confusing from dev point of view. Lately I started to experiment with model like Qwen2.5 on local. Good enough to ask simple question, but didn’t manage to do anything remotely close a agents I started to experiment with through Copilot.
sounds really cool if it was coming out of anywhere except China, which has laws to exfiltrate your data and send it back to the government for espionage purposes [0].
Isn't Unlimited Context pretty difficult to promise? What exactly do they mean, could I just have two agents locked into a TTRPG back and forth forever?
You can run the two agents in GAN like loop.. each trying to better the other. Give them a common task like design a better alternative to transformer model that uses max O(nlogn) memory, and the result comes closest to existing n^n implementations.
Good idea actually.. why haven't I tried this before.
Open source and open weight AI is very important to protect freedom of speech. OpenAI, and ESPECIALLY Anthropic, will try to ban them through regulatory capture / safety fearmongering. We need to make sure that does not happen. It’s not society’s problem if these frontier labs have no moats.
Terminals and things that live in terminals have relied on wcwidth() to handle this since time immemorial (which is always fun when they are out of sync, e. g. remote over ssh, but in the vast majority of the cases it just works).
Strange, I reckon I installed Tahoe just a while ago and still didn't have a similar issue, but I remember on previous MacOS versions the error message for unotarized binaries, was to warn that there was indeed a security issue with the binary, not that it was simply "damaged".
Reading "xiaomi"in the headline I was thrilled to see MiMo only to find out it's NOT this one: https://en.wikipedia.org/wiki/MIMO
What a disappointment!
Today on open source vampires: xaomi forks an open source project, doesn't contribute upstream, attaches usage restrictions that are probably incompatible with the license, and wants good PR. Fuck these people.
Good, coding harnesses should be open source and LLMs should be treated as commodities. Minimize switching costs for consumers, and let people understand how they're interacting with the context and the LLM outputs.
The industry has been moving the wrong direction with Claude Code staying closed (despite multiple times leaking the source code!) and the open source Gemini CLI being deprecated in favor of closed source Antigravity CLI.
Why would a company do any of these things? What is their motivation for any of it? That’s like saying cloud providers should be commodity and should open source all of their platforms and eliminate egress fees so customers can easily leave at any point in time.
That’s a charity, not a business model.
> That’s a charity, not a business model.
https://gwern.net/complement
8 replies →
On the question of language models and periphery tooling -
Open weight models are disruptive to the business models of closed model businesses. An incentive is if your business is built around X but model training is helpful to you, but you don’t expect to meter it specifically. You can release your models and undercut the exclusive moat of a new model company like OpenAI or Anthropic from becoming at some point a competitor, or holding their access as a chip in pricing negotiations. By opening your architectures and weights other competitors can build on them and newer better models emerge faster decoupled from a small number of proprietary models. This lets you focus on X while gaining overall momentum on your model release at no additional cost and no loss in focus on X, while defending against upstarts and monopolies.
This is effectively a lot of the open source world that comes from corporate development as well. It feels odd after this many decades of discussing corporate reasons to participate in open source we keep rehashing it.
Maybe these things should be utilities that can be swapped out at will and shouldn’t even be privately owned at all? Heresy, I know!
10 replies →
Because there is literally nothing special about coding hardnesses. The models are doing all the lifting. It just user experience that separates them.
A coding hardness with just bash outperforms Codex, Claude Code, OpenCode, Pi ect. The added features are just user experience features.
14 replies →
Public good isn’t a charity, and a business model that doesn’t contribute to the public good should not be allowed to exist.
18 replies →
Yes I do think cloud providers should open source all of their platforms, and this is not charity because it is essentially the hosting that they are providing as a product. Even if, say, google open sources its whole search infrastructure, it does not at all means you can just host your own due to the huge hardware requirements, but you can know(especially after AI which can be utilized to do this) that they are not using your data in a way they shouldnt.
1 reply →
Because they steal everything to train their models. They literally make you pay for the "commons" knowledge
Good will and trust can ultimately have monetary value, and having a funnel based on open source is a viable play if it leads to a service that is sticky.
Cloud providers are commodity, and egress pricing is partially cost following because they have to pay peering to their interconnect points for WAN. Internal networks are not charged within the account because the economics of the VPC overlay are optimized for that use, but inter account and VPC and other boundaries carry cost - especially interconnection between accounts because the way VPC treats virtualization requires a relatively expensive routing. Inter AZ and inter region pricing also exists for the same cost following reasons. They also help shape incentives because it allows them to optimize placement of compute within the same AZ to physical buildings or rings.
The case that is largely nonsense is the egress pricing on direct connects since beyond the circuit costs, which the customer pay, there’s no costs for aws not already on the customer regardless. It also makes DC friction weird in that you are incentivized to NOT move storage before compute.
2 replies →
The capital motivation isn't the only one that exists. You can say something should be true without having a plan to maximize quarterly revenues.
Even if you consider profit motive, what is the profit motive for corporate contributions to open source? The same applies here.
The familiar Chinese recipe for success: Always copy and imitate first, even if it is inferior, always make it cheaper or even free so that the original innovator will be burdened by brutal price competition and much bigger R&D costs and cannot keep up in the long run. Then the copycat will win in the endgame.
2 replies →
> cloud providers should be commodity
They are already.
> and should open source all of their platforms
Most of the cloud platforms are open source. Linux, container, k8s… it’s entirely possible for someone to build and deploy their private cloud if they have the resources.
> and eliminate egress fees
What does it mean? If I sign up for cloud service I am only bound to the contract terms. If I am PAYGO I can switch anytime.
2 replies →
They did steal all of our written knowledge
The cloud provider isn't the harness, Terraform/OpenTofu/Pelumi and the abstractions you build using them are. The cloud provider is the LLM. It's not as fungible as the LLM and there's no direct comparison to egress costs of course, but that's moreso a problem with the metaphor.
Because Claude Code is literally nothing particularly special. We don't need their business model. They need their business model, and to that I say, tough shit.
Well, until you established monopoly you need to build trust. Open Source is one way of doing just that. One of the better ways I would say even...
Do you think Internet Explorer 6.0 was a good decision?
1 reply →
After 16 years on Hacker News, I've come to associate its readership with cheap bastards who think everything should be free while simultaneously wanting to keep their 6-figure jobs.
There's a very strong overlap with male gamers, who also think everything involving sophisticated engineering and design should be cheaper than a cup of coffee.
Just call it out and maybe we can collectively choose to towards a culture that doesn't encourage such shameless behavior or perverted values.
5 replies →
opensourcing software may enable leverage of wider network of contributors to given piece of software,hence software can evolve much more quickly and efficiently.
1 reply →
“A business that does things that customers actually like is a charity” lmao
> Why would a company do any of these things?
What? It’s actually insane that they haven’t yet.
I don’t like changing tools. What engineer does? I want to learn one tool and tune it to my exact preferences. Proprietary vendor tools are not portable and I avoid them.
Either Anthropic or OpenAI could drop the first-to-market open coding harness tomorrow and it would be as big as VSCode, it would be the standard platform everyone builds stuff on.
3 replies →
This ^
ah nevermind it's just a fork of OpenCode
The jagged frontier of frontier models means treating tokens as fungible between providers is naive at the limit of capability, but also will work for solved problems far from the boundary. The problem is you need to keep evaluating all models to know where your use case lies on the frontier map.
As a concrete example, you’ll get very different results for the same prompt for sonnet, opus, fable, gemini, gpt 5.5, …
The complements ARE the LLM AND the harness. The actual products we're all consuming are GPUs. Memory being expensive is a second-order effect.
The platform is the GPU, and doing cool shit with it IS the complement, which requires more memory. And demand is so high and will stay high, that it looks like the platform itself.
> And demand is so high and will stay high,
The question is why supply is restricted, primarily by sanctions and tariffs to China, and the expressed refusal of RAM makers to even think about increasing supply, they are actually all sweaty about China taking a bit of the unrestricted market.
There is really nothing free in terms of money, there are only things really free in term of spirit. But AI coding assistant are not those things related to spiritual freedom.
what do you mean by should?
As much as their propaganda wants us to believe, Anthropic is not 'the industry'.
> MiMoCode is built as a fork of OpenCode. It keeps all core OpenCode capabilities (multiple providers, TUI, LSP, MCP, plugins) and adds persistent memory, intelligent context management, subagent orchestration, goal-driven autonomous loops, compose workflows, and self-improvement via dream/distill.
From github
Sounds like they slapped in a bunch of common plugins and released it as a product to promote the free-for-a-limited-time use of their new coding AI service.
> promote the free-for-a-limited-time use of their new coding AI service
Not sure which "free" service you're referring to, but MiMo v2.5 Pro is plenty capable & (after its recent 70%+ price drop) one of the most affordable options in its class (DeepSeek v4 Pro, MiniMax M3, & Qwen 3.7 Plus). I read somewhere that Labs are incentivized to implement custom harnesses because each model has its strengths, quirks, & blindspots (like Qwen forking Gemini CLI)?
9 replies →
So, basically the same thing silicon valley has been doing for the past half decade.
Since the link is in Chinese: MiMo Code is Xiaomi’s AI agentic coding harness.
“ MiMoCode is a terminal-native AI coding assistant. It can read and write code, run commands, manage Git, and use a persistent memory system to keep a deep understanding of your project across sessions while continuously improving itself.”
GitHub link (English): https://github.com/XiaomiMiMo/MiMo-Code
@dang might be better to link to the GitHub, and not for language reasons.
(Edit: for posterity, original URL as submitted was [0]).
[0]: https://mimo.xiaomi.com/mimocode
You can change the language via the header: The rightmost option is a language dropdown.
It's a client-side change and doesn't impact the URL so users must manually change it each time they visit the site though
Thanks, I missed that on first glance and did manual translation.
Not sure why my iPhone shows an option to translate website but all the destination languages to pick from (I have multiple languages installed), including English, are greyed out. iPhone does support translating from Chinese (Simplified or Traditional), and the button to translate website isn’t greyed out like it is for unsupported/unrecognized languages. Might be an iOS 27 bug, because it is working on other websites?
It's entirely possible, and even standard, to allow the browser to tell your site which language to respond in.
While ignorance of internationalization standards is a possibility, and the most likely cause.. I do wonder if it's a bit of a nudge to promote Chinese influence in the AI space.
Not that they really need to do that, China is already doing great (relatively, depending on criteria). The implosion of the US, the resulting brain drain and world shake-up has been very timely for their AI and other industries.
It's a very smart move for them to think longer term and start freezing out NVIDIA. Then they can take Taiwan purely for ideological concerns and not worry at all about the fabs blowing up in the process.
And they won't be dependent on foreign factories sitting on an island just off the shore of a superpower who's shown nothing below absolute resolve for decades towards the idea of conquering that island....
Why not persist it through a query param? Or a lang param for that matter
8 replies →
What a transformation by Xiaomi to build almost frontier level models. Five years back, when I was in the data science team, they dint really bother about AI models and were using Baidu for NLP and vision under the hood of their APIs
Wrote this eons ago:
https://news.ycombinator.com/item?id=9421471
"Death of Silicon Valley" in this case is such a funny perspective. Like, how twisted is the US's view of the market that they think "Competition? Oh no. Sound the alarms."
3 replies →
[flagged]
I remember when they made this:
https://en.wikipedia.org/wiki/Xiaomi_Mi_1
And now they make one of the fastest cars ever created and frontier-level AI. In just over a decade. 你好!
[flagged]
> While Americans Oppose AI Data Centers
I know it's more mixed and complex than this, but i think a big opposition is not to the data centers themselves but to their locations. Too often it feels like the centers are exploiting local resources and community infrastructure rather than paying their share or locating themselves in places that are less likely to cause problems to home owners.
The whole process feels indifferent or even adversarial at times.
9 replies →
Do you know the old anecdote about the russian and american scientists talking about freedom? The one where the american explains that he is free to go and protest against the war in Vietnam and where the russian dismisses him that he is also free to protest against the war in Vietnam.
correct
Xiaomi have been cooking a lot in recent times. Their model, especially the pro series, is underrated in my opinion. It haven't received the attention it deserves while it is pushing higher and higher in benchmark scores (looking at artifical analysis), and this was before Deepseek dropped V4.
Furthermore, their pricing plan is insanely cheap, they even upped usage limit for their cheapest plan, lite plan, which is at 5$ / month. And now, they are dropping a Harness for their own model? Amazing. I wish they added support for installation through Homebrew though.
On another note, this is what I would like to see more of from a company, what I do not welcome is startups making their model exclusive and hurt their customer base through sabotaging as a way to prevent eventual distillation attempts.
>Furthermore, their pricing plan is insanely cheap, they even upped usage limit for their cheapest plan, lite plan, which is at 5$ / month.
Unless something changed their plans aren't really worth getting. They're not that much cheaper than the per-token rates, and because it's a plan, you have to contend with weird usage restrictions. You're better off paying per-token unless you have some use case that demands a very steady stream of tokens.
Wow. I saw 4.1B credits and thought it was super generous. But my math says the subscription plan gives less value than the API.
For example, API input is $0.435 / M tokens, which works out to 13.79 M tokens for $6.
Plan is 300 credits per input token, which works out to 13.67 M tokens at 4.1B credits per $6.
Very similar math for cache input and output.
1 reply →
Indeed. I did the math and arrived at the same conclusion. They don’t really subsidize their token plans. Maybe because their api pricing is already dirt cheap
Looks like they have very effective collaboration with DeepSeek and Kimi. Those three models have been bouncing ideas and sharing R&D innovation, which made all of them improve very fast.
Based solely on quality and price, OpenAI, Anthropic, and other western models just can't compete with the new generation of Chinese open models.
>Looks like they have very effective collaboration with DeepSeek and Kimi.
The collaboration is informal. People don’t seem to realize this, but the Chinese internet for programmers and developers today feels a lot like StackExchange in its heyday. There’s a huge emphasis on sharing knowledge, because sharing what you know builds your profile, and becoming a rockstar in a subfield is one of the only ways to get ahead.
Competition in China is ruthless. But unlike in North America, where individuals are often bound by agreement to hoard knowledge because it can give them a competitive edge, the competitive advantage in China is building face and peer recognition. And that comes from proving that you are worthy of being a "master/teacher", and that extends to the valuation of your knowledge business. For example, the third wave coffee shops in China, the master roaster is often called "master/teacher" once they win a roasting competition and start sharing new knowledge of roasting in the public sphere, and that's a title of sincere respect.
You can see parallels with those that apply to give talks at conferences and post snazzy technical presentations they give in the US, but the bar for what qualifies as new knowledge is far higher in China because there's a massive ecosystem of people rushing to outcompete what you have to offer, and once the ball gets rolling on knowledge sharing, lots of people will go off and build upon that knowledge or try to build businesses on top of that, which in turn produces more knowledge.
Reading developer forums in China, once you crack the code (I find Gemini will get you a good chunk of the way with good translations), they are really quite far ahead with what they're willing to share. And I suspect in great part, the decision to release open-weights is heavily tied to that concept of building face/peer recognition = building valuations.
4 replies →
> Looks like they have very effective collaboration with DeepSeek and Kimi. Those three models have been bouncing ideas and sharing R&D innovation, which made all of them improve very fast.
Very fascinating to learn this, didn't know Moonshoot (Kimi) also collaborated with others. I think I read in another post that DeepSeek and Qwen team shared the same building? So that kind of explains it.
> Based solely on quality and price, OpenAI, Anthropic, and other western models just can't compete with the new generation of Chinese open models.
I have to agree. I had the great opportunity to take the offer Z.ai had with their Christmas deal, their lite plan was 3 months for 7$. GLM-4.7 was already impressive enough.
When they released GLM-5-Turbo and GLM-5.1, that is when I came to the realization of how close the gap is between proprietary western models and Chinese open-weight ones (not all of them are ofc).
I could barely believe how good GLM-5.1 was, I didn't think I was using it in CC and had to check the settings again. It's astonishing how close the gap is now, and this competition benefits us very much, the pricing is so low atm, its amazing.
1 reply →
Pretty neat that you can just install it and start using it (at a Sonnet 4.6-level model) without needing to sign in or pay.
Typically, Chinese websites are a big pain to log in or sign up because they require a +86 phone number due to legal reasons. Being able to use it without having to make an account is amazing for friction reduction. I could probably even just install it onto new machines to help with set up.
I wonder how they are gonna detect and block abuse though?
> at a Sonnet 4.6-level model
MiMo v2.5.0-Pro is honestly the first Chinese model that I've tried where I really though why should I use Claude Sonnet when I can get the same results for a fraction of the cost. There was always something off about Chinese models that made it apparent that it couldn't fully compete with GPT, Claude, Gemini, etc. but this was the first model where I was like, this feels like Sonnet.
I can't prove it, but I think they trained heavily on Claude output. From my perspective I don't care since Anthropic trained on my data.
Using them also works well for North Americans as our peak hours is not theirs.
If I had one complaint, the v2.5.0-Pro model thinks too much.
I find deepseek-v4-pro to be every bit as good as sonnet tbh
GLM 5.1 is stronger than Sonnet 4.6 in my opinion, but while they have a coding plan that is a good value MiMo beats it on price. I haven't used MiMo much yet but it felt pretty similar.
Is there a guide to running these models locally? Sonnet level inference on my own hardware would be world changing.
I have Claude but I don't want to ask it because Anthropic could decide to sabotage me.
4 replies →
So funny I have noticed how terrible the signup is on all these Chinese models, companies etc. Always wonder why it is such an easy process. Like QQ, Tencent etc demos Ive seen past year
Xiaomi has been selling physical products to West through their websites for 10+ years now. I dont think you will encounter any issues here
“Just install it and use it” is great UX right up until the first botnet also discovers great UX.
Much more information in the blog post this links to: https://mimo.xiaomi.com/blog/mimo-code-long-horizon
Terrific link thanks for highlighting it
Claude and Codex pricing will eventually have to come down, for most common coding tasks you don't need a super smart slow model but a smart-enough and very fast one.
I’m not convinced they can come down, especially as they both are opening their books for S1 IPO filing.
cheap token for the win.
Microsoft github copilot recently changed their billing. i'm on the yearly subscription. GPT-5.4 is now 6x and even previously free model like GPT-5 mini now cost .33x. its only June 11 and my usage is now at 50%.
I don't think many understand that Sonnet and even Haiku can probably accomplish their task, instead of them invoking a beast like Opus to tell them about todays weather.
And yet, MiMo and DeepSeek, even MiniMax, are way cheaper and arguably better, or way better than both Sonnet and especially Haiku.
While you can argue you are ready to pay 100-1000 times the price for Fable or Opus because you need those last 1-2% of edge, there's no valid reason to keep paying the obscene amounts of money for Sonnet and Haiku when alternatives exist.
3 replies →
I don't known how Codex works, but we can set environment variables and point Claude CLI to deepseek. I think that before slashing prices they will slash those environment variables. After all they are not working to give a free TUI to deepseek and possibly to other competitors. But eventually yes, prices will go down or there will be an attempt at a regulatory capture.
Claude Code TUI is garbage. There's nothing worth protecting in there.
1 reply →
Most importantly, we need a model that doesn't randomly refuse us when we ask it to do something, or worse, deliberately sabotages us when it thinks we're building competing products. Like Anthropic's Fable.
[flagged]
I've worked a lot with MiMo in my project that pits LLMs against each other in games (clankerfights.ai). It is a very very good model for the price. MiniMax I'd say is smarter, but MiMo really touches near pareto frontier.
This is my favorite of the Chinese models I have tried. I think it would be hard to know if I was using Opus of MiMo if blindfolded in many instances.
Yes, but this has nothing to do with MiMo (the model).
This is what Claude Code is to Claude
I found it relevant and actually just the information I was looking for. Having a highly recommended model behind the tool makes it worth further investigation.
1 reply →
Uh, what model do you think this is using?
MiMo Code is not a model, it's a harness like Claude Code / OpenCode / Codex (which is still open source, Apache 2.0, btw).
You might mean the MiMo-V2.5-Pro model?
He didn't say MiMo Code
2 replies →
Sorry for confusion. I indeed meant the model itself.
Good timing, I was looking for alternatives earlier today. opencode didn't install properly and I wasn't a fan of oh-my-pi and nanocoder.
MiMo code (via my z.ai coding plan) is very pleasant so far, nice UI and seems to respond faster than Claude Code. It might be injecting much less cruft into the conversation.
I also got access to the mimo-2.5-pro ultraspeed model yesterday, which is really quite snappy. It does cost more than DeepSeek, though, so I'm not sure whether it's worth it yet. Definitely fast though.
Opencode didn't install properly? Its just "mise use -g opencode@latest"
It did install, but then would hang when trying to configure the provider (specifically I was trying diffusiongemma via Nvidia). I gave up after a couple of attempts and moved to the next one.
is it local compatible and does it have telemetry?
it does have telemetry, enabled by default, that sends metrics to tracking.miui.com, including what model you are using. it can be turned off by environment variable (MIMOCODE_ENABLE_ANALYSIS=false), and yes it still has all the normal OpenCode provider logic so it will work with other/local models. it also automatically looks for updates and fetches a mimo model list, including when the telemetry is off, though those can also be disabled.
telemetry enabled by default and named "analysis" is not great.
"MiMoCode is built as a fork of OpenCode."
Why not just contribute to OpenCode instead of creating a clone :/
Because they want to optimize it for their models and don't want to be blocked by waiting for PRs to merge or be rejected.
There's plenty of reasons to start your own fork that you have full agency of, as long as the OSS License is maintained anyone will be able to benefit from any new features they want to make use of.
This is the beauty of open source :) KHTML -> WebKit -> Blink is a good example.
6 replies →
Because its currently impossible to "contribute to Opencode".
There are over 500 pages of open issues, up from 78 less than a month ago. They are doing nothing to halt the garbage/duplicates that pop up, and not even addressing legitimate PRs/reports.
Opencode sits on a ton of important PR's, so they didn't want to wait. Everybody else switched to omp (oh my pi) already.
To go a different path perhaps? You can't expect that all your ideas will land into a main repo and you really want to implement your vision while using a sane base.
Could just be a courtesy - Americans tend to be rather suspicious and hostile to contributions coming from China, and it might draw unwarranted attention from agencies and bad media.
OpenCode can merge in all their changes if they want.
There's a blog link https://mimo.xiaomi.com/blog/mimo-code-long-horizon
I think there's simply too much changed.
> Why not just contribute to OpenCode instead of creating a clone :/
It's controlled by a different organization; in particular a startup in a "competing" space.
have you ever tried contributing a large number of changes to OSS?
Why not?
[flagged]
I don't think that's true? AFAIK OpenCode started as a TUI and their GUI app is Tauri-based, so don't think it was forked from OpenCode. You might be thinking of Cursor
Are you thinking of Cursor? OpenCode is a TUI like Codex.
Do you even know what you're talking about?
OpenCode started as an independent CLI project. Their desktop app is still in beta, and it was never a fork of VS Code.
I believe they contain no code derived from VS Code.
What does “shamelessly forked” mean? It’s literally software meant to be forked lol
There were once two harnesses named OpenCode, one written in Go & the other in TypeScript (the more popular one).
https://news.ycombinator.com/item?id=44741894
1 reply →
I like that they do their thing, but I'd be happier if there was a harness that could be tweaked, trivially, to do what their custom harness does. E.g. would this have been possible to do within Pi, as an extension? Then I could just "pi install mimo" and get the same results. I guess it would be more work on their part, to make sure it doesn't regress when something changes in pi itself.
I thought this was a wireless/MIMO radio project at first
yeah, was also expecting some disruption in the RF-design space.
Kinda RF-nerd clickbait... :)
Well Xiaomi is first and foremost a mobile phone company.
I also thought the same lol. It also happened with lora
Redditors are unhappy about their coding plan: https://www.reddit.com/r/opencodeCLI/comments/1t37dz3/xiaomi...
I guess the way to use their models is through another provider, like https://opencode.ai/go
My comment on same: https://news.ycombinator.com/item?id=48493358
Things have improved a lot since last month. Now their cheapest plan gives you 4.1B credits and I think the cache situation has improved too.
Yes the subscription plan costs literally the same thing as just paying API-per-token pricing.
> Unlimited Context
>Knowledge accumulates automatically with lossless compression, preserving every critical detail even across million-line projects.
As much as I absolutely love Mimo V2.5 Pro (it's a genuinely good model), I absolutely hate the way they calculate usage in their token plan.
For example: For a super small task in a small project that should not be consuming more than 500K total tokens after all tool calls included, their shown usage shot up to 152 million tokens.
But, when I scroll down on the same page, a table shows usage as 3 million tokens, out of which 2.5 million were cached.
This is such a huge conflict on the very same page. The bad thing is that the usage progress bar is shown against that 150 million token usage, not against that 3 million one.
This has been in discussions for at least past 3 months on reddit as well, and was precisely the reason I subscribed to their lowest tier, and for a single month only.
Update: their own harness, mimocode, shows total token usage as just 63.1K. We now have 3 entirely different values, differing in 3 orders of magnitude.
Update 2: So, I did the exact same task this time using DS4Pro, and total token usage was just 101K (as shown by opencode).
It's very confusing. They have tokens for their API and credits for their "token plan".
Even worse... they use both terms on the same page in dashboard.
"""
Credits 4,100,000,000 Credits
Total Token Consumption
"""
3 replies →
My coding agent VT Code has recently become a Xiaomi Orbit partner. If you want to try out Xiaomi Mimo V2.5 and V2.5 Pro in a different harness, feel free to use my VT Code. VT Code supports Mimo V2.5/Pro via official Xiaomi endpoints and via OpenRouter. Thank you!
[0] https://github.com/vinhnx/VTCode/blob/main/README.md#Provide...
I'm a fool for thinking that MiMo, in the context of Xiaomi who makes WiFi equipment (smartphones and routers), would be about network technology to manage parallel data streams (multiple inputs, multiple outputs https://en.wikipedia.org/wiki/MIMO)
Authorization for their own API doesn't work.. the web 'Authorize' page denies it, eventually goes through, but then you get stuck on 'Waiting for authorization' in the app. The web page says 'Paste this into MiMo Code' but there is nowhere to paste in.
Token plan works fine.
I'm pretty sure most of the comments in this thread are AI bots propping up this product?
I tried the free model and it's nowhere near Sonnet 4.6 in terms of capabilities. The fact that token speed will randomly get stuck at 0/s makes sense given it's a free service, but the way it performs is more reminiscent of AI from 2025.
> MiMo Code is a terminal-based coding agent built by Xiaomi's MiMo team on top of OpenCode and open-sourced under the MIT license.
I think it is great that they built it on top of open code. Open Code harness is good and I want it to grow. Harness is very important and more projects use it, the more it is adopted.
It's "just" an opencode fork but it adds some nice features to try out while not being a full orchestrator metapackage like oh-my-opencode. Quite nice! Though it would be even nicer if this stuff came upstream or as an easy extension instead in the future
Are AI people using LLMs to name things? Just take a widely popular thing that already means something, capitalise it differently, and you're done.
Microsoft's LoRA (already a thing called LoRa) and now MiMo (already a thing called MIMO)
Maybe a classic Google search is not so bad, eh?
I'm kind of surprised the demo UI is macOS. Are they mainly using Apple products to develop these things?
The more advanced devs all use apple laptops, sure.
All the Ai related companies use Macs.
Who isn’t?
I'm slapping debian on any crap hardware around, but that's just me with different ideological standards.
1 reply →
Only tangentially related: MiMo-2.5Pro is fast, cheap and very capable, although not quite gpt5.5 level iontelligence (I dont use the claudes). It works flawlessly in Pi and is an excellent workhorse. I expect big things from their next model.
Let the battle for the harnesses give free tokens for all, until the next competition arena does the same. It seems that's the only way AI will remain acessible.
This website is gorgeous, by the way. The mouse reveal on the background, amazing.
The installation method they officially propagate is dangerous. curl -fsSL https://mimo.xiaomi.com/install | bash
This is usually a PoC (Proof of concept) way to install something on a temporary container or temporary VM, but not for production use during daily desktop operation.
I was hoping their documentation would provide better installation instructions. But strangely, only for Windows do they recommend "npm install -g @mimo-ai/cli," which is a much better approach to managing installed packages.
For Mac/Linux, they have the strange recommendation to use the dangerous "curl <some_url> | bash." Quote:
> (for the best experience, Mac users are strongly encouraged to use iTerm or the VSCode Terminal) > curl -fsSL https://mimo.xiaomi.com/install | bash
:(
This is how everyone does it now. Including Anthropic.
To be fair, is that any different from naively trusting NPM? It's not like NPM is doing any vetting. They're every threat actors favorite sandbox these days.
https://code.claude.com/docs/en/quickstart
You're right that it's as dangerous as it's executing random third-party code on your machine, but the method also has propagated far beyond PoCs and such at this point. All of these projects and many others push that install method: Bun, Deno, rustup, k3s, Docker (if using their helper script), Homebrew, Tailscale...
Frankly, it's not really more insecure than any other installation method. Apt packages and the like generally have the ability to specify pre/post-install scripts, so `sudo dpkg -i ./random.deb` is equivalent to `sudo bash ./random.sh`. Even if they didn't have pre/post-install scripts, they're still writing arbitrary files to arbitrary locations on your disk, so they can trigger execution the next time you boot or log in or whatever.
And at the end of the day, no matter the installation method (even just unpacking a tarball and executing the program directly from that directory), you're going to run their program on your computer, and then the program can do whatever it wants. Maybe you don't run it with sudo, but https://xkcd.com/1200/ seems relevant.
2 replies →
Codex use this (for update).
> sh -c 'curl -fsSL https://chatgpt.com/codex/install.sh | CODEX_NON_INTERACTIVE=1 sh'
This is just sh, not bash, but I doubt it would be any better.
Thats exactly same as Claude Code offer: https://code.claude.com/docs/en/quickstart
We've had this discussion since Eazel Linux desktop popularized bash | curl in 2001.
> npm install ... is a much better approach to managing installed packages.
No. Until the upcoming version of npm is out, npm will also run arbitrary code. Almost all common installation tools run arbitrary code. Not doing that is sadly the exception for now.
I use npm 11.16.0 and it did this
npm warn allow-scripts Run `npm approve-scripts --allow-scripts-pending` to review, or `npm approve-scripts <pkg>` to allow.
Isn't executing arbitrary code kind of the entire point of NPM though? Any chance you have a link to something that describes their plans?
3 replies →
Unlimited context catches m :D I understand that it's not possible but it's worth to be checked to understand how you do your context compressions, etc.
Is that Open-Source like, run it locally, no phone home included, or open source like the thin front-end layer is all that is actually open-source but it’s an empty shell without the remote API it relies on?
They default it to talking to a free version of their model (which is incredibly cheap if you decide you like it.)
But it seems trivially easy to run it against local models. Their onboarding guide offers that option, though I have no idea if it changes any functionality.
The latter. It looks like it's meant to be a batteries-included agent to promote their free-for-a-limited time AI service that it connects to by default.
Ok, fair enough compared to the rest of the proeminent actors I guess, but quite confusing from dev point of view. Lately I started to experiment with model like Qwen2.5 on local. Good enough to ask simple question, but didn’t manage to do anything remotely close a agents I started to experiment with through Copilot.
2 replies →
It defaults to the API, but you can download the weights for the 1T-parameter Pro model and selfhost if you have the hardware for it.
It's free to use because you are the product they are selling.
Hm, can I just use free tokens without using MiMo-Code?
OpenCode or pi.dev are enough. I don't like CC-style agent lock-in, regardless if it's Anthropic or Xiaomi doing it.
sounds really cool if it was coming out of anywhere except China, which has laws to exfiltrate your data and send it back to the government for espionage purposes [0].
[0] https://www.fbi.gov/investigate/counterintelligence/the-chin...
Isn't Unlimited Context pretty difficult to promise? What exactly do they mean, could I just have two agents locked into a TTRPG back and forth forever?
Do you plan to ask them some master plan to live forever?
You can run the two agents in GAN like loop.. each trying to better the other. Give them a common task like design a better alternative to transformer model that uses max O(nlogn) memory, and the result comes closest to existing n^n implementations.
Good idea actually.. why haven't I tried this before.
That is an incredibly annoying grunge font. And what is the point of the hidden image in the background that reveals under your mouse cursor.
Open source and open weight AI is very important to protect freedom of speech. OpenAI, and ESPECIALLY Anthropic, will try to ban them through regulatory capture / safety fearmongering. We need to make sure that does not happen. It’s not society’s problem if these frontier labs have no moats.
It's interesting that it renders Chinese in a TUI. I wonder if that breaks anything that assumes a character is always a column wide.
Terminals and things that live in terminals have relied on wcwidth() to handle this since time immemorial (which is always fun when they are out of sync, e. g. remote over ssh, but in the vast majority of the cases it just works).
Can this be used as an alternative to Claude backend? For Ralph loops? Replacing `claude -p`? Anyone can shed a light on this?
No 'training data', not 'open source'.
MiMo Code is a coding harness, not the model.
I wonder what the minimum required memory specification is
This is super exciting, can't wait to try it out
macOS binary (mimocode-darwin-arm64.zip ) seems broken: "“mimo” is damaged and can’t be opened. You should move it to the Trash."
No, you are just experiencing the best of Apple. How dare you download non notarized binaries on your own computer? Do you have a license for that?
Terminal > sudo xattr -rd com.apple.quarantine > Drag and drop the app into terminal > enter and enter your password
Strange, I reckon I installed Tahoe just a while ago and still didn't have a similar issue, but I remember on previous MacOS versions the error message for unotarized binaries, was to warn that there was indeed a security issue with the binary, not that it was simply "damaged".
A bit crappy on Apple's side.
Thank you.
"damaged" by not paying 99 bucks.
1 reply →
the OS is broken, in this case.
Only worked for about 5m, then Too many requests.
mimocode gets it. This is actually, impressive! Chinese models are really up there with the rest.
Not bad at all
Yeah, by the way this is also opensource (do not run)
Looks an awful lot like OpenCode
> MiMoCode is built as a fork of OpenCode.
That’s why
Why is OpenCode awful?
He didn't say that.
https://www.merriam-webster.com/dictionary/an%20awful%20lot
Who said that?
looks great. surprised that Xiaomi has made such great advancements in AI
isn't this just opencode with a different logo?
Reading "xiaomi"in the headline I was thrilled to see MiMo only to find out it's NOT this one: https://en.wikipedia.org/wiki/MIMO What a disappointment!
Any english links?
You can change language from the top right-most dropdown, and select English
Top right corner
I got an invite to test their ultra fast model only to be geofenced when trying to use it. Pff!
Yeah that sucks. Now imagine the average Chinese developer who encounters this all the time!
You know they are benchmaxxing when they end up writing their coding harness in TypeScript npm slop
Their models can't help them build it with something better?
That's the only benchmark people need, whether or not their model can raise the bar of their own product
And so far it's looking pretty sad
Today on open source vampires: xaomi forks an open source project, doesn't contribute upstream, attaches usage restrictions that are probably incompatible with the license, and wants good PR. Fuck these people.
[flagged]
[flagged]
[dead]
[flagged]
[dead]
[dead]
[dead]
It was already open-source `https://github.com/anomalyco/opencode`