Comment by seizethecheese

1 day ago

I'm somewhat surprised that this is not open source (from what I can tell). Compare to MiMo Code https://github.com/XiaomiMiMo/MiMo-Code (which is a CLI, while this is a desktop app).

82 comments


SwellJoe 1 day ago

I don't even know what I would do with a desktop app. I'm running these things in headless VMs, so I can run them with `--dangerously-skip-permissions` or whatever. I don't trust them, even without that flag, on my desktop/laptop.

teaspoon 1 day ago
Good desktop apps in this category can manage agents across any number of remote SSH hosts.
- SwellJoe 1 day ago
  
  But, it's still running on my desktop/laptop. I don't trust them to run on my machine. But, I guess I could run one VM with a desktop to contain the desktop app. Or, just keep using CLI agents.
  
- mattnewton 19 hours ago
  
  But then I close my laptop and it’s not running on the headless host anymore, right?
  
- htrp 18 hours ago
  
  Examples here?
- nutjob2 1 day ago
  
  What's stopping a CLI from doing the same?
  I've never used IDEs and never will, why are these things being constantly shoved down our throats?
nicoty 13 hours ago
I've contributed to https://github.com/0xferrous/agent-box which allows you to bind-mount git repositories into containers that agents operate in, preventing the agents from accessing files that aren't bind-mounted. Your usual .gitignore can then be used to also ignore files within the repo to be bind-mounted, which prevents agents from accessing them at all, essentially working as a sandbox.
I also maintain https://github.com/nothingnesses/agent-images which allows you to use Nix to reproducibly spin up OCI container images containing agents and any other tools you need for development and use these with agent-box.
I use both at the moment to work on some personal projects with agents, where I set up multiple separate git worktrees for the agents to work in, preventing them from accessing anything outside of the worktrees and from trampling over each other's work.
- raphinou 13 hours ago
  
  In case anyone is interested, I'm also using bash scripts to run my agents in containers. It's simple, but has only bash and docker as dependency: https://github.com/asfaload/agents_container
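The bind-mount idea in this subthread can be sketched roughly as follows. This is not how agent-box or agents_container actually work internally; it is a minimal illustration that only builds a `docker run` command line, assuming one git worktree per agent. The function name `sandbox_command`, the image name, the `agent-cli` command, and the `/workspace` target are all hypothetical.

```python
import shlex
from pathlib import Path


def sandbox_command(worktree: str, image: str, agent_cmd: list[str]) -> list[str]:
    """Build a `docker run` argv that mounts a single worktree for the agent."""
    host_path = Path(worktree).resolve()  # Docker bind mounts need an absolute path
    return [
        "docker", "run", "--rm", "-it",
        # The worktree is the only host directory mounted into the container.
        "--mount", f"type=bind,source={host_path},target=/workspace",
        "--workdir", "/workspace",
        image,
        *agent_cmd,
    ]


if __name__ == "__main__":
    # Hypothetical worktree, image, and agent command, for illustration only.
    cmd = sandbox_command("./worktrees/agent-a", "example/agent-image:latest", ["agent-cli"])
    print(shlex.join(cmd))
```

Giving each agent its own worktree and its own container means one agent cannot see another agent's checkout or anything else on the host that was not explicitly mounted.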
ahmadyan 21 hours ago

A well-designed IDE should abstract that away, i.e. run the agent in the headless VMs while giving you an abstraction that feels like running the agent locally, with all the benefits (editor, browser, diffs, debugger, etc.).
knocte 14 hours ago

I shared your fear some weeks/months ago so I was always using my harness in the cloud. However, latency started to become an issue when I traveled to other countries where I needed a VPN... so I ended up cooking skynot to be able to trust running my harness on my own computer: https://github.com/tarsgate/skynot (PRs welcome if you want to add support for a harness other than Pi)
Scrounger 12 hours ago

> I'm running these things in headless VMs
What's your setup like and what do you use it for?
I have an M2 Max MBP with plenty of RAM and I use VSCode + Zoo Code plugin with Qwen3-Coder-Next-GGUF:UD-Q4_K_XL to run local agentic coding sessions, but I'm intrigued by being able to run headless as I could probably run multiple instances in parallel to do stuff?
Like are you using UTM with some pre-built VM and a local LLM?
Curious.
LuD1161 13 hours ago
Might wanna check out https://github.com/LuD1161/agentjail - policy guardrails for coding agents.
shameless self-plug. I've been dogfooding it for the last 3 weeks now.
- jpeeler 7 hours ago
  
  Looks similar to https://github.com/nolabs-ai/nono. Maybe one day you can fill out https://github.com/LuD1161/agentjail/issues/10 with a comparison to that project too.
InsideOutSanta 1 day ago

Zcode allows you to connect to a Docker container, or to a VM using ssh.
FergusArgyll 1 day ago

I finally repurposed an old server just for that and for anyone reading who has not had a chance to use --dangerously-etc. it's awesome, do it :)
aussieguy1234 18 hours ago

I just back up my entire home folder to another device, then let it rip

dizhn 1 day ago

It's only a cli because they yanked out the opencode desktop code. (As well as the opencode go/zen model provider)

Edit: my theory is they wanted to mimic being the primary provider in a quick way with a lot of string replace. Though they could have added opencode back as a regular provider.

versteegen 20 hours ago
MiMo Code adds a lot of cool orchestration features to OpenCode! It definitely is NOT a quick find-replace job, it's genuinely someone's research project to create a better agent harness building on top of free software, and that's awesome. See https://mimo.xiaomi.com/blog/mimo-code-long-horizon
- dizhn 10 hours ago
  
  They did remove the opencode provider though and the desktop and web interfaces. I was trying to be charitable.
  By the way, their repo was a bit weird with no changelogs at all. It seems to be picking up speed now with their communication. I actually read in the changelog just now that their Compose (plan/execute/review etc., something like that) flow is now deterministic with software instead of just prompts. That could be really good.

cco 1 day ago

You're surprised? I think harnesses are almost as important as the underlying model. Folks have been able to improve benchmark results by nearly 2x based on harness alone.

Harnesses are quickly becoming critical components of the "model" itself imo. Not shocking to me at all that a company that spots a revenue opportunity is keeping its harness closed source.

MrDrMcCoy 17 hours ago
I'm a neophyte. What makes a harness special or all that unique from another? I've had a reasonable experience with Zed and local models, but could be persuaded to put something else in the mix if there is a measurable benefit to be had.
- tl 17 hours ago
  
  Simple example: a while back LLMs would trip over questions like "how many Rs are in strawberry". Now, the system prompts have a line like "when a user asks for a count, actually count the value by calling a tool if needed". The LLMs cannot get smarter in this regard, next token predictors will hallucinate here.
  A harness is the thing that covers every blind spot or sub-optimal but probable output people have hit in the wild, and a lot of problems just have better solutions if you say "break problem A into subproblem B and subproblem C, then solve".
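The counting example above can be made concrete with a small sketch. It assumes a harness that receives a tool call from the model as a plain dictionary and runs ordinary code instead of letting the model guess. The tool name `count_letter`, the `TOOLS` registry, and the call format are hypothetical and not taken from any particular harness.

```python
from typing import Callable


def count_letter(text: str, letter: str) -> int:
    """Deterministically count case-insensitive occurrences of one letter."""
    return text.lower().count(letter.lower())


# Hypothetical tool registry: tool name -> plain Python function.
TOOLS: dict[str, Callable[..., int]] = {"count_letter": count_letter}


def dispatch(tool_call: dict) -> int:
    """Run the tool named in a model-emitted call and return its result."""
    func = TOOLS[tool_call["name"]]
    return func(**tool_call["arguments"])


if __name__ == "__main__":
    # The shape a model might emit when asked "how many Rs are in strawberry".
    call = {"name": "count_letter", "arguments": {"text": "strawberry", "letter": "R"}}
    print(dispatch(call))  # prints 3
```

The model only has to decide that a count is needed and fill in the arguments; the answer itself comes from code, which is the kind of blind spot a harness can cover.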
bermudi 18 hours ago
Source? On the most trusted benchmark right now (deepSWE), models score as well or better with their minimal harness than with CC or codex.
- theshrike79 1 hour ago
  
  deepSWE clearly doesn't need complex tool calling?

LaurensBER 1 day ago

They might be sending some user requests to Anthropic to gather training data for their own models. If so, perhaps they need to add some tracer to requests that they would prefer to hide.

bermudi 18 hours ago
I wonder if you're as cynical and distrustful of American companies as well, or is it more of a racism kinda thing?
- MrDrMcCoy 17 hours ago
  
  Everyone should distrust them equally. Only local agents in a detached network namespace are safe from data leaks. It is perfectly reasonable to assume they are using our sessions to train on, since everything else short of nuclear launch codes is already there, and they need to keep feeding it.
- LaurensBER 13 hours ago
  
  This is an extremely weird comment that doesn't add anything to the conversation.
  Here on HN we discuss facts, jumping straight into racism has no place here.
fwip 1 day ago
Wireshark would catch that easy-peasy.
- benatkin 21 hours ago
  
  The request would need to be done from their service, so as not to expose the API key, and because it just makes sense. They could probably directly proxy it and Wireshark couldn't catch it, due to everything being HTTPS. But people could probably catch it by decompiling, so it would make more sense to have the server make the request as part of a GLM request. Not that I think this is plausible - I'm not sure.
bogdan 1 day ago
Source? Or is it "trust me bro"?
- DonsDiscountGas 1 day ago
  
  "might" means pure speculation
- embedding-shape 1 day ago
  
  Literally just FUD unless someone has code to point at.
  
jijji 19 hours ago
or more likely, sending it to the CCP
- neonstatic 19 hours ago
  
  Californian Communist Party?
  

saghm 1 day ago

Given that there's such severe concern being expressed by Anthropic about Claude being distilled, and the idea that the harness is part of the moat, it doesn't seem super surprising that the other side of that would try to also make it harder for them to tell how well they're doing and what their approach is.

JSR_FDED 21 hours ago

Unlikely considering they’re publishing the Crown Jewels (GLM 5.2) as open weights.
lelanthran 13 hours ago
> and the idea that the harness is part of the moat,
That idea is wrong, though. These same people thinking harnesses are part of a moat are also boasting that s/ware is easily writable now.
There's no secret sauce in a harness that you can't vibe-code into your own harness.
- saghm 5 hours ago
  
  Why don't the major players open source their harnesses then? As far as I'm aware, the only time the source code for the Claude harness became available, it was due to a mistake (which is its own whole thing).
  I'm not saying you're wrong necessarily, but I do think that when the actions and words of a company conflict, it's a pretty safe bet that the words are just posturing and the actions better reflect their actual belief. In this case, regardless of what they're saying about software being easily writable now, they clearly seem to at least think there's something valuable in the harness if they're not open sourcing it.
- jorisw 11 hours ago
  
  > vibe-code into your own
  Except you'd need the knowledge of what to vibe-code, no?
  

maxloh 1 day ago

I don't find a closed-source Chinese agent system trustworthy.

It is essentially a black box with full user permissions, meaning you are just handing over your entire system to a Chinese-owned server. With OpenCode and its GLM provider, at least I can monitor which files were read, which were edited, and what commands were executed.

Not to mention that Chinese national security laws legally obligate companies to cooperate with state intelligence and counter-espionage efforts [0]. If you have this installed on a corporate workstation, and your company is large enough, the possibility of them spying on you is not just a risk—it's almost a certainty.

[0]: https://en.wikipedia.org/wiki/National_Intelligence_Law_of_t...

sabedevops 1 day ago

You shouldn’t find American ones trustworthy either.

_pdp_ 1 day ago

I am not surprised it is not open source. These harnesses are hard to build - they are not just wrappers - and often they contain business logic that is not suitable for public distribution for all kinds of reasons.

NamlchakKhandro 21 hours ago
hard? wut lol....
no. they. are. not.
Some people are just terrible at it.
- _pdp_ 19 hours ago
  
  I used to think the same, but I changed my mind.
  Also you don't need to believe me. There is enough evidence in the open source space.

addozhang 15 hours ago

I'd prefer a CLI over a desktop. But then why don't I just use OpenCode?

anderber 20 hours ago

That looks to be a copy of OpenCode

russelg 20 hours ago

A fork, yes.