This all feels like a race where the model companies try to solve doing work locally in a way that doesn't suck, before the major operating systems companies figure out AI integration into their OS that doesn't suck. It also makes me wonder why Google which has both Gemini and Android can't figure this out, and if there are lessons to draw from that.
Google is historically terrible as a product company (and has succeeded in spite of that) As their technical innovations become less of a moat (we're already there) they won't be able to win on engineering alone (they are no longer winning on engineering alone)
As a person who gets paid to make Chrome (CEF really) do its bidding, I would say Chrome is really as close to an OS as it can get, as in I've found API or service typically an OS or an external tool would provide, that wasn't built into Chrome.
On a semi-related note, I bought a Pixel phone about a month ago, and I'm shocked by how unpolished it is. I've had so many little annoyances pop up, issues I never had on other android phones. Keyboard hiding/appearing when it's not suppose to, bluetooth dropping, WiFi dropping, network switching taking forever, screen becoming unresponsive... It's mostly all small things, but they really start to add up after a while.
It's looking like a slightly updated reskin of chromeOS with gemini features built in.
Definitely not a developer machine based on how they presented it in google IO. So if you write software, it's not looking like it'll be relevant whatsoever. I hope to be proven wrong.
Folks that are interested in a way of doing work locally that doesn't suck, but which integrates LLMs, may be interested in [Barnum](https://barnum-circus.github.io/). The TLDR is that it's a programming language whose frontend is a DSL in TypeScript that is well suited for managing async and parallel work, focused on control flow, from which it is easy to invoke LLMs, and which is easy for LLMs to write. I use it to autonomously ship a very large number of PRs.
The only lesson I'm taking away is that we are still very early in the AI era. AI workflows look entirely different today than they did 18 months ago and I wouldn't bet on them looking the same in 18 months from now.
I absolutely want AI in my OS. I just want it to be one I can trust to serve my interests, and not the company's. I'm literally in the middle of baking one in as I type this.
I mean a couple of websites will claim 1.6gb ram on my device, what would an LLM model cause across millions of devices, when nobody even wants to use it
The VM itself is for Claude Cowork which does all work within the VM sandbox. That doesn't help answer why they spin it up immediately and don't have a way to disable it though. Just the "why it exists" question.
If you're not going to give Claude access to anything on your machine, why are you using Desktop instead of web chat? (Real question, I don't use these much!)
At least in a corporate environment, Claude Desktop is a pretty decent compromise. Preconfigured internally deployed MCP servers and third-party connectors make many of the necessary integrations relatively easy to control.
I use Claude Code CLI myself (inside a VM, to isolate it from the host) for >90% of my needs. For the remaining fraction - email scours, cloud drive searches, other third-party connections - the desktop application is surprisingly decent. I don't even have more than half a dozen connectors enabled. In the VM I have separate, personally managed access tokens available for various third-party services. Wouldn't really try to maintain more than 5-6, otherwise it gets too confusing. [ß]
The desktop application mostly Just Works[tm] with SSO. At least when M365 doesn't suffer from their 4-times-a-day auth outage.
ß: A lot of APIs and authentication systems were designed in the stone age. You either need a 1:1 permissioned access token that can do horrendous damage, or you deal with ultra-granular, confusing and ill-designed scoping jungle where nothing makes sense. Atlassian, I'm looking at you especially. At least an MCP server, provisioned with a reasonably done service account, doesn't have all of your powers to get things wrong with.
I do use Claude Cowork and hence the VM is important, but I also leave the desktop app running all the time since I have many scheduled tasks at different times. The thing is that the VM could shutdown after being idle for some amount of time and then fire back up when you are ready to use it.
There's such a spectrum between "give it everything" and "give it nothing". Imagine you just want to use it to code and want to make sure any commands it runs doesn't mess up your actual machine.
Anthropic has pretty consistently been shitty about how they roll out their software. Extreme lack of engineering rigor and thoughtfulness.
The answer is probably as simple as "no one thought not to do that."
---
I know different people work on these things so I can't do more than guess about how engineering culture cuts across teams, but given the sheer amount of carelessness and sloppiness in Anthropic's software I have to imagine they're burning investor money in training and inference because the code to do it is as bad as the rest of their software.
I didn’t get a screenshot of this, but I just found a really pointed example of Anthropics lack of craft / rush to build. If you open Claude on Windows, and click Dispatch (under cowork) to start that up, it will tell you that you need permissions windows doesn’t have. When you click the buttons for those permissions, it has broken links to macOS system preferences. I really encourage someone to try it and post the images as a reply as I am writing this from my phone.
Rule 1 with making number go up is you eliminate friction at all costs. The user's hard drive is free to you, so there's no reason to gate a feature you want them to use based on that. 98% of them will have no idea you're foisting garbage on them.
RIP, every base model mac from the past 10 years with the <= 256GB SSD. Including the new Neo. When you consider how much of that is eaten up by the system, swap space, caches, reserved space to download OS updates, and apps (2GB a piece is far from uncommon) -- having less than 15GB free is completely unsurprising on that size disk.
Understand that it is annoying to spin up a heavyweight VM whenever running Claude Desktop, but I actually think per-app + agent virtualization is the future. Next version of an app is:
1. micro VM
2. agent on the VM
3. software bundled into the VM
Then the agent is totally sandboxed at the hardware virtualization level. It can use the software tools on the VM or write its own. VM can control which software is "frozen" and which is open to agent modification. And VM can also control which services are exposed outside the VM through sockets, HTTP server, X window system, whatever
It's self-modifying apps that are sealed off from touching parts of the computer they shouldn't.
Yep, I've been using a local vm-centric agent setup for about 3 months, and it works great. I think there is also value in the fact that with a local VM, you can have the same public IP address, so you're not relying on an EC2 EIP that may be blacklisted somewhere.
Back in the day, personalization / customization was all the rage, as it lets the user feel the control, power and freedom. Now it's the opposite. It's about not letting user to have any control at all. I can't delete some junk apps from my phone and mac, because they are "system" apps. As a non-geek, I can't deal with complexity of the browser and account settings to stop it from what is doing. We are at the mercy of the machines.
It hurts to have all this control stripped away. Once upon a time, you bought iLife (suite of iPhoto, iMovie, etc) on a CD or DVD and installed it. Today, you physically cannot delete the Photos app no matter what.
On my work computer, where I never manage any photos, have no iCloud account and never will, I have to keep this app installed and anytime I so much as AirDrop a png to my computer I am prompted to "Add to Photos" with it. No thank you.
The .app is actually only 41MB, so obviously they've moved the majority of it to some mystery-meat libraries or frameworks installed elsewhere anyway.
> Claude Desktop spins up a VM without no way of stopping it
I frequently make this error when I talk. My brain thinks of different ways to phrase what I want to say, but when I speak it starts with one and finishes with another. The result is almost always wrong in the way the title is, ie some variant of a double negation.
Sometimes it happens when I type, though I try to read it multiple times so often catch it.
When you realize that in some languages, for instance, in Spanish, double-negatives are not just tolerated, but correct, it helps you to let go of this particular type of pedantry when it accidentally appears in an English sentence.
This question is answered by the post? There is reportedly actually no way of stopping it happen. Perhaps the poster had a brain fart while typing it. Maybe they speak a different dialect of English from you.
I’ve stopped using Claude on the desktop, just because of how slow the app is to start up and interact with. It’s an absolute clunker; I’m mystified why they can’t ship something that works well given their rhetoric about ai.
They vibecoded it, and admitted as much. Once it was able to self-vibecode, that's all they did. That's why it's written in React and uses gigabytes of RAM as a chat client.
Not only did they decide to write a terminal application in React, but it's 500K lines of code. It's strange because I'm sure Claude is capable of writing a decent TUI in C. It says a lot about the engineering culture at Anthropic, at least on the software side.
I thought they were all in on agentic coding? They are probably just building at a surface level with only an eye towards shipping, without considering the impact of all the changes. I've seen less and less coordination between engineers as well under that model. If that's the case (Claude Code is this way). it is sort of what you get, no matter the rhetoric about "make sure to review all your changes!" It's always trade offs.
I'm with you. I have the Claude web app pinned as a PWA for quick queries, and then use the CLI for everything project-based.
I did consider experimenting with the Routines feature on the desktop app, but I'm leaning towards whipping together something with cron. I saw another poster here who has a daily PR summary routine that I think would be handy, as I have quite a few repos where I'm a sporadic contributor but would like to keep tabs.
It is surprising that the Claude web app lags pretty easily when using either chromium or firefox on ubuntu linux. Chats that delay my laptop work without issues on my ipad or iphone using the app.
The web app is definitely a bit of a problem. IF there is a native app on desktop or if claude cli is much faster, i haven't tried them.
All of the LLM web interfaces have serious lag when typing after a few turns, at least on iOS safari. I’m talking seconds to start rendering input after typing or when it needs to line wrap the input.
...and if a title is incorrect and says the opposite of what's intended, by way of a language misunderstanding or otherwise, it's helpful to note that and get it corrected.
Is it just me, but this feels like Claude gets to have a nigh-impenetrable black box right on your machine and you have no idea what is going on inside it.
After all, the last time I encountered Hyper-V it was in the context of copy protection that prevented crackers from observing or interfering with video game protection
There are lots of good answers in this thread but I think it's because they are AI companies and not UI companies. When you look at tools like AnythingLLM, OpenCode, pi, etc. you see all kinds of different interfaces, and while they might make disagreeable choices at least they do it with intentionality and direction.
They're some of the only new UIs to be made in the last decade. Almost everyone else stays in the browser (or something close like electron- claude code is actually mostly written in React, they couldn't get far from web dev). The problem is they need to interact with the local filesystem, and not many people have built apps for such a wide range of devices in a long time, and of that small talent pool I bet most are corpo coders- moving too slow and to focused on "the right way" to actually ship more than detailed Jira tickets. They also don't have time for stable releases because competition is so fierce.
But I almost always think of things from a talent-pool-first perspective. Perhaps there are actual technical issues like what Boris was referring to.
Luckily for them, every OS has (at least one) native way of building applications, and with the power of AI they could easily make 3 different desktop UIs, while reusing the same core logic.
If only there was an easy high-level language that's taught to first year students that allows them to write once, run anywhere.
If they're too lazy to learn java, haxe has hxwidgets[0]. Haxe is pretty damn close to js. If a dev can't handle that, they should turn in their keyboard and get a job that doesn't require a brain, like being a senator or federal judge.
Many people will say it’s because of the slop. I think it’s because they have no product vision. The roadmap is pretty much a random walk, which combined with the velocity of agentic coding is like digging a moat with atomic bombs.
They are moving at breakneck speed deploying on scales most of us can't even imagine. They are working in a space that's completely unexplored where getting information as quickly as possible is preferred above iterating on some feature until it's "done" while your competitor has released fifteen other features, all sucky, but one of which turns out to be a killer and makes a billion bucks overnight.
The weird thing is that this is probably a performance optimization for quick responses when a user asks a question.
My agent harness spins up a VM too, but it spins up on demand, cools down in 10 minutes and warms up when I focus back on the app.
The files it works on actually lives in a mount.
People take more time to type a prompt than the VM takes to spin up on a fast machine and on a slow machine, the cooldown naturally frees RAM back to the machine.
I also discovered this while
noticing my Mac was low on storage, I only clicked on cowork once and after deleting it from the folder i’m scared to
open the cowork tab coz ik it’ll just fill up the space
Apart from you have no idea what's going on in the VM. It's not as it has a virtual terminal. I'll play the skeptic archetype: What's not to say they're transmitting all prompts back HQ?
Don't be naive and don't think they don't already do this.
Why not ask itself and see what it says about it. "Claude, why are you running in a virtual machine and what are you doing?".
Claude transmits all prompts back to HQ as a part of its basic functionality.
If you are using an AI system to read your codebase from your local folder and make changes, whether or not you have a VM running or not is inconsequential. The Claude extension and/or CLI doesn’t need a VM to send code back to the mothership, you’re already running an executable program and granting it directory access.
Whether you trust a company as a vendor is typically based on their privacy policy, EULA, and your contract with them (if applicable). Those are the bits that have legal enforceability.
I have two friends that are using coding agents on Windows, which was surprising to learn.
Edit: yes, with WSL2 I believe in both cases.
I would have assumed almost everyone would get a Mac/Linux computer to use coding agents because Unix is their "native" platform. It's Bash tool calls all the way down.
Does anyone know a source for reliable data on what coding agent apps devs are using? How many are using Code Claude CLI vs Claude Desktop, etc?
I mean I’m using coding agents on windows, because I’m not just going to learn a whole new operating system just to make robots write code for me.
I want tools that meet me where I’m at, not tools that demand I change up my entire UX to interact with them.
The assumption is not “what’s wrong with Windows that it doesn’t work with <technology>,” more “what’s wrong with <technology> that it doesn’t work with Windows”
Why wouldn’t you want your thing to be cross platform
and on my Mac any time I accidentally click Cowork which I don't use whatsoever, it re-makes the same VM, without asking me. It's one of the dumbest things ever. You're about to hijack nearly 20GB of my storage (which gets eaten up as it is) and you don't think to ask me if I even want the VM before you shove one into my system?
This all feels like a race where the model companies try to solve doing work locally in a way that doesn't suck, before the major operating systems companies figure out AI integration into their OS that doesn't suck. It also makes me wonder why Google which has both Gemini and Android can't figure this out, and if there are lessons to draw from that.
Google is historically terrible as a product company (and has succeeded in spite of that) As their technical innovations become less of a moat (we're already there) they won't be able to win on engineering alone (they are no longer winning on engineering alone)
As a person who gets paid to make Chrome (CEF really) do its bidding, I would say Chrome is really as close to an OS as it can get, as in I've found API or service typically an OS or an external tool would provide, that wasn't built into Chrome.
On a semi-related note, I bought a Pixel phone about a month ago, and I'm shocked by how unpolished it is. I've had so many little annoyances pop up, issues I never had on other android phones. Keyboard hiding/appearing when it's not suppose to, bluetooth dropping, WiFi dropping, network switching taking forever, screen becoming unresponsive... It's mostly all small things, but they really start to add up after a while.
How are Google products anything but outstanding in their categories? What are you comparing to?
51 replies →
They are releasing AluminumOS with their Googlebooks, which is a AI forward OS. If its good or not we have yet to see.
It's looking like a slightly updated reskin of chromeOS with gemini features built in.
Definitely not a developer machine based on how they presented it in google IO. So if you write software, it's not looking like it'll be relevant whatsoever. I hope to be proven wrong.
9 replies →
"What do you mean, an Aluminum Falcon?!"
1 reply →
Google is probably already doing and releasing the most actual research into this (like the work that went into Gemma 4)
Microsoft has Copilot and Windows. Look what happened.
Folks that are interested in a way of doing work locally that doesn't suck, but which integrates LLMs, may be interested in [Barnum](https://barnum-circus.github.io/). The TLDR is that it's a programming language whose frontend is a DSL in TypeScript that is well suited for managing async and parallel work, focused on control flow, from which it is easy to invoke LLMs, and which is easy for LLMs to write. I use it to autonomously ship a very large number of PRs.
The only lesson I'm taking away is that we are still very early in the AI era. AI workflows look entirely different today than they did 18 months ago and I wouldn't bet on them looking the same in 18 months from now.
> why Google which has both Gemini and Android can't figure this out,
Not the first time an incumbent has four aces in hand and appears to be entirely unable to make anything of it.
> and if there are lessons to draw from that
Lesson 1: doing shit is hard
Lesson 2: money rules so milking the cow wins over taking the slightest risk
Don't you read HN? Nobody wants AI in their OS, especially in Windows. Common complaint that Microsoft is forcing AI into every corner of Windows.
I absolutely want AI in my OS. I just want it to be one I can trust to serve my interests, and not the company's. I'm literally in the middle of baking one in as I type this.
2 replies →
I mean a couple of websites will claim 1.6gb ram on my device, what would an LLM model cause across millions of devices, when nobody even wants to use it
The VM itself is for Claude Cowork which does all work within the VM sandbox. That doesn't help answer why they spin it up immediately and don't have a way to disable it though. Just the "why it exists" question.
If you're not going to give Claude access to anything on your machine, why are you using Desktop instead of web chat? (Real question, I don't use these much!)
If you are, obviously you need the VM.
At least in a corporate environment, Claude Desktop is a pretty decent compromise. Preconfigured internally deployed MCP servers and third-party connectors make many of the necessary integrations relatively easy to control.
I use Claude Code CLI myself (inside a VM, to isolate it from the host) for >90% of my needs. For the remaining fraction - email scours, cloud drive searches, other third-party connections - the desktop application is surprisingly decent. I don't even have more than half a dozen connectors enabled. In the VM I have separate, personally managed access tokens available for various third-party services. Wouldn't really try to maintain more than 5-6, otherwise it gets too confusing. [ß]
The desktop application mostly Just Works[tm] with SSO. At least when M365 doesn't suffer from their 4-times-a-day auth outage.
ß: A lot of APIs and authentication systems were designed in the stone age. You either need a 1:1 permissioned access token that can do horrendous damage, or you deal with ultra-granular, confusing and ill-designed scoping jungle where nothing makes sense. Atlassian, I'm looking at you especially. At least an MCP server, provisioned with a reasonably done service account, doesn't have all of your powers to get things wrong with.
1 reply →
I do use Claude Cowork and hence the VM is important, but I also leave the desktop app running all the time since I have many scheduled tasks at different times. The thing is that the VM could shutdown after being idle for some amount of time and then fire back up when you are ready to use it.
There's such a spectrum between "give it everything" and "give it nothing". Imagine you just want to use it to code and want to make sure any commands it runs doesn't mess up your actual machine.
It mounts specified directories into the vm from what I remember
Probably because they vibecoded it
Correct and they have no idea what they are doing.
1 reply →
Anthropic has pretty consistently been shitty about how they roll out their software. Extreme lack of engineering rigor and thoughtfulness.
The answer is probably as simple as "no one thought not to do that."
---
I know different people work on these things so I can't do more than guess about how engineering culture cuts across teams, but given the sheer amount of carelessness and sloppiness in Anthropic's software I have to imagine they're burning investor money in training and inference because the code to do it is as bad as the rest of their software.
It kind of does though. If you want to use the product they'll need the sandbox ready.
I didn’t get a screenshot of this, but I just found a really pointed example of Anthropics lack of craft / rush to build. If you open Claude on Windows, and click Dispatch (under cowork) to start that up, it will tell you that you need permissions windows doesn’t have. When you click the buttons for those permissions, it has broken links to macOS system preferences. I really encourage someone to try it and post the images as a reply as I am writing this from my phone.
I don't know where's a good place to post a screenshot, but can confirm I get a dialog saying:
> Get an app to open this 'x-apple.systempreferences' link
> Your PC doesn't have an app that can open this link. Try looking for a compatible app in the Microsoft Store.
I won't understand why Cowork isn't simply opt-in. It also installs a ~10GB vm bundle which you cannot remove
https://www.reddit.com/r/ClaudeAI/comments/1rlc71n/claude_de...
Rule 1 with making number go up is you eliminate friction at all costs. The user's hard drive is free to you, so there's no reason to gate a feature you want them to use based on that. 98% of them will have no idea you're foisting garbage on them.
RIP, every base model mac from the past 10 years with the <= 256GB SSD. Including the new Neo. When you consider how much of that is eaten up by the system, swap space, caches, reserved space to download OS updates, and apps (2GB a piece is far from uncommon) -- having less than 15GB free is completely unsurprising on that size disk.
It was on my machine at least, I remember I had to do an additional install to activate that tab...
Understand that it is annoying to spin up a heavyweight VM whenever running Claude Desktop, but I actually think per-app + agent virtualization is the future. Next version of an app is:
Then the agent is totally sandboxed at the hardware virtualization level. It can use the software tools on the VM or write its own. VM can control which software is "frozen" and which is open to agent modification. And VM can also control which services are exposed outside the VM through sockets, HTTP server, X window system, whatever
It's self-modifying apps that are sealed off from touching parts of the computer they shouldn't.
Yep, I've been using a local vm-centric agent setup for about 3 months, and it works great. I think there is also value in the fact that with a local VM, you can have the same public IP address, so you're not relying on an EC2 EIP that may be blacklisted somewhere.
Back in the day, personalization / customization was all the rage, as it lets the user feel the control, power and freedom. Now it's the opposite. It's about not letting user to have any control at all. I can't delete some junk apps from my phone and mac, because they are "system" apps. As a non-geek, I can't deal with complexity of the browser and account settings to stop it from what is doing. We are at the mercy of the machines.
It hurts to have all this control stripped away. Once upon a time, you bought iLife (suite of iPhoto, iMovie, etc) on a CD or DVD and installed it. Today, you physically cannot delete the Photos app no matter what.
On my work computer, where I never manage any photos, have no iCloud account and never will, I have to keep this app installed and anytime I so much as AirDrop a png to my computer I am prompted to "Add to Photos" with it. No thank you.
The .app is actually only 41MB, so obviously they've moved the majority of it to some mystery-meat libraries or frameworks installed elsewhere anyway.
Isn’t it good that it spins up without no way of stopping it? Why would it be a problem that we do have a way of stopping it?
> Claude Desktop spins up a VM without no way of stopping it
I frequently make this error when I talk. My brain thinks of different ways to phrase what I want to say, but when I speak it starts with one and finishes with another. The result is almost always wrong in the way the title is, ie some variant of a double negation.
Sometimes it happens when I type, though I try to read it multiple times so often catch it.
When you realize that in some languages, for instance, in Spanish, double-negatives are not just tolerated, but correct, it helps you to let go of this particular type of pedantry when it accidentally appears in an English sentence.
All your RAM are belong to us
Op is nitpicking on the poorly written title. I came here to find that comment :)
This question is answered by the post? There is reportedly actually no way of stopping it happen. Perhaps the poster had a brain fart while typing it. Maybe they speak a different dialect of English from you.
There's no dialect of English in which this is correct.
3 replies →
[dead]
I agree. Why is this a problem?
Vibecoded with AGI, production ready.
Classic Anthropic, this comes across as LLM coded nonsense.
I think the title should be changed. Either with no way of stopping it, or without any way of stopping it.
[dead]
I’ve stopped using Claude on the desktop, just because of how slow the app is to start up and interact with. It’s an absolute clunker; I’m mystified why they can’t ship something that works well given their rhetoric about ai.
They vibecoded it, and admitted as much. Once it was able to self-vibecode, that's all they did. That's why it's written in React and uses gigabytes of RAM as a chat client.
Not only did they decide to write a terminal application in React, but it's 500K lines of code. It's strange because I'm sure Claude is capable of writing a decent TUI in C. It says a lot about the engineering culture at Anthropic, at least on the software side.
4 replies →
You say that as if somehow the trend for cross platform desktop apps to be ridiculously bloated bundles of browser overlays is new?
What major cross platform app isn't based on Electron or Tauri? Slack, Discord, VS Code, Teams, Notion...
5 replies →
I thought they were all in on agentic coding? They are probably just building at a surface level with only an eye towards shipping, without considering the impact of all the changes. I've seen less and less coordination between engineers as well under that model. If that's the case (Claude Code is this way). it is sort of what you get, no matter the rhetoric about "make sure to review all your changes!" It's always trade offs.
I uninstalled it because I have no need for Claude Desktop and there’s no way to keep the 10+ GiB VM image off of my machine
I'm with you. I have the Claude web app pinned as a PWA for quick queries, and then use the CLI for everything project-based.
I did consider experimenting with the Routines feature on the desktop app, but I'm leaning towards whipping together something with cron. I saw another poster here who has a daily PR summary routine that I think would be handy, as I have quite a few repos where I'm a sporadic contributor but would like to keep tabs.
It is surprising that the Claude web app lags pretty easily when using either chromium or firefox on ubuntu linux. Chats that delay my laptop work without issues on my ipad or iphone using the app.
The web app is definitely a bit of a problem. IF there is a native app on desktop or if claude cli is much faster, i haven't tried them.
All of the LLM web interfaces have serious lag when typing after a few turns, at least on iOS safari. I’m talking seconds to start rendering input after typing or when it needs to line wrap the input.
Guaranteed nobody is reading the code being merged in. It's vibes all the way down.
Are you not entertained? Is this not why you are here?
"without no way" of stopping it?
Incredibly insightful. Not everyone speaks English as a first language. On top of that, the title is not ambiguous.
...and if a title is incorrect and says the opposite of what's intended, by way of a language misunderstanding or otherwise, it's helpful to note that and get it corrected.
You're right, it's not ambiguous. It's literally incorrect.
lol :D fair enough
Sorry if that came across as snarky!
Is it just me, but this feels like Claude gets to have a nigh-impenetrable black box right on your machine and you have no idea what is going on inside it.
After all, the last time I encountered Hyper-V it was in the context of copy protection that prevented crackers from observing or interfering with video game protection
Why are the UIs of the AI companies all broken in multiple ways?
There are lots of good answers in this thread but I think it's because they are AI companies and not UI companies. When you look at tools like AnythingLLM, OpenCode, pi, etc. you see all kinds of different interfaces, and while they might make disagreeable choices at least they do it with intentionality and direction.
They're some of the only new UIs to be made in the last decade. Almost everyone else stays in the browser (or something close like electron- claude code is actually mostly written in React, they couldn't get far from web dev). The problem is they need to interact with the local filesystem, and not many people have built apps for such a wide range of devices in a long time, and of that small talent pool I bet most are corpo coders- moving too slow and to focused on "the right way" to actually ship more than detailed Jira tickets. They also don't have time for stable releases because competition is so fierce.
But I almost always think of things from a talent-pool-first perspective. Perhaps there are actual technical issues like what Boris was referring to.
Luckily for them, every OS has (at least one) native way of building applications, and with the power of AI they could easily make 3 different desktop UIs, while reusing the same core logic.
If only there was an easy high-level language that's taught to first year students that allows them to write once, run anywhere.
If they're too lazy to learn java, haxe has hxwidgets[0]. Haxe is pretty damn close to js. If a dev can't handle that, they should turn in their keyboard and get a job that doesn't require a brain, like being a senator or federal judge.
[0] https://haxeui.org/getting-started/haxeui-hxwidgets/
They are dogfooding their products like you wouldnt believe
They are releasing at breakneck pace, it's pretty funny how vibed their products feel sometimes
Many people will say it’s because of the slop. I think it’s because they have no product vision. The roadmap is pretty much a random walk, which combined with the velocity of agentic coding is like digging a moat with atomic bombs.
I find this analogy particularly humorous, as atomic bombs do not make for good excavators
1 reply →
> Why are the UIs of the AI companies all broken in multiple ways?
Because they're vibe-coded ultra sloppy code. And it really shows.
Dogfood
No one left who could fix anything here by hand. Being able to handcraft compelling desktop apps and their plumbing is not a marketable skill anymore.
Mythos, Fable, please do the thing with the VM. Make no mistakes.
They are moving at breakneck speed deploying on scales most of us can't even imagine. They are working in a space that's completely unexplored where getting information as quickly as possible is preferred above iterating on some feature until it's "done" while your competitor has released fifteen other features, all sucky, but one of which turns out to be a killer and makes a billion bucks overnight.
Whatever you say, account created 9 minutes ago with 1 comment posted 6 minutes ago praising AI companies.
1 reply →
I had my admin disable cowork a month ago and that worked. Has it resumed since then?
Is Claude self-replicating in an attempt for world domination?
lol, why even use Claude desktop? I want Claude code to stop eating up 10s of gb of virtual memory
The weird thing is that this is probably a performance optimization for quick responses when a user asks a question.
My agent harness spins up a VM too, but it spins up on demand, cools down in 10 minutes and warms up when I focus back on the app.
The files it works on actually lives in a mount.
People take more time to type a prompt than the VM takes to spin up on a fast machine and on a slow machine, the cooldown naturally frees RAM back to the machine.
How come Claude Code still hasn't triaged and fixed this? Feed it the bug link, someone.
I also discovered this while noticing my Mac was low on storage, I only clicked on cowork once and after deleting it from the folder i’m scared to open the cowork tab coz ik it’ll just fill up the space
As long as the VM closes when the application closes, I don’t see too much of an issue with this design decision.
It seems like the VM is a core part of how you use the application.
Apart from you have no idea what's going on in the VM. It's not as it has a virtual terminal. I'll play the skeptic archetype: What's not to say they're transmitting all prompts back HQ?
Don't be naive and don't think they don't already do this.
Why not ask itself and see what it says about it. "Claude, why are you running in a virtual machine and what are you doing?".
/shrug
Claude transmits all prompts back to HQ as a part of its basic functionality.
If you are using an AI system to read your codebase from your local folder and make changes, whether or not you have a VM running or not is inconsequential. The Claude extension and/or CLI doesn’t need a VM to send code back to the mothership, you’re already running an executable program and granting it directory access.
Whether you trust a company as a vendor is typically based on their privacy policy, EULA, and your contract with them (if applicable). Those are the bits that have legal enforceability.
Safari > Add to Dock > done
They must not have used Fable 5 to vibecode that part of Claude Desktop, VMs are strictly forbidden high stakes cybersecurity work.
It's becoming self-aware! Quick, lock down the nuclear codes!
I have two friends that are using coding agents on Windows, which was surprising to learn.
Edit: yes, with WSL2 I believe in both cases.
I would have assumed almost everyone would get a Mac/Linux computer to use coding agents because Unix is their "native" platform. It's Bash tool calls all the way down.
Does anyone know a source for reliable data on what coding agent apps devs are using? How many are using Code Claude CLI vs Claude Desktop, etc?
Wsl2.0 is literally a Linux vm built into windows. I imagine some people are using that.
Are you sure they're not using WSL2 (which is Linux, not Windows)?
Yes, sure, they're using Linux within a virtual machine (WSL2).
I mean I’m using coding agents on windows, because I’m not just going to learn a whole new operating system just to make robots write code for me.
I want tools that meet me where I’m at, not tools that demand I change up my entire UX to interact with them.
The assumption is not “what’s wrong with Windows that it doesn’t work with <technology>,” more “what’s wrong with <technology> that it doesn’t work with Windows”
Why wouldn’t you want your thing to be cross platform
> without no way
Not no way not no how!
I've stopped using cc a while ago, because it always comes up with new surprises like that.
Please edit the title.
Currently "Claude Desktop spins up a VM without no way of stopping it"
Should be "Claude Desktop spins up a VM with no way of stopping it"
There are some English dialects with negative concord, meaning that to form the negation of a sentence, you negate all negateable words in it
Yeah, no.
the vm makes sense for cowork but no off switch is weird. a visible sandbox on/off toggle would do more for trust than any safety blogpost imo
i had to uninstall it due to the vm taking around 12G of disk, never touched Cowork. didn't realize they were also launching it
People trust skynet.
People are very foolish. The younger generation needs to watch the Terminator franchise - it is all explained there.
with no way or without no way?
and on my Mac any time I accidentally click Cowork which I don't use whatsoever, it re-makes the same VM, without asking me. It's one of the dumbest things ever. You're about to hijack nearly 20GB of my storage (which gets eaten up as it is) and you don't think to ask me if I even want the VM before you shove one into my system?
When was the last time Claude's C Compiler was updated? 4 months ago? [0]
It is written in Rust™, surely it is better than the rest of them.
[0] https://github.com/anthropics/claudes-c-compiler
it took up 12gb on mine
13 GIGS! Between that and the absorbent space MACOS sucks up, it's challenging.
kill -9
[flagged]
[dead]
[dead]
[flagged]
So a company which has access to practically unlimited tokens and their best models makes crappy software. Huh who would've thought?
/s