Comment by jillesvangurp
7 days ago
The meta issue here with AI companies is that they while they excel at producing LLMs, they don't have any inherent advantages when eating their own dog food and it shows in the quality and UX of their products.
Both Anthropic and OpenAI have been maintaining a high pace of releasing often poorly thought through new products and experimenting with features. A lot of their product releases show all the hallmarks of vibe coding: randomly breaking features, poor QA and testing on releases, etc.
OpenAI seems to have the upper hand in UX currently. Their products feel a bit more polished and they've clearly tried to up their game. Taking over Jony Ive's company a few months ago is a clear signal that they want to do better. The Codex AI desktop app was a clear step up from their web app and cli. I've been using both before that was released.
Both companies are spread very thin trying to do both end user and developer oriented products and features while keeping existing paying users happy as well. Both companies also have had a string of rushed product releases that kind of fizzled out: OpenAI's Atlas, which was a response to Anthropic's Comet. Neither of which seem to be very popular at this point. Several false starts with apps (OpenAI), Claude Cowork, etc. There are a lot of half formed product ideas there that than don't get the attention they deserve.
And it's not like MS, Google, and Apple are any better. If anything they are more hesitant and out of their depth here. They are all dancing around the hard issues here which are UX and security/trust models. Also, while coders get a lot of toys, nailing agentic tools for business users is proving to be a lot harder. Blanket access to everything via an agentic browser is not a viable solution. I can agentically code a structured document via latex or markdown. But the same tools are relatively useless in spreadsheets, presentations, and documents. And while you can do a lot of potentially interesting things if you surrender your inbox, the security failure modes around that remain a show stopping obstacle for wide adoption.
There's a lot of stage fright, hesitation, and immature product management in this sector. There's a bit of gold rush in terms of rapid experimentation. But as the stakes get higher, a lot of these companies are increasingly lacking the freedom to move as fast as needed. Fear of liability issues is preventing them to do a lot. Which is why most progress is concentrated around developer tools.
No comments yet
Contribute on Hacker News ↗