Claude Cowork first impression: Cowork Deleted 11GB of files [video]

2 days ago (youtube.com)

The YouTuber discovers the deletion around 9m 20s in the video.

Also I don't think many non programmers will even know "rm -rf" command and what it does. So even if a non programmer was doing it command by command by giving permissions, he/she will have a hard time figuring out what those commands do.

Not saying he’s engagement farming but if he were engagement farming what would he have done differently?

The thing is that you can’t actually trust it did run the rm command.

As soon as you ask “give me a list of all the commands that led to the deletion”, isn’t it extremely likely to just invent an rm in there?

Furthermore—and granted, I didn’t watch the video in detail—what data was actually deleted? Maybe the hallucination was that some data was there when it wasn’t, and then Claude convinced itself it deleted something in the move process. Notice that it never says “I accidentally ran rm instead of mv”. That only happens when the user asks to backfill the commands.

Does coworker give Claude access to historical commands, or does Claude just generate based on its “memories”?

I’ve been using Claude quite a bit over the past few weeks, and this is a pattern I’ve noticed a few times.

  • Claude Code is smart enough to search its session traces and give you the real info.

    • Naive question, but isn’t every output token generated in roughly the same, non-deterministic, way? Even if it uses its actual history as context, couldn’t the output still be incorrect?

      Not trolling, asking as a regular user

      2 replies →