Comment by brianush1

19 hours ago

claude is stupid but not malicious; chroot is sufficient

8 comments

brianush1

Sure, it's not malicious. But it is very eager to get things done, and surprisingly inventive and knowledgeable in all kinds of workarounds.

furyofantares 19 hours ago

Many times I've seen Claude try to execute a command it's not supposed to; the harness prevents it, and then it writes and executes a Python script to do the same thing.
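
To make that concrete, here is a minimal sketch of why a check on command names alone tends not to hold. Everything in it is an assumption for illustration: `BLOCKED_COMMANDS` and `is_command_allowed` are hypothetical names, and a real harness may filter quite differently. It only shows that a denylist keyed on the executable name rejects the direct command but passes an equivalent one routed through an interpreter.

```python
import shlex

# Hypothetical denylist; a real harness's rules will differ.
BLOCKED_COMMANDS = {"rm", "curl", "ssh"}


def is_command_allowed(command_line: str) -> bool:
    """Allow a command unless its executable name is on the denylist."""
    argv = shlex.split(command_line)
    if not argv:
        return False
    # Compare only the executable's base name, ignoring any leading path.
    executable = argv[0].rsplit("/", 1)[-1]
    return executable not in BLOCKED_COMMANDS


# The direct command is rejected by name...
direct = "rm notes.txt"
# ...but the same effect expressed through the interpreter is not.
workaround = "python3 -c \"import os; os.remove('notes.txt')\""

print(is_command_allowed(direct))      # False
print(is_command_allowed(workaround))  # True
```

Nothing here actually runs either command; the point is only that the filter sees `python3`, not what the script does.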

j16sdiz 17 hours ago
Breaking a chroot takes more than that...
- furyofantares 6 hours ago
  
  How much more? Depends on the system, doesn't it? I don't know how many systems have /proc mounted, but don't you get the real root from /proc/self/root?
  Anyway, that's beside the point, which is that it doesn't have to "be malicious" to try to overcome what look like errors on its way to accomplishing the task you asked it to do.
- hoppp 9 hours ago
  
  That doesn't mean Claude can't do it. chroot is better than nothing, but it's not a real solution.
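
On the /proc question in this subthread, a small sketch may help. As far as I understand proc(5), /proc/self/root names the calling process's own root, so from inside a chroot it should resolve to the chroot directory itself; /proc/1/root is the link that names init's root, and following it normally requires privileges. The helper names below (`root_identity`, `looks_chrooted`) are hypothetical, and the comparison is only a heuristic: containers and separate mount or PID namespaces can produce the same result as a chroot.

```python
import os


def root_identity(path):
    """Return (device, inode) for path, or None if it cannot be read."""
    try:
        st = os.stat(path)
    except OSError:
        return None
    return (st.st_dev, st.st_ino)


def looks_chrooted():
    """Heuristic: compare this process's root with PID 1's root.

    Returns True or False when both can be read, and None when /proc is
    not mounted or /proc/1/root is not accessible (the usual case for an
    unprivileged process).
    """
    own = root_identity("/proc/self/root")
    init = root_identity("/proc/1/root")
    if own is None or init is None:
        return None
    return own != init


print(looks_chrooted())
```

If this reading is right, an unprivileged process gets `None` here, which fits the earlier point that breaking out takes more than a readable /proc.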

nofriend 19 hours ago

Malice is not required. If it thinks it is in the right, then it will do whatever it takes to get around limitations.

lxgr 12 hours ago

Until it gets prompt injected. Are you reading every single file your agent reads as part of the tasks you give it, including content fetched from the web or third-party packages?

karhagba 19 hours ago

Claude is far from stupid, in my experience. I've used so many models and Claude is king.