← Back to context

Comment by kuerbel

8 hours ago

Might be but I just can't imagine a customer being fine with a loose cannon agent in their environment. E.g. coding agents are ignoring instructions. Who is to say that Claudes solution to a, say, slow backup isn't deleting the backup?

Imagine an agent shadowing all your terminals, providing ideas and asking to run commands that will let it verify the hypotheses it comes up with, while at the same time doing research on vendor docs, etc...

Quite safe, and already a force multiplier - this would be a harness. Maybe have it be able to write to a shadow system with similar (ideally same) hardware to verify it's hypothesis on how the system works, etc...