Comment by IntelliAvatar

2 months ago

This looks great.

One thing I’ve been bitten by with desktop agents is execution-time safety: the plan is correct, but a single malformed path or OS call causes real damage.

Do you enforce any guardrails at the tool boundary (e.g. path sandboxing, network allowlists, dry-run / replay)?

Curious how you’re thinking about this.

2 comments

IntelliAvatar

schnetzlerjoe 2 months ago

Phenomenal questions. Sandboxing would be a PHENOMENAL idea. And allowlist it currently is capable of this but does require code changes so configuration based would probably be more what you are referring to?

The replay feature is similar to the record feature. It's not a "guardrail" I would say though.

All stuff that definitely would be great idea.

IntelliAvatar 2 months ago

Makes sense, thanks for the clarification.
I mostly worry about the gap between a correct plan and execution-time behavior — especially when tools touch the filesystem or OS APIs. Even a single malformed argument can have irreversible effects.
Totally agree these guardrails are non-trivial, but it’s great to see the project thinking in this direction.