Comment by danielbenzvi

4 hours ago

Interesting. Their versioned storage sandbox seems to be what really sets them apart

I don't get it, it looks like they are copying data to the sandbox filesystem why would that impact production data? Because the agent can re-upload the file to s3?

  • That's exactly how I tried to address that problem with https://github.com/afshinm/zerobox -- you control what network access (e.g. `--deny-net *.amazonaws.com`) your agent has and you also get snapshotting out of the box.

    That said, using LakeFS is probably a better long term solution and I like this approach.

  • Good question - the filesystem is Fuse-mounted into the sandbox, not copied into it. This way agents can modify data directly simply by interacting with the "local" files.