Ask HN: Do you know what data your AI coding agent sends to the cloud?

9 hours ago

Every session, my AI coding agent reads files, runs commands, and makes API calls. I have no idea exactly what ends up in the cloud. Is anyone actually tracking this at a granular level, or do we all just trust the tool?

11 comments

lbrauer

zambelli 9 hours ago

I trust the tool only in the sense that I don't put anything sensitive into it! Unless I built it myself, I assume the data is going somewhere.

We have a policy at work for this: our most sensitive data can only be passed to on-prem models.

That said, I have no evidence of anything going to the cloud, or of frontier providers doing anything with chat history other than storing it for later.

Self-hosted + custom harness for anything I don't want getting out at all.

lbrauer 8 hours ago

Makes sense. Does your custom harness give you a record of what actually crossed the boundary, or is it mostly trust-based blocking?

zambelli 2 hours ago

My harness is only used with on-prem models, so I don't have any checks in place. If the GGUF is somehow calling home, I'm not catching it.
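For anyone wondering what "a record of what actually crossed the boundary" could look like in a harness, here is a minimal sketch. It assumes the harness has a single choke point where it can call `record()` just before any outbound request. `EgressLog`, its methods, and the destination names are all hypothetical, not part of any real tool, and this only logs what the harness itself sends; it would not catch a process calling home on its own.

```python
import hashlib
import json
import time
from dataclasses import dataclass, field


@dataclass
class EgressLog:
    """Hypothetical in-memory record of payloads that left the machine."""

    entries: list = field(default_factory=list)

    def record(self, destination: str, payload: bytes) -> dict:
        # Store size and a hash rather than the payload itself, so the
        # log does not become a second copy of the sensitive data.
        entry = {
            "ts": time.time(),
            "destination": destination,
            "bytes": len(payload),
            "sha256": hashlib.sha256(payload).hexdigest(),
        }
        self.entries.append(entry)
        return entry

    def total_bytes(self, destination=None) -> int:
        # Sum bytes sent, optionally restricted to one destination.
        return sum(
            e["bytes"]
            for e in self.entries
            if destination is None or e["destination"] == destination
        )

    def to_jsonl(self) -> str:
        # One JSON object per line, convenient for grepping or diffing later.
        return "\n".join(json.dumps(e, sort_keys=True) for e in self.entries)
```

Hashing instead of storing the raw payload is a design choice: you can later check whether a specific file was sent by hashing it yourself, without keeping its contents in the log.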

aianisulislam 6 hours ago

You don't. Even if you read the policy, it would be buried in legalese. Instead, give the tool access only to the kind of data you are okay with being sent to the cloud. Also, the company's reputation, which is what is at stake, matters more than its policies.
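A minimal sketch of the "only give it data you are okay with sending" idea, assuming you control which file paths get handed to the agent. The directory and file names below are made-up examples, and `is_shareable` is a hypothetical helper, not a feature of any particular tool.

```python
from pathlib import PurePosixPath

# Hypothetical policy: only these top-level directories may be shared.
ALLOWED_DIRS = ("src", "docs")
# Hypothetical deny rules for files that commonly hold secrets.
BLOCKED_NAMES = (".env", "id_rsa")
BLOCKED_SUFFIXES = (".pem", ".key")


def is_shareable(rel_path: str) -> bool:
    """Return True if a project-relative path may be shown to the agent."""
    path = PurePosixPath(rel_path)
    # Reject absolute paths and any attempt to climb out of the project.
    if path.is_absolute() or ".." in path.parts:
        return False
    # Default deny: the path must start inside an allowed directory.
    if not path.parts or path.parts[0] not in ALLOWED_DIRS:
        return False
    # Even inside allowed directories, refuse likely secret files.
    if path.name in BLOCKED_NAMES or path.suffix in BLOCKED_SUFFIXES:
        return False
    return True
```

With these example rules, `is_shareable("src/main.py")` is true, while `src/.env`, `docs/server.pem`, and anything outside `src` or `docs` is refused. Default deny keeps new, unreviewed directories out until someone adds them on purpose.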

SyntaxErrorist 6 hours ago

I have started treating AI coding tools more like giving a contractor temporary access to my machine than like using autocomplete.
