← Back to context

Comment by freehorse

8 hours ago

What I understand reading this is that if you use their online services, incl AI-agents, llm based tab-completion, auto-updates etc, you send data to their servers, and on that part they run analytics. Frankly, this is what I would expect anyway, ie if I disable telemetry locally, it would affect what I do locally, ie no data about how I use my software locally would leave the machine, but if I sent data to some server I would not expect people not to run analytics on their servers.

> AI tab completion feature... it's safe to assume that all of the code you edit is sent to them

Yes, this is quite obvious, how else could they provide AI tab completion? I hope anybody understands this before using sth like this. They do specify that "[...] telemetry expressly does not include Customer Data" though.

> They do specify that "[...] telemetry expressly does not include Customer Data" though.

Yes and no. They first grant themselves a license "to derive and generate Telemetry" from the users copyrighted material, something that they only need if they're deriving it from the actual creative works the customer updates, and not just the metadata about them.

And they define telemetry extremely broadly, effectively "anything useful for lawful business purposes except customer data".

So this agreement would seem to cover things like "an update to an AI model trained off of your code" or even "an AI summary of what you're working on and any relevant business information contained therein". As long as they process it to something new, it's not "customer data" (a term defined narrowly in the agreement). I don't expect that they are doing that, but I think they've given themselves permission to. The agreement is far too broad.

I agree that I expect that they are deriving metadata, and would expect that regardless of this agreement, but this agreement doesn't seem necessary for that.