Comment by plagasul

2 days ago

possibly not or grey area under GDPR if I use identifiable information, as it is sent to Anthropic for processing, no matter if used for training or not, but I am unsure about this, I should probably anonymize and research it more, thanks for pointing it out

7 comments

plagasul

bix6 2 days ago

You could just send Anthropic scrambled names / emails and then unscramble locally?

plagasul 2 days ago
yes something like that, additionally most steps do not require data going through claude anymore, as it already wrote the script that take the student list and the qualifications model and produce a model per student, AND the script that takes that and sends each to its right email. The problematic part is when claude reads my notes and formats them into each of those student qualification sheets. There I would need some form of scrambling as you suggest, not to hijack the thread but ideas appreciated for a minimal setup. I believe claude respects .gitignore.
- bix6 1 day ago
  
  Maybe you could run a local script or smaller local model that takes a first pass through the notes and replaces every instance of a given name with their assigned number?
  
  1 reply →

plagasul 2 days ago

There is another institution I teach at that gives us Gemini, but not via API, which limits its use for this kind of work to an extent, I could do it via drive, I assume. There being a contract puts the institution and Google as responsible of the data. The first institution I was talking about has MS Teams, without AI afaik, but if they contract it I guess I can do the same with sharepoint, etc.

47282847 2 days ago

Sorry to tell you but it’s not grey area, it’s full on black. You do not have permission to share such data with a third party provider that doesn’t have strict privacy guarantees and that you have a data processing agreement with. TOS are not sufficient.

plagasul 5 hours ago

Yes, thank you, I developed and shared, above, a workflow for anonymization.