Comment by appplication

1 day ago

Maybe I’m entirely uncreative here, but if all they have is identity data and the implied data of having triggered a verification event, it feels like at best anything trained on this is really sketchy and could lead to some really messed up analysis. Like “we determined brown people trigger perform Claude queries that trigger identify verification at a rate 70% higher than white people”.

They would have access to IDs and whatever photos or real time video/audio is requires to verify. That's a big step towards building quite a large dataset for ID systems used for surveillance, for example.

There's also always the risk of not knowing who gets those documents later. The Dutch didn't think much of keeping detailed records including religious affiliation until the Nazis rolled into town.

LLM use is obviously much less politically charged today than religion was then (or ever), but that can always change, especially when an administration has already attacked said LLM provider as being something along the lines of a dangerous enemy of the state.