Comment by soulofmischief
1 day ago
I totally agree, the problem is difficult however because even if we create a perfectly anonymous system for registering with social media, modern LLMs make semantic analysis trivial. It's going to be impossible to remain anonymous without also using an LLM to strip the unique footprint of your text. Which leads to a very strange and monotonous culture for internet discussions. Might be unavoidable, though, at least for certain kinds of discussions.
Yes this is indeed a problem. But this is even more reason to keep private conversations private. It will stop those LLMs learning your footprint from your private conversations.
It's an interesting thing what you're saying. I've been thinking about this happening (I think it's inevitable) and also about using an LLM to sanitise my semantic footprint.
Yeah, it's been on my mind since the original transformers paper, because persona and identity management has been a long-time interest of mine. Both offensive and defensive tooling, such as fingerprinting and anti-fingerprinting.
If you're interested in building an open source persona management suite to distribute as freedom software and level the playing field against State agents who are already building and improving such tools, I would love to find a partner to help with such a project. Even if you don't code, there are other duties besides coding involved with successfully promoting such a project and developing a community around it.
Yes I work in cyber and I've always thought about the ability for fingerprinting people by their content. Not really semantically (that was just not really possible until LLMs came up) but more in terms of interests on social media.
But semantic analysis adds a whole new level with so much entropy that it's bound to be unique. And LLMs are just ideal for pattern recognition. There's not much we can do about that as a human, trying to fool it won't work. It really needs an artificial sanitiser. One that really builds a persona and aligns to it deeply (like little colloquialisms from the purported origin of the persona).
And also things like comment posting hours. I have identified several accounts from people who said they were chatting with me and I could prove they were doing something completely different at that same time. Us humans aren't consistent enough for that. Especially if you have multiple sockpuppets.
I don't think I could help much with that though. I'm neither a developer nor a promotor, I'm too much of an introvert for that. But it sounds really interesting.
But yeah I'm sure that within 5 years, if you are still writing comments yourself, it won't matter whether they know your phone number or email address, you will be uniquely identified by just what you write. I wouldn't be surprised if the darker forces in society have this capability already.
2 replies →