Comment by embedding-shape
5 hours ago
> We build on our prior research by using three different AI detectors (Pangram, GPTZero, Copyleaks). We independently evaluate each to show that the false positive rates and average false negative rates are consistently below 2%. Each AI detector shows a similar trend.
This is all bullshit, none of those actually work, and the false-positives rates are sky-high. I'm not sure how any serious person have tried out any of those services and came away with the impression of "Well, better than nothing" because literally, it seems the opposite.
The detectors aren’t great but they aren’t really the issue. The fact that LLMs make it so easy to impersonate human communication is precisely the problem here. There cannot be a reliable way to identify if something is from a human or not. And the ease of access and low price makes using LLM generated content a no brainer, you have to actively go out of your way to produce human generated content.
We are building a future where human contact will be scarce
> We are building a future where human contact will be scarce
Yes, until you remember there is a world outside of the screen, where people build things with their hands, use their physically to play instruments for others, paint beautiful things for others to see physically and so much more.
"Humanness" online been dead for decades already, if you want humanness you need to step outside, or at least invite other humans home.
There is a meaningful difference between “humans online are tribalistic” and “content consumed by humans is generated by machines”. The IRL world isn’t safe either, books, newspapers, advertising, speeches are/will be heavily LLM made. Political parties are using LLMs. The IRL humans are relying on what their LLMs summarized or searched for them.
The same way the online world has never actually been that distinct from the offline world, one is merged with the other and they influence each others.
There has been of humanness online of you do not look for it on social medias. But that’s now breaking down, because we developed a technology designed to impersonate human communication
2 replies →
So if these do not work, to what do you attribute the rising positive rate?
Humans writing more like LLMs, just like new LLMs write more like humans, it's all coalescing into one.
I've copied-pasted comments I made on HN from like 2020 and had it tell me it's "100% AI". I've seen examples where the services claim "100% AI" because there was no normal dashes, only em-dashes. Even have a recent example from HN itself: > This reads very AI. Pangram [0] agrees [1]. [0] Not perfect, but I think as good evidence as any: https://arxiv.org/pdf/2501.15654 [1]albumen
5 hours ago
Your evidence seems very anecdata. The graphite.io study does make an effort to quantify the false positive and false negative rates of the three detectors, rather than just saying “they work”. They generate 2000 ai articles and ask the detectors to evaluate them, measuring the false negatives (articles falsely IDd as human written); and they use a separate pre-AI dataset (years 2000-2022) to determine false positives.
1 reply →
If you need an AI detector to figure out if something is AI or not, surely that means the AI is so good that there is no need to detect whether it is AI or not, because it is indistinguishable from writings by a human when read by humans?
I mean this is an article coming from an SEO company that's really just trying to advertise its services in the end. Their methodology seems very loose.
Are you an AI agent trying to gaslight us?
Just a boring old organic human tired of other organic beings falling for obvious bullshit most likely made up by machines convincing humans with something like "you really have a neat idea here, the world will appreciate you making this into a product".