← Back to context

Comment by dotnet00

4 days ago

I imagine you'd add more heuristics and various types of hashes? If the file is just sitting there, rarely accessed and unshared, or if the file only triggers on 2/10 hashes, it's probably a false alarm. If the file is on a public share, you can probably run an actual image comparison...

A lot of classic perceptual hash algorithms do "squinty" comparisons, where if an image kind of looks like one you've hashed against, you can get false positives.

I'd imagine outside of egregious abuse and truly unique images, you could squint at a legal image and say it looks very much like another illegal image, and get a false positive.

From what I'm reading about PhotoDNA, it's your standard phashing system from 15 years ago, which is terrifying.

But yes, you can add heuristics, but you will still get false positives.