Comment by jannyfer

14 hours ago

That was fun, and the script on github looks hand-written which is refreshing after having been reading AI-written code for months.

I have 120k photos in iCloud that I'm sure have duplicates (I exported my library to Google Photos years ago and exported it back to iCloud). The iOS duplicate detection stopped flagging duplicates for me to merge a while back. I gotta do something like this script...

yeah there's something you can still feel in hand-written code - the variable names show how someone was actually thinking about the problem. AI code is grammatically perfect but quite hollow

If you really want to do perceptual-hash-based deduplication, use multiple, heterogeneous hash algorithms (phash, dct hash, mean hash, ...). Any single hash algo is lossy and will happily match very, very different images, but if all of the hashes match, you're much less likely to have a false positive.

I wrote up what I do here: https://photostructure.com/guide/what-do-you-mean-by-dedupli...
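
For anyone who wants to try the "all hashes must agree" idea, here is a minimal sketch. It is not the approach from the linked write-up, just an illustration under assumptions: images are already decoded and downscaled to small grayscale grids (lists of rows of ints), the function names are made up for this example, `dct_hash` is a simplified pHash-style hash rather than any library's implementation, and the 10% tolerance is an arbitrary choice.

```python
import math
from statistics import median

def mean_hash(gray):
    """Average hash: one bit per pixel, set when the pixel is brighter than the mean."""
    pixels = [p for row in gray for p in row]
    avg = sum(pixels) / len(pixels)
    return [p > avg for p in pixels]

def diff_hash(gray):
    """Difference hash: one bit per horizontal neighbour pair, set when brightness rises."""
    return [row[x + 1] > row[x] for row in gray for x in range(len(row) - 1)]

def dct_hash(gray, keep=4):
    """Simplified pHash-style hash: threshold low-frequency DCT terms at their median.

    Assumes a square grid; `keep` (hypothetical parameter) is how many low
    frequencies to use per axis. The DC term is skipped.
    """
    n = len(gray)
    coeffs = []
    for u in range(keep):
        for v in range(keep):
            if u == 0 and v == 0:
                continue  # DC term only encodes overall brightness
            coeffs.append(sum(
                gray[y][x]
                * math.cos((2 * y + 1) * u * math.pi / (2 * n))
                * math.cos((2 * x + 1) * v * math.pi / (2 * n))
                for y in range(n) for x in range(n)
            ))
    med = median(coeffs)
    return [c > med for c in coeffs]

def hamming(a, b):
    """Number of bit positions where two equal-length hashes differ."""
    return sum(x != y for x, y in zip(a, b))

HASHES = (mean_hash, diff_hash, dct_hash)

def likely_duplicates(img_a, img_b, max_fraction=0.1):
    """True only if every hash is within the tolerance; one disagreement vetoes the match."""
    for hash_fn in HASHES:
        ha, hb = hash_fn(img_a), hash_fn(img_b)
        if hamming(ha, hb) > max_fraction * len(ha):
            return False
    return True

# Two 8x8 grids that look different: a hard black/white split vs. a stepped ramp.
split = [[0, 0, 0, 0, 255, 255, 255, 255] for _ in range(8)]
ramp = [[70, 80, 90, 100, 170, 180, 190, 200] for _ in range(8)]

print(hamming(mean_hash(split), mean_hash(ramp)))  # 0: mean hash alone calls these identical
print(hamming(diff_hash(split), diff_hash(ramp)))  # 48 of 56 bits differ
print(likely_duplicates(split, ramp))              # False
```

The two sample grids collide under the mean hash but not under the difference hash, which is exactly the kind of false positive that requiring agreement across hashes filters out. On real photos you would decode and resize with an image library first and tune the tolerance on your own library.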

> and the script on github looks hand-written which is refreshing after having been reading AI-written code for months.

We really need a shorthand for "is it AI or not? has entered the discussion".