Comment by ilamont
10 hours ago
A Pew study of a random sample of Internet links conducted in October 2023 found significant “link rot”: almost 40 percent of links that had been active 10 years earlier were broken. And that’s probably an underestimate: the study was based on the Common Crawl web archive (the same one that AI labs use to train their models), which is quite comprehensive but probably contains some bias toward more prominent sites.
"Random sample of Internet links" is going to include a lot of absolute garbage.
If we're talking about news sites, or commentary, or blogs, or magazines, or newspapers, or other publishers, the number of dead links will be far higher. Those are the types of sites that are likely to fail, be acquired, get migrated, or become paywalled.
I worked as a technology journalist for years starting in the late 90s. I did a lot of freelance work as well, and almost nothing survives online. There were media brands that were shut down, content migrated to another site, the CMS was migrated from Drupal to Wordpress to something else, there were two or three acquisitions, and so on. Last week, I checked some articles that I worked on between 3 and 10 years ago and they were either 404s or paywalled.
When I left one of the higher-profile pubs in the aftermath of the 2008 financial crisis, I knew my articles wouldn't last even though they were migrated to a sister publication. I made PDF copies of every single one. I still have them in a folder somewhere, not sure what to do with them.
My personal blogs are still up, but even those will die at some point.
> "Random sample of Internet links" is going to include a lot of absolute garbage.
It's also likely to include a lot of non-content links, e.g. links to index and navigation pages, interstitials, search results, user profiles, image galleries, etc. These sorts of links don't reliably address specific content, and it's natural that they'll change or die over time. This doesn't necessarily mean that anything valuable has been lost.
https://blog.archive.org/2026/04/23/introducing-vanishing-cu...
https://archive.org/details/vanishing-culture-2026
(when able, please consider donating to the Internet Archive; they are the durable, long term storage system of last resort)
> Yes, you can still access (for now) Disney-era FiveThirtyEight content via the invaluable Internet Archive, and pre-Disney-era content from The New York Times (which I partnered with from 2010 through 2013). And obviously, we’re trying to recreate some of the most popular parts of FiveThirtyEight at Silver Bulletin. The election models and polling averages are here, and new-and-improved versions of the sports models (PELE, ELWAY, COOPER) are gradually returning too.2 Galen Druke, Clare Malone and I have even been getting the old podcast crew back together for live shows.
With regards to:
> When I left one of the higher-profile pubs in the aftermath of the 2008 financial crisis, I knew my articles wouldn't last even though they were migrated to a sister publication. I made PDF copies of every single one. I still have them in a folder somewhere, not sure what to do with them.
May I suggest:
https://help.archive.org/help/uploading-a-basic-guide/
You can upload them all as a single item, or as individual items per piece and asking IA Patron Services to create a collection for you.
> My personal blogs are still up, but even those will die at some point.
Drop links, and they will be queued for crawling, if not already archived. If you would like to self serve, https://web.archive.org/save
Thank you!