Comment by simonh
2 days ago
The reason they're blocking archives is people can go to the archive, to bypass paywalls and avoid targeted adverts, instead of the news site. It's also to prevent AI scrapers harvesting articles.
2 days ago
The reason they're blocking archives is people can go to the archive, to bypass paywalls and avoid targeted adverts, instead of the news site. It's also to prevent AI scrapers harvesting articles.
I meant that news sites should provide an API for Internet Archive to scrape their articles at all times to catch changes, but not provide any public access for an indefinite period of time (as an escrow) but eventually release it once the AI scraping issues blows over.
True, it's the main reason archive.today exists really