Comment by 1vuio0pswjnm7
8 days ago
"Currently, however, the Internet Archive does not disallow any specific crawlers through its robots.txt file, including those of major AI companies. As of January 12, [2026,] the robots.txt file for archive.org read: "Welcome to the Archive! Please crawl our files. We appreciate it if you can crawl responsibly. Stay open!" Shortly after we inquired about this language, it was changed. The file now reads, simply, "Welcome to the Internet Archive!""
No comments yet
Contribute on Hacker News ↗