Comment by gusgus01
3 days ago
I don't say this lightly, but I don't think you read my reply or at least didn't understand the implications, especially because you don't actually argue against anything I say. You only say generic statements about justifications and logical conclusions and conclude with assumptions about RIAA.
I stated that the open internet as a whole is the commons, not any specific person's pet project, and thus, that AI scraping (or any bulk scraping done commonly and wholesale) makes it untenable for most people to keep participating. Twitter for example has gone your preferred way, mostly requiring authentication to access. There are many arguments on HN about whether that's a good move, or even a move that others could take and expect success. And that's a huge platform. Just recently there have been front page posts on HN about bringing back personal blogs, and also posts about how personal blogs not behind the great wall of Cloudflare led to TBs of "false" traffic because of scrapers, which costs real money.
I stated I think piracy, ad block, and AI scraping to be part of the same spectrum. I think the justification for ad blocking has a much lower level of burden than the justification for AI scraping to the point you need multiple IPs and argue for whitelisting as the only option to stop it, because of the amount of effect you are having.
Much like how bandwidth has different levels of payment if you use less than 100 MB or more than 1 TB, or how delivering a package that weighs 10 lbs is way cheaper than a package that weighs 1000 lbs, or how at some level of effort times repetition it makes sense to automate something programmatically vs just doing it manually. There are of course situations where each makes sense, but the expectations can vary, and the results are not always linear depending on the inputs. This all completely ignores the social aspect of it that can add a whole new layer of complexity that has it's own logic.
Scraping (or access without ads eg ad blockiing, or outside sharing of data eg piracy) has always been complained about by those that have data that people want to scrape, eg airlines or hbo or disney, it's just that now all data is data that is being scraped absolutely non-stop to the detriment of many and the gain of few that everyone has a reason to complain. It also explains why people have differing opinions.
No comments yet
Contribute on Hacker News ↗