Comment by Nextgrid

4 days ago

Piracy involves obtaining media content for free for which you should normally pay for, as a result of someone sharing the media meant for their own personal use to the general public.

YouTube does not ask for payment, it sends the video data you want alongside some bullshit you’ll ignore and waste precious human time doing so.

Ad blocking just involves offloading the ignoring to the computer, as it should, since computers are meant to automate menial tasks.

I've tried to explain this to people repeatedly and they don't get it. They're always like "oh no the AI scraper is slamming my website it's ruining everything". Um, maybe configure your web browser to not send me data if you don't want me 'scraping' your website. It's literally your server's choice to send me data. I'm just asking from a few IPs. If you want to send data to all of them that's your server's choice.

But I think people don't get the fact that they can just request payment or only send to authenticated users from authorized IPs and so on. Instead they want to send to all IPs without payment but then get upset when I use a bunch of IPs without paying. Weird.

I'm trying to read a bunch of stuff. The entire point of a computer is to make that easy. I'm not going to repetitively click through a bunch of links when a bot can do that way faster.

  • And what is the surefire way to stop AI scrapers from accessing your website? If there is no way, how can this be an acceptable ask?

    It already sounds like you're using several IPs to access sites, which seems like a work around to someone somewhere trying to limit the use of one IP (or just lack of desire to host and distribute the data yourself to your various hosts).

    Just because you can do something doesn't mean everyone must accept and like that you are doing that thing.

    • The answer is right there: use authentication with cost per load, or an IP whitelist.

      GP is absolutely right. If your server is just going to send me traffic when I ask I’m just going to ask and do what I want with the response.

      Your server will respond fine if I click through with different IPs and it’s just a menial task to have this distribution of requests to IPs, which is what we made computers for.

      Yeah, you’re right of course that no one has to like the “piracy” or “scraping” or whatever other name you’re giving to a completely normal request-response interaction between machines. They can complain. And I can say they’re silly for complaining. No one has to like anything. Heck you could hate ice cream.

      4 replies →