Comment by rnhmjoj

3 days ago

I don't understand, why do people resort to this tool instead of simply blocking by UA string or IP address. Are there so many people running these AI crawlers?

I blackholed some IP blocks of OpenAI, Mistral and another handful of companies and 100% of this crap traffic to my webserver disappeared.

Because that solution simply does not work for all. People tried and the crawlers started using proxies with residential IPs.

less savory crawlers use residential proxies and are indistinguishable from malware traffic

Lots of companies run these kind of crawlers now as part of their products.

They buy proxies and rotate through proxy lists constantly. It's all residential IPs, so blocking IPs actually hurts end users. Often it's the real IPs of VPN service customers, etc.

There are lots of companies around that you can buy this type of proxy service from.

You should read more. AI companies use residential proxies and mask their user agents with legitimate browser ones, so good luck blocking that.