Comment by marginalia_nu
18 hours ago
It's not really too big of a problem for a well-implemented crawler. You basically need to define an upper bound both in terms of document count and time for your crawls, since crawler traps are pretty common and have been around since the cretaceous.
No comments yet
Contribute on Hacker News ↗