Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

A "few" more would be fine - but the sheer scale of the malicious AI training bot crawling that's happening now is enough to cause real availability problems (and expense) for numerous sites.

One web forum I regularly read went through a patch a few months ago where it was unavailable for about 90% of the time due to being hammered by crawlers. It's only up again now because the owner managed to find a way to block them that hasn't yet been circumvented.

So it's easy to see why people would allow googlebot and little else.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: