Websites are blocking the wrong AI scrapers (Because AI companies keep making new ones)
18
votes
Is anyone here familiar with crawling the web? I’m interested in broad crawling, rather than focusing on particular sites. I’d appreciate pretty much any information about how this is usually done, and things to watch out for if attempting it.