hylobates@jlai.lu to Selfhosted@lemmy.worldEnglish · 2 days agoBased on this graph, and this graph alone, guess at what time I completely blocked OpenAI crawlersjlai.luimagemessage-square73fedilinkarrow-up1548arrow-down16file-text
arrow-up1542arrow-down1imageBased on this graph, and this graph alone, guess at what time I completely blocked OpenAI crawlersjlai.luhylobates@jlai.lu to Selfhosted@lemmy.worldEnglish · 2 days agomessage-square73fedilinkfile-text
minus-squareAHemlocksLie@lemmy.ziplinkfedilinkEnglisharrow-up13arrow-down1·1 day agoPretty sure I’ve repeatedly heard about the crawlers completely ignoring robots.txt, so does Cloudflare really do that much?
minus-squaretomjuggler@lemmy.worldlinkfedilinkEnglisharrow-up3·4 hours agoYes, CloudFlare blocks agents completely if they ignore it’s restrictions. The key is scale - CloudFlare has a birds eye view of traffic patterns across millions of sites and can do statistical analysis to determine who is a bot. I hate the necessity but it works
minus-squareSv443@sh.itjust.workslinkfedilinkEnglisharrow-up8arrow-down1·1 day agoLike a lock on a door, it stops the vast majority but can’t do shit about the actual professional bad guys
minus-squareFreedomAdvocate@lemmy.net.aulinkfedilinkEnglisharrow-up1·4 hours agoCloudflare definitely can and does stop the vast majority of actual professional bad guys.
Pretty sure I’ve repeatedly heard about the crawlers completely ignoring robots.txt, so does Cloudflare really do that much?
Yes, CloudFlare blocks agents completely if they ignore it’s restrictions. The key is scale - CloudFlare has a birds eye view of traffic patterns across millions of sites and can do statistical analysis to determine who is a bot.
I hate the necessity but it works
Like a lock on a door, it stops the vast majority but can’t do shit about the actual professional bad guys
Cloudflare definitely can and does stop the vast majority of actual professional bad guys.