Blocking AI crawlers with Caddy

Deckweiss@lemmy.world · edit-2 7 months ago

arisunz@lemmy.blahaj.zone · 7 months ago

I got meaner with them :3c

winnie@lemmy.ml · 6 months ago

Suggestion at the end:

  <a class="boom" href="https://boom.arielaw.ar">hehe</a>

Wouldn’t it also destroy GoogleBot (and other search engine crawlers), getting your site delisted from Search?

acockworkorange@mander.xyz · 6 months ago

That’s devilishly and deliciously devious.

JustARegularNerd@aussie.zone · 7 months ago

I just want you to know that was an amazing read, was actually thinking “It gets worse? Oh it does. Oh, IT GETS EVEN WORSE?”

arisunz@lemmy.blahaj.zone · 6 months ago

lmao that means a lot, thanks <3

Deckweiss@lemmy.world · 7 months ago

The nobots module I’ve linked bombs them

jkrtn@lemmy.ml · 7 months ago

This is one of the best things I’ve ever read.

I’d love to see a robots.txt do a couple safe listings, then a zip bomb, then a safe listing. It would be fun to see how many log entries from an IP look like get a, get b, get zip bomb… no more requests.

pvq@lemmy.ml · 6 months ago

I really like your site’s color scheme, fonts, and overall aesthetics. Very nice!

not_amm@lemmy.ml · 6 months ago

I agree, it’s readable and very cute!

A Basil Plant@lemmy.world · edit-2 6 months ago

In dark mode, the anchor tags are difficult to read. They’re dark blue on a dark background. Perhaps consider something with a much higher contrast?

[Image: a website with a dark purple background and dark blue links.]

Apart from that, nice idea - I’m going to deploy the zipbomb today!

arisunz@lemmy.blahaj.zone · 6 months ago

nice catch, thanks (i use light mode most of the time)

hollyberries@programming.dev · 7 months ago

I’m a fan of hellpotting them.

SayCyberOnceMore@feddit.uk · 7 months ago

Ooh, didn’t know about that one… thanks

Para_lyzed@lemmy.world · 6 months ago

From your recommendation, I found a related project, pandoras_pot, which I am able to run in a Docker container and which seems to run more efficiently on my Pi home server. I now use it in my Caddyfile to redirect a number of fake subdomains and paths that are likely to be found by a malicious bot (of course, all of them are excluded in my robots.txt for bots that actually respect it). Thanks for the recommendation!
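A rough sketch of what that kind of setup could look like, not the actual config. The domain names, decoy paths, site root, and the localhost:8080 address for the pandoras_pot container are all placeholders assumed for illustration, and it proxies to the tarpit rather than issuing an HTTP redirect:

```
# Hypothetical decoy subdomain: everything here goes to the tarpit
admin.example.com {
	# Assumed address of the pandoras_pot container
	reverse_proxy localhost:8080
}

example.com {
	# Hypothetical decoy paths that only a misbehaving bot should request
	@decoys path /wp-login.php /.env /backup/*
	reverse_proxy @decoys localhost:8080

	# Everything else is served normally (placeholder site root)
	root * /srv/www
	file_server
}
```

The same decoy paths would then also be listed under Disallow in robots.txt, so that well-behaved crawlers never reach them.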

Arghblarg@lemmy.ca · 7 months ago

We should do more than block them, they need to be teergrubed.

Deckweiss@lemmy.world · edit-2 7 months ago

That’s an easy modification. Just redirect or reverse proxy to the tarpit instead of using abort.
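A minimal sketch of that change, assuming a tarpit is already listening on localhost:8080. The site name, port, site root, and user-agent list below are placeholders for illustration, not taken from the linked config:

```
example.com {
	# Hypothetical matcher: case-insensitive match on a few crawler user agents
	@aibots header_regexp User-Agent "(?i)(GPTBot|ClaudeBot|CCBot|Bytespider)"

	# Previously: abort @aibots
	# Now matched requests are handed to the tarpit instead
	reverse_proxy @aibots localhost:8080

	# Alternative: send them elsewhere with a redirect
	# redir @aibots https://tarpit.example.com{uri}

	# Everyone else gets the normal site (placeholder site root)
	root * /srv/www
	file_server
}
```

Proxying keeps the bot connected to your own server for as long as the tarpit holds it, while a redirect only works if the bot chooses to follow it.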

I was even thinking about redirecting the AI to a data-poisoned html document, but after some research I gave up.

Personally I don’t want my servers to waste traffic or CPU cycles on that.

boredsquirrel@slrpnk.net · 7 months ago

Such a cool person making the video available for download

LiveLM@lemmy.zip · 6 months ago

Huh, looks like the post in r/linux got removed for not being relevant.
What a joke.