this post was submitted on 09 Jan 2025
57 points (95.2% liked)

Selfhosted

60366 readers
678 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

Detailed Rules Post

  1. Be civil.

  2. No spam.

  3. Posts are to be related to self-hosting.

  4. Don't duplicate the full text of your blog or readme if you're providing a link.

  5. Submission headline should match the article title.

  6. No trolling.

  7. Promotion posts require active participation, with an account that is at least 30 days old. F/LOSS without a paywall has exceptions, with requirements. See the rules link for details.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 3 years ago
MODERATORS
 

Now that we know AI bots will ignore robots.txt and churn residential IP addresses to scrape websites, does anyone know of a method to block them that doesn't entail handing over your website to Cloudflare?

you are viewing a single comment's thread
view the rest of the comments
[โ€“] Atherel@lemmy.dbzer0.com 6 points 1 year ago (1 children)

Its not AI but take a look at nG-firewall, it blocks most know unwanted stuff and gets regular updates.:

https://perishablepress.com/ng-firewall/

[โ€“] ctag@lemmy.sdf.org 1 points 1 year ago

Will check this out. Thanks!