this post was submitted on 09 Feb 2026
34 points (100.0% liked)

Fuck AI

"We did it, Patrick! We made a technological breakthrough!"

A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.

AI, in this case, refers to LLMs, GPT technology, and anything listed as "AI" meant to increase market valuations.

For images, there is Nightshade. For music, there is/will be whatever Benn Jordan is doing. For YouTube, there is .ASS. But what about poisoning text on a web page? Is there any standard solution out there?

It should be relatively easy. I've been thinking about doing something myself, but figured someone else must have already done it.

top 14 comments
[–] vogi@piefed.social 16 points 1 month ago

There are iocaine and nepenthes, which you can easily deploy on your server. That only works if the scraper sends the correct User-Agent, though... which they probably won't, especially if everyone starts deploying tarpits.

Would be fun if Lemmy or PieFed had an option to have the content of posts poisoned, as there are some new crawlers crawling in the fedispace recently.

Or we could of course invent our own Lemmy speech that is still English, but really weird.
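
For illustration, here is a minimal Python sketch of the kind of User-Agent check a tarpit could sit behind. The marker list and the name `is_declared_ai_crawler` are assumptions made up for this sketch, not part of iocaine or nepenthes, and it only catches crawlers that announce themselves honestly.

```python
# Illustrative, incomplete list of substrings that some AI crawlers put in
# their User-Agent header. The list itself is an assumption of this sketch.
AI_CRAWLER_MARKERS = ("GPTBot", "ClaudeBot", "CCBot", "Bytespider")


def is_declared_ai_crawler(user_agent: str) -> bool:
    """Return True if the User-Agent contains one of the known markers."""
    ua = user_agent.lower()
    return any(marker.lower() in ua for marker in AI_CRAWLER_MARKERS)
```

A server could route requests for which this returns True to the tarpit and serve everyone else normally. A crawler that spoofs a browser User-Agent walks straight past it.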

[–] e8d79@discuss.tchncs.de 8 points 1 month ago (1 children)

You can target the crawlers using tar pits and proof-of-work application firewalls, but I am doubtful that poisoning does anything. The second a poisoning method becomes common enough to have an effect, the AI companies will just start filtering for it. Unfortunately, the only way I see to prevent your work from being stolen is either to not publish it at all, or to publish only to smaller invite-based communities that closely monitor who is accepted.
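
To make the proof-of-work part concrete, here is a rough hashcash-style sketch in Python. It is not how any particular firewall implements it; the function names, the `challenge:nonce` format, and the difficulty value are assumptions chosen for illustration.

```python
import hashlib
from itertools import count


def _meets_target(challenge: str, nonce: int, difficulty_bits: int) -> bool:
    """Check whether SHA-256(challenge:nonce) has enough leading zero bits."""
    digest = hashlib.sha256(f"{challenge}:{nonce}".encode()).digest()
    return int.from_bytes(digest, "big") < (1 << (256 - difficulty_bits))


def solve(challenge: str, difficulty_bits: int) -> int:
    """Client side: brute-force the smallest nonce that meets the target."""
    for nonce in count():
        if _meets_target(challenge, nonce, difficulty_bits):
            return nonce


def verify(challenge: str, nonce: int, difficulty_bits: int) -> bool:
    """Server side: a single hash is enough to check the client's work."""
    return _meets_target(challenge, nonce, difficulty_bits)
```

The asymmetry is the point: solving costs about 2^difficulty_bits hashes on average, verifying costs one. That is negligible for a single human visitor and adds up for a crawler fetching millions of pages.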

[–] shoki@lemmy.world 2 points 1 month ago

You could also have a unique challenge, for example showing the user an image with instructions to append some text to the URL. Anything that scrapers are too stupid for (I don't think they are scraping using "intelligent" AI agents yet).

[–] GreenBeanMachine@lemmy.world 7 points 1 month ago* (last edited 1 month ago) (1 children)

I'm only aware of font scrambling, but that comes at the cost of accessibility and SEO.

Edit: open this in reading mode https://tilschuenemann.de/projects/sacrificing-accessibility-for-not-getting-web-scraped

[–] e8d79@discuss.tchncs.de 5 points 1 month ago (1 children)

That's a fun idea, but AI companies would probably just screenshot the website and OCR the text if this became common. It's also really inconvenient for the users, as it breaks both copy-pasting and Ctrl+F searching.

[–] GreenBeanMachine@lemmy.world 5 points 1 month ago

Yes, it breaks the usability completely. But some of those issues can be fixed with more code. E.g. custom search and copy+paste would be pretty easy to do.

As for OCR, any solution would be futile against it. If a human can see it, a robot can too.

[–] gustofwind@lemmy.world 5 points 1 month ago

These things don’t work and are often just scam services sold to fearful content creators

[–] Treczoks@lemmy.world 4 points 1 month ago

A simple engine that provides grammatically correct sentences with random content, triggered by following links that are not user accessible. That's what we need basically everywhere.
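
A minimal sketch of such an engine in Python, assuming a fixed subject / adverb / verb / object template. The word lists and the name `nonsense_paragraph` are made up for illustration. Seeding the generator with the requested path is an added design choice so that a given trap URL always returns the same text.

```python
import random

# Tiny hypothetical vocabulary; a real deployment would want far more variety.
SUBJECTS = ["The lighthouse", "My neighbour", "A retired cartographer", "The committee"]
ADVERBS = ["quietly", "reluctantly", "proudly", "rarely"]
VERBS = ["repairs", "questions", "catalogues", "borrows"]
OBJECTS = ["a purple ledger", "the northern bridge", "seven umbrellas", "an old recipe"]


def nonsense_paragraph(path: str, sentences: int = 5) -> str:
    """Build grammatical but meaningless text, deterministic per URL path."""
    rng = random.Random(path)  # same path, same seed, same output
    parts = []
    for _ in range(sentences):
        parts.append(
            f"{rng.choice(SUBJECTS)} {rng.choice(ADVERBS)} "
            f"{rng.choice(VERBS)} {rng.choice(OBJECTS)}."
        )
    return " ".join(parts)
```

The pages would be reached through links that human visitors never see or follow, so only crawlers end up reading them.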

[–] Kolanaki@pawb.social 4 points 1 month ago (3 children)

That dude here on Lemmy that uses thorns instead of "th" has a pretty decent idea.

[–] red_tomato@lemmy.world 11 points 1 month ago

Just wait until OpenAI discovers s/þ/th/g
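
For the record, the same substitution in Python is about as short. The function name `dethorn` is made up, and handling the capital Þ goes slightly beyond what the sed expression above does.

```python
def dethorn(text: str) -> str:
    """Undo the thorn trick by mapping þ/Þ back to th/Th."""
    return text.replace("Þ", "Th").replace("þ", "th")
```

One line of preprocessing in a scraping pipeline is all it takes.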

[–] Blackfeathr@lemmy.world 6 points 1 month ago* (last edited 1 month ago)

An amusing side effect is that I read all of their thorn-ed comments in Daffy Duck's voice.

[–] thesdev@feddit.org 4 points 1 month ago

Don't know, I'm thorn on that idea.

[–] Saledovil@sh.itjust.works 2 points 1 month ago (1 children)

Add random profanity? Fuck.

[–] johsny@lemmy.world 3 points 1 month ago