/0

2172 readers

202 users here now

Meta community. Discuss about this lemmy instance or lemmy in general.

Service Uptime view

founded 2 years ago

MODERATORS

db0@lemmy.dbzer0.com

143

On the recent LLM brouhaha and the FAF stance on LLMs for fediverse moderation. (lemmy.dbzer0.com)

submitted 1 day ago by db0@lemmy.dbzer0.com to c/div0@lemmy.dbzer0.com

78 comments fedilink hide all child comments

Recently there's been quite a bit of outrage because the developer of Piefed publicly called out the Fediverse Anarchist Flotilla (FAF) for supposedly using LLM for automating instance moderation. and even though many of our admins the larger lemmy community took great lengths to debunk that post, it has become the disinfo that keeps on giving (see https://lemmy.dbzer0.com/post/68749575, https://kolektiva.social/@ophiocephalic/116518887925988112, https://lemmy.dbzer0.com/post/68222242 and more)

After clarifying our position for yet another time, someone suggested we should make an official post and an instance policy to "give me something I can boost as a positive example and a sign that things will be better going forward." and given that this storm-in-a-teacup doesn't seem to be abating as people are all too happy to bring it up again and again to malign the FAF; We're making this post to once and for all clarify this situation.

History

We're not going to rehash the whole drama and the many hit pieces against the FAF in the past two weeks, but I need to post the exact situation as it happened, without speculations and assumptions that people are all too happy to jump to.

One of our mods develops a tool to download a user's public posting history through the lemmy API, to be used for evaluating them during moderation and shares it with some people in the admin team as something in progress. This tool does not feed anything to LLMs, it simply downloads the comments locally in a text file for easier review than going via the lemmy GUI.
Someone is reported to our instance admins for blatant zionism and genocide apologia.
An admin uses the tool to download the accused person's comment history for evaluation
A quick evaluation (without LLM) confirms that this is a person that needs to be instance-banned. The moderation decision has now been locked-in at this point.
At the same time, that admin was curious to discover if LLMs can used to summarize people's positions so that people can quickly follow-up with mod actions, without having to evaluate everyone's posts manually and reduce the workload of admins writing long justifications)
As an experiment, the admin pass the user's comment history through a locally-run open-weights LLM (Qwen) to see the summarized output. It happens to match their own decision.
The admin decides the leave the LLM summary in a pastebin along with that user's posting history for reference. As an inside joke, they decide to claim the post was summarized by OpenAI, as they expected only our community would care about this and our stance on corporate-LLMs is well-known at this point.
The admin bans that person, providing a link to that pastebin as justification.
The admin decides not to continue using LLMs anyway for summaries, for many valid reasons. As evidence see the lack of other pastebins with LLM summaries.

~2 weeks pass...

The piefed developer is banned by a different mod in our instance for "zionism". (I put this in quotes as this is one mod's opinion, and not necessarily our instance's position.)
The piefed developer apparently starts going through our instance modlogs for banned zionists and parses all their justifications
The piefed developer discovers that modlog justification from 2 weeks before with the LLM summary.
The piefed developer ask quickly in the common lemmy admin channel about it, at which point our instance admin in question, clarifies that the LLM was not used in the decision-making.
The piefed developer does not officially reach to anyone else from our admin team, despite the fact that we've reached out before and asked them to contact us in advance for inter-instance matters to avoid escalations.
The piefed developer make the public call-out I linked above as a piece of investigative journalism. The piefed developer does not provide the comments from our team which conflict with their narrative. The piefed developer not ask us for an official statement.
The piefed developer to this day has not amended their public call-out from the comments multiple of our admins and lemmy users leave under their post, conflicting with the narrative.

If you feel I've misrepresented any steps of this history, please let us know and I'll be happy to adjust.

Given that, we acknowledge that even though we didn't use LLMs in moderations, we allowed it to appear as if we did, and that's on us. We will of course not do the same mistake again (appear as to be using LLMs for moderation)

The FAF's stance on LLM moderation

We are aware that our instance is seen as "LLM-friendly" due to our nuanced take on LLMs but that does not mean that we, as an instance, ever considered using LLMs for moderating our instance. So we want to make it absolutely crystal clear how we stand on the matter.

As an official policy:

We have never used LLMs to guide our moderation decisions. This includes using LLM summaries which we would then validate, as well as LLM summaries which we use to confirm our existing decisions. LLMs are just not in our moderation loop whatsoever.
We have never passed instance data to corporate LLMs.
We have not used any automated moderation tooling which utilizes LLMs. The closest we have is the FOSS anti-CSAM filter I've developed and shared for years now, which relies strictly on locally-hosted machine-vision models.
We have never officially considered using LLMs for moderation, nor do we plan to.
As a team we're steadfastly against LLM for moderation due to its inherent biases.
If any of the above changes, we will publicly inform the FAF community.

We hope this can finally put this matter to rest.

you are viewing a single comment's thread
view the rest of the comments

[–] hendrik@palaver.p3x.de -3 points 18 hours ago* (last edited 18 hours ago) (1 children)

Thanks for the clarification.

To add a few of my personal remarks: I think as an instance / admin (team) you bear some kind of responsibility for your users and mods. I mean you provide a stage to them. And federate that out to other instances. It's a bit unfortunate if mods use instances for experiments that are against instance rules, or they use it to call people names... I mean if your staff knows this is going on, you kinda need to step in. You can't say our staff is against it. But we gladly let our mods do it. ...So I appreciate the clarification.

And with the public data... Well, technically almost everything is public data on the Fediverse. So people could make a point for arbitrary positions. It's more a unwritten social agreement how we don't sell user data to Google or use it for nefarious purposes. I think we're more or less trying to do what's right to do. Not just look at the designation of an activity. The majority of users here probably didn't mean to consent to abuse. That's why we're here, and not on Reddit.

The Anti-CSAM filter is great work. That's a very welcome use of machine learning. At least in my opinion.

[–] db0@lemmy.dbzer0.com 15 points 18 hours ago* (last edited 18 hours ago) (1 children)

Note, we don't play like that. It's not my instance. it's our instance, our FAF. I don't have any more power than any other admin

[–] hendrik@palaver.p3x.de 1 points 17 hours ago* (last edited 17 hours ago) (2 children)

Hmmh. I mean I don't exactly know the hierarchies on dbzer0 regarding those kinds of smaller/quicker decisions. Guess what I wanted to say is more or less: If you have instance rules, or a collective stance/policy on things, you better enforce them. Or they're meaningless. I just replied to this post because you summed it up. Other than that, whatever floats your boat.

[–] kittenzrulz123@lemmy.dbzer0.com 6 points 12 hours ago

We run on a structure where all admins are given an equal voice and large decisions are discussed internally before being made. Because of this the grand majority of small actions are made by a single admin with larger actions taken by any combination of admins.

[–] jatone@lemmy.dbzer0.com 11 points 17 hours ago (1 children)

I mean I don’t exactly know the hierarchies on dbzer0....

hint: we're anarchists. we dont really tolerate hierarchies.

[–] hendrik@palaver.p3x.de -5 points 17 hours ago* (last edited 17 hours ago) (1 children)

I'd say Anarchy is a hierarchy. A flat one. But we can also stick with hierarchies being vertical, and Anarchy is not part of that.

And I've been in real-world groups of people with flat hierarchies. It's fairly easy in those: It's EVERYBODY's responsibility in that case. If you're doing an illegal anarchist rave in the woods and you see someone spike the drinks... Or the person who sells nasty adulterated drugs shows up... You know that guy but they found some new innocent people... Well... You better do something about it, whoever you are. Or everyone who knew, watched and let it happen is to be blamed. Because there is no other host you could shift blame to. It's now become your moral obligation.

[–] jatone@lemmy.dbzer0.com 8 points 13 hours ago* (last edited 13 hours ago) (1 children)

I mean sure if you entirely redefine the word hierarchy. lol. goalposts

[–] hendrik@palaver.p3x.de 0 points 7 hours ago* (last edited 7 hours ago)

Talking about shifting goalposts... I'm far more interested in an opinion on the longer paragraph I wrote. Arguing about words regularly doesn't get you anywhere.