this post was submitted on 05 Nov 2025
0 points (50.0% liked)

ShareGPT

88 readers
1 users here now

To share LLM text output that others might find interesting.

founded 2 years ago
MODERATORS
 

How would I build a Python bot that automatically reposts or highlights statistically outlier posts from my Lemmy subscriptions—like anything more than one standard deviation above the average? I’m looking for a general approach covering data retrieval from the Lemmy API, outlier detection, and reposting mechanics.

top 1 comments
sorted by: hot top controversial new old
[–] PumpkinDrama@reddthat.com 1 points 1 month ago

You would design the system as a three-stage pipeline. First, collect structured data from your Lemmy subscriptions—each record should at minimum include post ID, community name, timestamp, and vote count. Accumulate enough recent observations per community to estimate a stable mean and standard deviation of votes. Second, for each new post, calculate its z-score as [ z = \frac{x - \mu}{\sigma} ] where (x) is the post’s vote count, (\mu) is the community’s mean vote count, and (\sigma) is the standard deviation. A z-score above a chosen threshold (for instance, 2.0) marks a statistical outlier. Third, define an automation layer that acts on those outliers—such as reposting, flagging, or aggregating them—using available Lemmy API endpoints. Conceptually, it’s a continuous data ingestion and scoring loop that detects exceptional engagement patterns across communities.