Lemmy Shitpost
Welcome to Lemmy Shitpost. Here you can shitpost to your hearts content.
Anything and everything goes. Memes, Jokes, Vents and Banter. Though we still have to comply with lemmy.world instance rules. So behave!
Rules:
1. Be Respectful
Refrain from using harmful language pertaining to a protected characteristic: e.g. race, gender, sexuality, disability or religion.
Refrain from being argumentative when responding or commenting to posts/replies. Personal attacks are not welcome here.
...
2. No Illegal Content
Content that violates the law. Any post/comment found to be in breach of common law will be removed and given to the authorities if required.
That means:
-No promoting violence/threats against any individuals
-No CSA content or Revenge Porn
-No sharing private/personal information (Doxxing)
...
3. No Spam
Posting the same post, no matter the intent is against the rules.
-If you have posted content, please refrain from re-posting said content within this community.
-Do not spam posts with intent to harass, annoy, bully, advertise, scam or harm this community.
-No posting Scams/Advertisements/Phishing Links/IP Grabbers
-No Bots, Bots will be banned from the community.
...
4. No Porn/Explicit
Content
-Do not post explicit content. Lemmy.World is not the instance for NSFW content.
-Do not post Gore or Shock Content.
...
5. No Enciting Harassment,
Brigading, Doxxing or Witch Hunts
-Do not Brigade other Communities
-No calls to action against other communities/users within Lemmy or outside of Lemmy.
-No Witch Hunts against users/communities.
-No content that harasses members within or outside of the community.
...
6. NSFW should be behind NSFW tags.
-Content that is NSFW should be behind NSFW tags.
-Content that might be distressing should be kept behind NSFW tags.
...
If you see content that is a breach of the rules, please flag and report the comment and a moderator will take action where they can.
Also check out:
Partnered Communities:
1.Memes
10.LinuxMemes (Linux themed memes)
Reach out to
All communities included on the sidebar are to be made in compliance with the instance rules. Striker
view the rest of the comments
There's a 'meme' trend of local ML tinkerers messing with the Epstein files as a dataset: https://huggingface.co/datasets/tensonaut/EPSTEIN_FILES_20K/
See: text embeddings https://huggingface.co/datasets/svetfm/epstein-files-nov11-25-house-post-ocr-embeddings
Edit: Now I’m pondering making an “EpsteinGPT” finetune myself. Maybe like a 4B-14B model for the sole purpose of Epstein RAG? Or a 32B responding in the style of the Epstein email text, just because.
This is interesting. Do you have any thoughts on why someone would want to utilize the epstein data for ML? Like, what's the point, in your opinion? Just lulz? Or, something else?
Lulz.
It’s an interesting coding exercise, though. Trying to (for example) OCR all the documents, or generate a relations graph between the documents or concepts, is a great into to language modeling (which is not prompt engineering, like most seem to think).
If you’re like a reporter or something, it’s also the obvious way to comb through the documents looking for clues to actually make headlines. I dunno what techniques they use at big outlets, though.
Just imagine having to explain being in possession of a handmade "EpsteinGPT" to someone 🤣🤣
Meme finetunes are nothing new.
As an example, there are DPO datasets with positive/negative examples intended to train LLMs to respond politely and helpfully (as opposed to the negative response). There are some that include toxic comments plucked from the web as negative examples.
And the immediate community thought was "...What if I reversed them?"
My immediate thought was "...What if I reversed them?"
haha just imaging people showing off their collections, "here's my Mr. Rogers chatbot, and Thomas Jefferson, and even Luffy from One Piece! And uh...oh yeah over here we have EpsteinGPT for when I, I mean for if, um...its for lulz ok?! Don't look at me like that, where are you going?!"
It's literally "this one is my fursona. This one won't refuse BDSM, but its not as eloquent. Oh, this one is lobotimized but really creative." I kid you not. Here is an example, and note that is one of 115 uploads from one account:
https://huggingface.co/Mawdistical/RAWMAW-70B?not-for-all-audiences=true
And I love that madness. It feels like the old internet. In fact, furries and horny roleplayers have made some good code contributions to the space.
Early on, there were a few 'character' finetunes or more generic ones like 'talk like a pirate' or 'talk only in emojiis.' But as local models got more advanced, they got so good at adopting personas that the finetuning focused more on writing 'style' and storytelling than emulating specific characters. For example, one trained specifically to stick to the role of a dungeonmaster: https://huggingface.co/LatitudeGames/Nova-70B-Llama-3.3
Or this one, where you can look at the datasets and see the anime 'style' they're trying to massage in: https://huggingface.co/zerofata/GLM-4.5-Iceblink-106B-A12B
Hey this is really cool and thanks for sharing it.
We need more of this.
Not data hidden in some file.
How did people parse data sets before we had LLMs to do it for us?
...The same way Google Search has forever?
Ranking, reranking, oldschool RAG.
Instead of em dashes, it's full of extra spaces before and after each period and extra period.
I dunno what the 'writing style' would end up as. The bulk of the text seems to be formatted like this:
I'd have to generate prompt/response wrappers too. But it would definitely bring up Trump and Clinton randomly, heh.
...There are automated metrics to rank English text by reading level, 'quality' and such. I guess it could be filtered to most 'interesting' emails and reformatted.
Ah I misinterpreted as the most recent email dump. Like you could email back and forth with an avatar of jee