this post was submitted on 18 Nov 2025
655 points (98.8% liked)

politics

26404 readers
2261 users here now

Welcome to the discussion of US Politics!

Rules:

  1. Post only links to articles, Title must fairly describe link contents. If your title differs from the site’s, it should only be to add context or be more descriptive. Do not post entire articles in the body or in the comments.

Links must be to the original source, not an aggregator like Google Amp, MSN, or Yahoo.

Example:

  1. Articles must be relevant to politics. Links must be to quality and original content. Articles should be worth reading. Clickbait, stub articles, and rehosted or stolen content are not allowed. Check your source for Reliability and Bias here.
  2. Be civil, No violations of TOS. It’s OK to say the subject of an article is behaving like a (pejorative, pejorative). It’s NOT OK to say another USER is (pejorative). Strong language is fine, just not directed at other members. Engage in good-faith and with respect! This includes accusing another user of being a bot or paid actor. Trolling is uncivil and is grounds for removal and/or a community ban.
  3. No memes, trolling, or low-effort comments. Reposts, misinformation, off-topic, trolling, or offensive. Similarly, if you see posts along these lines, do not engage. Report them, block them, and live a happier life than they do. We see too many slapfights that boil down to "Mom! He's bugging me!" and "I'm not touching you!" Going forward, slapfights will result in removed comments and temp bans to cool off.
  4. Vote based on comment quality, not agreement. This community aims to foster discussion; please reward people for putting effort into articulating their viewpoint, even if you disagree with it.
  5. No hate speech, slurs, celebrating death, advocating violence, or abusive language. This will result in a ban. Usernames containing racist, or inappropriate slurs will be banned without warning

We ask that the users report any comment or post that violate the rules, to use critical thinking when reading, posting or commenting. Users that post off-topic spam, advocate violence, have multiple comments or posts removed, weaponize reports or violate the code of conduct will be banned.

All posts and comments will be reviewed on a case-by-case basis. This means that some content that violates the rules may be allowed, while other content that does not violate the rules may be removed. The moderators retain the right to remove any content and ban users.

That's all the rules!

Civic Links

Register To Vote

Citizenship Resource Center

Congressional Awards Program

Federal Government Agencies

Library of Congress Legislative Resources

The White House

U.S. House of Representatives

U.S. Senate

Partnered Communities:

News

World News

Business News

Political Discussion

Ask Politics

Military News

Global Politics

Moderate Politics

Progressive Politics

UK Politics

Canadian Politics

Australian Politics

New Zealand Politics

founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] pelespirit@sh.itjust.works 16 points 1 day ago (4 children)
[–] defaultusername@lemmy.dbzer0.com 16 points 1 day ago* (last edited 1 day ago) (1 children)

Those are the Epstein emails, not the Epstein files.

[–] pelespirit@sh.itjust.works 4 points 1 day ago

True, but the family released what they had. That's a response to:

Maybe he should release the originals?

[–] BlameThePeacock@lemmy.ca 19 points 1 day ago

That's not all of it.

[–] TropicalDingdong@lemmy.world 6 points 1 day ago* (last edited 1 day ago) (2 children)

Thanks for sharing this.

I've been running OCR on the images folder of the files since last week and just reached out to the creator to see if they want the data I've processed. Right now that entire graph is ONLY the "text" portion of the dump. There are 26k images, which are mostly pictures of emails and other documents. I'm like 80% through processing them (although I've had some hiccups in the past 24 hours).

https://codeberg.org/sillyhonu/Image_OCR_Processing_Epstein

[–] TheFinn@discuss.tchncs.de 3 points 1 day ago (1 children)

How good is deepseek OCR? I've heard AI does a lot better than the older methods. Is it difficult to use?

[–] TropicalDingdong@lemmy.world 3 points 1 day ago

Its phenomenal. I have found a few places where it falls down, and its usually when the text is incredibly small. You can see its being down sampled before it gets handed off to the model. It falls down on like, one example I found, some bank disclosure documentation from bank of america:

It just came out as all I's and o's.

For the emails, book text, letters, etc.. I genuinely haven't found a place it didn't work correctly as I've been spot checking the output.

If you have colab you can just try the script I put up. All you need to do to have it run is to book mark the house oversite committee google drive folder to your local google drive.

[–] pelespirit@sh.itjust.works 3 points 1 day ago (1 children)

Whoa, I hope they're interested. I didn't realize the pics info wasn't included. Thanks for doing all that work. I looked through some of it and there's a ton there.

[–] TropicalDingdong@lemmy.world 3 points 1 day ago* (last edited 1 day ago)

Yeah its ridiculous how much is in there. I'm pulling their current repo to see how they are building their DB so if they don't get back to me, I can at least combine the two databases.

And if any one reading this wants a copy of what I've processed so far, I'm more than happy to share.

But it looks to me like they dropped a couple hundred on just processing those text files. It would be north of 2.5k additional to process the data I'm creating.

That being said, mine only goes as far as extracting the contents and creating a sha256 hash to keep track of the documents themselves/ document tampering. It doesn't take the next step to extract names, locations, dates, etc..

I'm working that out now but it seems like the way to do this would be so it fits into their DB seamlessly.

[–] webghost0101@sopuli.xyz 8 points 1 day ago* (last edited 1 day ago) (2 children)

This is a legit official site??

This is an article on “the biggest political scandal in presidential history” not only does presidential use of autopen go back for decades, it reads like it was written by ai.

https://oversight.house.gov/landing/the-biden-autopen-presidency/

[–] floofloof@lemmy.ca 8 points 1 day ago* (last edited 1 day ago)

Presidential use of autopen-type devices goes back hundreds of years:

https://www.shapell.org/behind-the-scenes/the-robot-pen/

The first president to use the autopen extensively was Thomas Jefferson. ... Since Jefferson, various US presidents have made use of the autopen; some were guarded about it while others were more open about its use. Whereas once the official White House position was to deny the existence or usage of the autopen, today its existence is more of an open secret.

Harry Truman was rumored to make use of the device; Gerald Ford was open about his utilization of the autopen, but it was Lyndon B. Johnson who blew the doors off the entire affair by allowing the device to be photographed in the White House, appearing on the cover of The National Enquirer with the article “The Robot That Sits in for the President.”

John F. Kennedy was so dependent on the autopen, that he became the subject of a book entitled The Robot That Helped to Make a President.

But sure, the biggest political scandal of all time is not putting a Russia-compromised 34-times convicted felon and serial pedophile rapist who aims to turn the USA into a fascist autocracy into the presidency twice, but when Biden used a machine (as Trump also does) to sign things.

[–] pelespirit@sh.itjust.works 1 points 1 day ago

This is the release of the files from the oversight committee. Yes, it's legit, lol. It depends on what committee is releasing what.