this post was submitted on 19 Nov 2025

Data Hoarder


I've been running OCR on the recent House Epstein email dump. Making this available now that it's close to finishing (20k/23k emails processed).

Processing script available here: https://codeberg.org/sillyhonu/Image_OCR_Processing_Epstein

I also put an analysis script in there if you want to use Drive/Colab.

The files finished so far are available here:

https://files.catbox.moe/xrgts0.sqlite
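
If you just want to grep the download without opening a DB browser, here is a rough sketch in Python. It makes no assumptions about the schema (the post doesn't describe the table or column names), so it discovers them at runtime and runs a `LIKE` search over every column. The function name `search_sqlite` and the helper `quote_ident` are made up for this example and are not part of the linked processing script.

```python
import sqlite3


def quote_ident(name):
    """Quote a table/column name for safe use inside an SQL statement."""
    return '"' + name.replace('"', '""') + '"'


def search_sqlite(db_path, term, limit=20):
    """Return (table, column, value) for values containing `term`.

    Hypothetical helper: table and column names are discovered at runtime
    because the schema of the downloaded file is not documented in the post.
    `limit` caps the matches returned per column, not overall.
    Note: '%' and '_' inside `term` act as LIKE wildcards.
    """
    conn = sqlite3.connect(db_path)
    hits = []
    try:
        # List user tables, skipping SQLite's internal bookkeeping tables.
        tables = [row[0] for row in conn.execute(
            "SELECT name FROM sqlite_master "
            "WHERE type = 'table' AND name NOT LIKE 'sqlite_%'"
        )]
        for table in tables:
            # PRAGMA table_info returns one row per column; index 1 is the name.
            info = conn.execute(f"PRAGMA table_info({quote_ident(table)})")
            columns = [row[1] for row in info]
            for col in columns:
                query = (
                    f"SELECT {quote_ident(col)} FROM {quote_ident(table)} "
                    f"WHERE {quote_ident(col)} LIKE ? LIMIT ?"
                )
                for (value,) in conn.execute(query, (f"%{term}%", limit)):
                    hits.append((table, col, value))
    finally:
        conn.close()
    return hits
```

After saving the file locally (say as `xrgts0.sqlite`), something like `search_sqlite("xrgts0.sqlite", "some keyword")` should give you the matching rows. SQLite's `LIKE` is case-insensitive for ASCII by default, and scanning every column will be slow on a big file, so once you know which column holds the OCR text it makes sense to query that one directly.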

TropicalDingdong@lemmy.world 10 points 1 day ago

I have literally already processed this into an SQL database you can download right now.

Literally the point of the post.

Just download and search.

https://files.catbox.moe/xrgts0.sqlite