this post was submitted on 19 Nov 2025
74 points (96.2% liked)

I've been running OCR on the recent House Epstein email dump. I'm making this available now that it's close to finishing (20k/23k emails processed).

Processing script available here: https://codeberg.org/sillyhonu/Image_OCR_Processing_Epstein

I also put an analysis script in there if you want to use Drive/Colab.

The files finished so far are available here:

https://files.catbox.moe/xrgts0.sqlite
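
If you just want a quick look at what's in the file after downloading it, here is a minimal sketch using Python's standard sqlite3 module. It makes no assumptions about the schema: it only lists whatever tables exist and counts their rows. The function name `summarize_sqlite` is made up for this example and is not part of the processing script in the repo.

```python
import sqlite3
from pathlib import Path


def summarize_sqlite(path):
    """Return {table_name: row_count} for every user table in the database.

    Hypothetical helper for illustration; the table names come from the
    file itself, nothing about the schema is assumed.
    """
    # Open read-only so the downloaded file is never modified.
    uri = Path(path).resolve().as_uri() + "?mode=ro"
    conn = sqlite3.connect(uri, uri=True)
    try:
        names = [
            row[0]
            for row in conn.execute(
                "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
            )
        ]
        counts = {}
        for name in names:
            # Skip SQLite's internal bookkeeping tables.
            if name.startswith("sqlite_"):
                continue
            # Quote the identifier so unusual table names still work.
            quoted = '"' + name.replace('"', '""') + '"'
            counts[name] = conn.execute(
                f"SELECT COUNT(*) FROM {quoted}"
            ).fetchone()[0]
        return counts
    finally:
        conn.close()
```

Assuming you saved the download as xrgts0.sqlite in the current directory, `summarize_sqlite("xrgts0.sqlite")` gives you the table names and row counts, which is enough to figure out what to query next.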

mojofrododojo@lemmy.world 2 points 3 months ago

PFFFT it's obviously George Clinton from Parliament-Funkadelic.