Data Hoarder


Keep it about datahoarding.


founded 2 years ago
1
 
 

cross-posted from: https://lemmy.world/post/43115555

Here’s an overview of community efforts to make The Files more accessible. I’ve written a small description and possible warnings alongside them.

Epstein Research GitHub Mirror

Jmail

  • Access Jeffrey Epstein’s emails through a Gmail-style interface and star important ones.
  • https://jmail.world/

Track The Files

  • A sourced, transparent investigation into the public figures named in the Epstein files — and the tax dollars that flow to them.
  • ⚠️ Made with LLMs
  • https://trackthefiles.org/

Epstein Document Network Explorer

EpsteIn

3D Network Cloud

Epstein Archive


Please add more sources as comments, or let us know if one of them has gone dark or appears to be dodgy.

2
 
 

Hi everyone!

I’ve been busy updating the site over the last few days. Everything is now fully up to date, including 109GB of the VOL9 release.

Also, the image gallery now has a Random feature: just click the button to see a random image from the files.

PS: Some people are saying that the PDF files are not loading in Chrome. If so, try another browser. I am working on a fix for the people who are having issues in Chrome!

3
4
 
 

I don't know if this is the right place, but I figured the Hoarder Community would have a good idea on software.

I'm looking for an app that will scan an audio library and pick out duplicates. It has to do this by some means other than a mere filename, file size or audio tags. Ideally it would use all of those criteria and also do an audio analysis. I do have all my music sorted, collated, and tagged correctly tho. Open source would be awesome. Barring that, free is also acceptable. LOL

'presh

5
 
 

Hello all, I have a brainstorming question for you all, or maybe I'm just looking to confirm that my proposed solution makes some sense.

I am looking to rearrange my storage setup to fit my needs. Here is the situation:

  • I have a 2TB iCloud drive which is the main source of data (iCloud drive and Photos).
  • I have a self hosted Jellyfin instance that requires storage for the media content.
  • At the moment it's using a 2TB WD Elements USB drive, which is terrible: not enough storage and super slow.
  • I have a WD MyCloud EX2 Ultra with 2 bays and 2 WD Red 4TB disks.
  • I have a TrueNAS machine running on an Atom D2700 with 8GB of RAM and 2 old 320GB disks that I've used just for testing.

The objective:

  • iCloud backup: take a recurring backup of iCloud.
  • Jellyfin storage: I figured at least 4TB.
  • Reuse and cost optimization: reuse as much of the current hardware as possible and minimize costs.

My proposed solution:

  • iCloud backup:
    • Move the 2x4TB WD Red disks from the WD MyCloud EX2 Ultra to the TrueNAS machine and set them up as a mirror.
    • Use icloud-docker to back up iCloud to TrueNAS.
    • Use rclone to keep an additional offsite backup on Scaleway Glacier (I'm already using it to store some other backups and am very happy with it).
  • Jellyfin storage:
    • Buy either 1 or 2 6TB WD Blue disks (or equivalent) to put into the WD MyCloud EX2 Ultra and use that as the storage unit for the Jellyfin instance.
    • The reason for using WD Blue is to contain costs. The multimedia storage is not that important to me, meaning I will not take a backup of it, and if I lose something it's not a problem. I just need space and decent speed.
  • Reuse and cost optimization:
    • I will reuse all the hardware that I currently have and buy only 1 or 2 more disks.

What do you guys think?

6
 
 

I've been running OCR on the recent House Epstein email dump. I'm making this available now that it's close to finishing (20k/23k emails processed).

Processing script available here: https://codeberg.org/sillyhonu/Image_OCR_Processing_Epstein

I also put an analysis script in there if you want to use Drive/Colab.

Currently finished files are available here:

https://files.catbox.moe/xrgts0.sqlite

7
 
 

cross-posted from: https://swg-empire.de/post/4845931

I've had multiple reads fail on a fairly new drive.

I ran smartctl -t long /dev/sdb, but when I checked back a few minutes later, smartctl -a /dev/sdb showed that no tests were running and that the previous test had "the read element of the test failed".

I ran smartctl -t offline /dev/sdb next, and after that was done, smartctl -x /dev/sdb showed about 1500 errors but still reported SMART as PASSED.

Here is the output of smartctl -x /dev/sdb: https://pastebin.com/09rNZZfD

How should I interpret these results? Was I wrong to assume that the long test had finished? Should I replace the drive? Or might something else be wrong, like the SATA connection?
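One way to narrow this down, as a rough sketch rather than a diagnosis: pull the handful of attributes that usually separate a failing disk from a bad cable out of the smartctl -A table. The function name smart_suspects is made up for illustration. The attribute IDs (5, 197 and 198 for sector problems, 199 for interface CRC errors) are the conventional ones, but vendors differ in how they report raw values, so treat the output as a hint only.

```shell
#!/usr/bin/env bash
# Hypothetical helper: reads `smartctl -A` output on stdin and prints
# the attributes of interest whose raw value is non-zero.
smart_suspects() {
  awk '
    # Attribute rows start with a numeric ID and RAW_VALUE is column 10.
    $1 ~ /^[0-9]+$/ && ($1 == 5 || $1 == 197 || $1 == 198 || $1 == 199) {
      if ($10 + 0 > 0) {
        hint = ($1 == 199) ? "check SATA cable/port" : "media problem, plan a replacement"
        printf "%s %s raw=%s (%s)\n", $1, $2, $10, hint
      }
    }
  '
}

# Usage (needs root and smartmontools):
#   smartctl -A /dev/sdb | smart_suspects
#   smartctl -l selftest /dev/sdb   # self-test log, including the LBA of the first error
```

Non-zero pending or reallocated sectors point at the disk itself, while a growing CRC error count with clean sector counters points more towards the cable or port.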

8
 
 

In this video Charlie Kirk asks if Benjamin Netanyahu gave stand down orders to increase the severity of the Hamas attack in order to declare martial law and avoid prison.

9
 
 

I know banned book lists are often enforced at the state level through public schools and movies aren't really a part of common curriculum, but is there a good list anywhere of films that the government might try to...say...remove from streaming?

10
11
12
 
 

It used to be that 14TB was the sweet spot for recertified price per TB. After looking around, it seems to have moved up a bit: at least on ServerPartDeals and goHardDrive, even 22TB drives are floating around $14/TB.

I feel like eventually I could migrate all of my 14TB disks to 22TB and save a lot of heat. Of course, I'd need to upgrade my two parity disks first, which would currently cost around $677 after tax. I'm curious as to what parity sizes everyone else is rocking.
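For anyone comparing listings, the per-TB figure is quick to compute. A small sketch, with price_per_tb as a made-up helper name and the assumption that both parity disks would be 22TB; note that the $677 figure is after tax, so it is not directly comparable with pre-tax listing prices.

```shell
# Hypothetical helper: price_per_tb TOTAL_PRICE DISK_COUNT TB_PER_DISK
price_per_tb() {
  awk -v price="$1" -v n="$2" -v tb="$3" \
    'BEGIN { printf "%.2f\n", price / (n * tb) }'
}

price_per_tb 677 2 22   # two 22TB parity disks at $677 total: prints 15.39
```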

I'm also kind of realizing that this probably doesn't need to be a post, but I'm gunna post it anyway in case someone wants to enable my data hoarding.

Edit: I forgot to mention that I'm strictly considering WD HC drives only.

13
23
submitted 6 months ago* (last edited 6 months ago) by bjoern_tantau@swg-empire.de to c/datahoarder@lemmy.world
 
 

Since the enshittification of streaming providers is continuing, I'm thinking of going back to my own collection, saved on a central server for the family's convenience.

I just need more storage. But the only 8-12 TB drives in the 100-150 € price range are refurbished ones. I could only afford one at the moment so some resilience with RAID is out of the picture.

Can I trust these drives? Or will they fail in a few years? I kind of don't want to have to rip all my stuff twice.

Edit: Thanks for the input. I bit the bullet and was able to find a 12 TB HDD at 142 €.

14
8
submitted 6 months ago* (last edited 6 months ago) by Pamboo@piefed.social to c/datahoarder@lemmy.world
 
 

I already have protective cases for my HDDs, but I would like a storage box or suitcase to protect them, like this one. Can you recommend which HDD storage suitcase I should use? I'm from Europe.

15
 
 

9.62/TB

16
17
18
19
20
 
 

Original question by @impudentmortal@lemmy.world

Looking to upgrade my NAS hard drives. Currently have two 4TB WD Red Plus hard drives but I wanted to get some large capacity drives. Was looking into getting 16 or 18TB drives. My current drives are basically whisper quiet and have been running great since 2019 but I feel like it's time to upgrade the capacity.

The NAS is currently on a desk beside my computer. I don't have any cabinets to place it in and would prefer not to connect to it through Wi-Fi. Hence why I'd like for the drives to be as quiet as possible.

I was considering getting a Seagate Exos or Ironwolf (and buying used for the great price) but I've read users online saying they regret buying those models because of their noise. I was also looking at the WD Red Pro, but WD's own website only rates them at 3.6/5, with most of the negative reviews complaining about dead-on-arrival drives. Additionally, 25% of all reviews are 1 star. Neither fills me with much confidence.

TLDR: What's a quiet and reliable hard drive recommendation for a NAS?

Would it be better just to go with the WD Red Plus at a lower capacity?

21
 
 

It's 15k$, do you guys think I could talk a bank into thinking this is a car for a car loan? LMAO

22
 
 

I've never transferred Pokemon between gens and I've never used Pokemon Home, but it seems wild to me to be so invested into such a fickle storage system. Thoughts and prayers for the guy affected

23
24
25
25
submitted 8 months ago* (last edited 8 months ago) by cm0002@lemmy.world to c/datahoarder@lemmy.world
 
 

OC by @muhyb@programming.dev

I’m sure some of you are already using it like this, but if not, this could be useful for you.

It creates a directory with the channel’s name, creates sub-directories named after each playlist, numbers the videos so they stay in order, and can pick up where it left off if you have to cancel a download midway.

You can modify it to your needs.

Add this to your ~/.bashrc or your favourite shell config:

alias yt='yt-dlp --yes-playlist --no-overwrites --download-archive ~/Downloads/yt-dlp/archive.txt -f "bestvideo[height<=1080]+bestaudio/best[height<=1080]" -o "~/Downloads/yt-dlp/%(uploader)s/%(playlist_title,single_playlist)s/%(playlist_index,00)s - %(title)s - [%(id)s].%(ext)s"'

You can even limit the download speed by adding --limit-rate 640K to the alias. This example caps downloads at about 5 Mb/s.
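To pick a value for a different line speed, the conversion is simple arithmetic. A sketch, assuming yt-dlp treats K as 1024 bytes per second (worth checking against its documentation); kib_to_mbit is just an illustrative name.

```shell
# Hypothetical helper: convert a --limit-rate value in KiB/s to Mbit/s.
kib_to_mbit() {
  awk -v k="$1" 'BEGIN { printf "%.2f\n", k * 1024 * 8 / 1000000 }'
}

kib_to_mbit 640    # prints 5.24, i.e. roughly 5 Mb/s
```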
