this post was submitted on 25 May 2026
40 points (100.0% liked)
Linux Questions
3952 readers
18 users here now
Linux questions Rules (in addition of the Lemmy.zip rules)
- stay on topic
- be nice (no name calling)
- do not post long blocks of text such as logs
- do not delete your posts
- only post questions (no information posts)
Tips for giving and receiving help
- be as clear and specific
- say thank you if a solution works
- verify your solutions before posting them as facts.
Any rule violations will result in disciplinary actions
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
If the drive's firmware is faulty, SMART data will be faulty too. But can you say the percentage is somewhat high from what you dealt with, a little statistics? What I saw is my personal experience and it's definitely wouldn't be accurate as yours. I only saw a drive died out of nowhere a handful of times which is not high if I make it into a percentage.Though if the drive itself is faulty, it won't take long for it to die too.
The best I saw is a WD Caviar Black 500 GB drive from 2011 we use, still kicking. Took a backup because of its age a couple years ago but haven't died yet. The worst I saw was my friend's NVMe SSD that died in 3 months after he installed. Probably its firmware was also faulty because SMART didn't help that time.
It's nothing to do with faulty firmware, it's that smart will only see 1 in 3 issues and as such is simply not good enough to use as actual diagnostics.
I see. So, you're saying that occasionally checking
smartctl(or havingsmartdas a daemon continuously), runningbadblockstime to time and maybe checkingiostatnot really enough? I mean, Linux is by far the most used OS on servers and datacenters, if these are not enough someone would write a proper tool I guess, don't you think?Not at all. It takes a huge amount of work to do so, and the benefit of using raid etc is redundancy so they can afford for things to fail. Smart mon tools is a great example, the software is great but it needs it's database to support that drives functions to work well and they can't and don't support everything.
I see, they're solving the issue other way around then by using raid. Didn't know smartmontools work with a database, so it works kind of like an antivirus in a way. Interesting.
By the way, again it's just my experience but I want to ask this. Have you noticed faulty disks are more common in NVMe SSDs than HDDs or I happen to noticed that because it's common in cheaper options and more expensive ones are actually worth the price?
60 40 hard disks. Tho many SSDs just die Vs Hard disks which often die a slow death.