Comment by geye1234
7 days ago
I am interested in hearing how to get hard drives to last longer. Should you keep them locked away in the closet? Spin them up occasionally but not too much? Keep them always-on? I understand the less reading and writing, the better.
How does external compare to internal, if at all? Is 3.5" going to last longer than something smaller?
Spinning HDDs will eventually be at risk of failing for purely mechanical reasons, so beyond handling them with care you can't really do all that much. Keeping them always-on may be a viable strategy for drives that are already mostly on; otherwise, just spin them up once in a while, but don't expect this to lower the risk significantly. An old drive should simply have its contents transferred to new media, and then be treated as something ephemeral that may fail at any time without warning.
I have found there is really no practical way to predict the bathtub curve for hard drive failures.
The solution is just a lot of redundancy for larger disk arrays whenever practical. I currently have a 15x1TB 7200 RPM zpool in raidz2 I use for "scratch space" for some automation projects. It writes about 500GB-1TB or so a day and has for... over 18 years. I have had exactly one drive fail from that pool, under heavy abuse. That one failed a year or two in. Prior to my personal use it was beat on (mostly reads) as backing storage for uploaded images for a large website where the drives operated at 90% or higher I/O utilization pretty much 24x7.
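To put rough numbers on that pool, here is a back-of-the-envelope sketch. It assumes the 15 drives form a single raidz2 vdev (the comment doesn't say how the pool is laid out) and ignores ZFS metadata, padding, and TB/TiB differences. The function names are made up for illustration.

```python
def raidz_summary(drives: int, drive_tb: float, parity: int) -> dict:
    """Rough numbers for ONE raidz vdev (assumption: the pool is a single vdev).

    Ignores ZFS metadata, padding and reserved space, so real usable
    capacity will be somewhat lower.
    """
    if not 0 < parity < drives:
        raise ValueError("parity must be between 1 and drives - 1")
    data_drives = drives - parity
    return {
        "data_drives": data_drives,
        "usable_tb": data_drives * drive_tb,
        # raidz1/2/3 survive as many simultaneous drive failures as parity drives
        "tolerated_failures": parity,
    }


def lifetime_writes_tb(tb_per_day: float, years: float) -> float:
    """Total data written over the period, ignoring leap days."""
    return tb_per_day * 365 * years


summary = raidz_summary(drives=15, drive_tb=1.0, parity=2)
low = lifetime_writes_tb(0.5, 18)   # 500 GB/day for 18 years
high = lifetime_writes_tb(1.0, 18)  # 1 TB/day for 18 years

print(summary)
print(f"written over 18 years: {low:.0f} to {high:.0f} TB")
```

Under those assumptions the pool has about 13 TB usable, survives two simultaneous failures, and has taken somewhere in the range of 3 to 6.5 PB of writes.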
I have other pools of disks where I have replaced over 50% of them 6 years in, with batches of failures seemingly at random. You start to notice patterns with various drive models, but not until well after the point of purchase, when it's far too late to predict based on anything like vendor reputation or whatnot. I've had batches from WD, Seagate, Toshiba, and HGST that were incredibly reliable and others that were incredibly unreliable. Even within the same model series, different drive sizes can have wildly different reliability characteristics.
I don't bother pulling "old" drives out of production preemptively any more. The only thing I do preemptively now is pull drives with very critical SMART prefailure warnings such as a consistently growing number of unrecoverable sector errors. That one and a couple other attributes are worth watching trends for, but the rest are pretty pointless and really do not seem to correlate much with failure. And again, which SMART attributes are worth paying attention to, and when, varies by drive model.
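One way to read "consistently growing" is sketched below. It assumes you already collect the raw counter value for an attribute at regular intervals (how you collect it is up to your monitoring). The function name and the threshold of three increases are arbitrary choices for illustration, not a recommendation from any vendor or tool.

```python
from typing import Sequence


def is_consistently_growing(samples: Sequence[int], min_increases: int = 3) -> bool:
    """Return True if an error counter keeps climbing across samples.

    samples: raw counter values, oldest first (e.g. one reading per day).
    min_increases: hypothetical threshold; how many separate rises count
                   as a trend rather than a one-off event.
    """
    increases = 0
    for prev, cur in zip(samples, samples[1:]):
        if cur < prev:
            # Counter went down (reset, or a different drive in the slot):
            # treat the history as unreliable rather than as a trend.
            return False
        if cur > prev:
            increases += 1
    return increases >= min_increases


healthy = [0, 0, 0, 0, 0]        # never moves
one_off = [0, 8, 8, 8, 8]        # single jump, then stable
growing = [0, 2, 5, 5, 9, 14]    # keeps rising

for name, series in [("healthy", healthy), ("one_off", one_off), ("growing", growing)]:
    print(name, is_consistently_growing(series))
```

A single jump that then stays flat is not flagged here; only a counter that rises repeatedly is. Whether that is the right call depends on the drive model, as noted above.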
I simply treat drives as wear items that fail with little to no notice, and just make sure I can survive several simultaneous failures. Make sure to regularly test your monitoring!
Not power cycling drives is huge as well, as you note. For example these old 1TB spinners:
I’ll add: if you’re using your drive in a NAS, use a drive model that’s meant for this purpose, and avoid SSDs.