Comment by bragr
3 years ago
Yeah just coming here to say this. Multiple disk failures are pretty probable. I've had batches of both disks and SSDs with sequential serial numbers, subjected to the same workloads, all fail within the same ~24 hour periods.
Had the same experience with (identical) SSDs, two failures within 10 minutes in a RAID 5 configuration.
(Thankfully, they didn't completely die but just put themselves into read-only)
Seems like it was only a few days ago that there was a comment from a former Dropbox engineer here pointing out that a lot of disk drives they bought when they stood up their own datacenter had been found to all have a common flaw involving tiny metal slivers.