Showing posts from January, 2012


Looks like our S.M.A.R.T. tool at might be ready for slow-ish initial release.

Simultaneous disk failure

Modern hard drives have annual failure rate of about 5%. With this sort of the failure probability you cannot realistically talk about the simultaneous hard drive failures, even if the window of simultaneousness spans several hours. The simultaneous failure of two drives is just not probable enough. The catch is, the statistics only applies to independent drives. The common-mode failure, when several drives fail because of a single cause, is not accounted for.

As a consequence of this, in fault-tolerant RAID applications more effort should be put to eliminate possible single points of failure. Realistically, this means the most effective thing you can do is to eliminate the operator, eh.

Why don't we use RAID 8?

All RAID types are built using three elements:
Striping Data blocks on several disks.Writing redundant data (it is usually the result of calculating particular functions).Writing multiple copies of data (usually two copies are written).Different combinations of these elements allow to get data placement patters (RAID levels) which provide desired balance between speed, reliability and price.

If we only use striping, we get a RAID 0. Using only one function to calculate redundant data we will get us a RAID 5. If we add one more set of redundant data to a RAID 5, it becomes a RAID 6. We get a RAID 1 from identical copies, but if you combine the striping technique with exact copies then you get a RAID 10. These data placement patterns are the most widespread and well-known, forming a so-called raid triangle.

Except these RAID types, we can also see exotic combinations.

For example, if in RAID 10 we use the number of disks which is not an integral multiple of the number of data copies we wil…