Effects Of A Hard Drive Failure; Compromised Fault Tolerance; Recovering From Compromised Fault Tolerance - HP 381513-B21 - Smart Array P800 Controller RAID User Manual

Smart array controllers for hp integrity servers
Hide thumbs Also See for 381513-B21 - Smart Array P800 Controller RAID:
Table of Contents

Advertisement

CAUTION:
be operational after the system is power-cycled or (for a hot-pluggable drive) after the drive
has been removed and reinserted. However, continued use of such marginal drives may
eventually result in data loss. Replace the marginal drive as soon as possible.

Effects of a hard drive failure

When a hard drive fails, all logical drives that are in the same array are affected. Each logical drive in
an array might be using a different fault-tolerance method, so each logical drive can be affected
differently.
RAID 0 configurations cannot tolerate drive failure. If any physical drive in the array fails, all non-
fault-tolerant (RAID 0) logical drives in the same array will also fail.
RAID 1+0 configurations can tolerate multiple drive failures as long as no failed drives are mirrored
to one another.
RAID 5 configurations can tolerate one drive failure.
RAID 6 (ADG) configurations can tolerate the simultaneous failure of two drives.

Compromised fault tolerance

If more hard drives fail than the fault-tolerance method allows, fault tolerance is compromised, and the
logical drive fails. In this case, all requests from the operating system are rejected with unrecoverable
errors. You are likely to lose data, although it can sometimes be recovered (refer to
compromised fault
One example of a situation in which compromised fault tolerance may occur is when a drive in an array
fails while another drive in the array is being rebuilt. If the array has no online spare, any logical drives
in this array that are configured with RAID 5 fault tolerance will fail.
Compromised fault tolerance can also be caused by non-drive problems, such as a faulty cable or
temporary power loss to a storage system. In such cases, you do not need to replace the physical drives.
However, you may still have lost data, especially if the system was busy at the time that the problem
occurred.

Recovering from compromised fault tolerance

If fault tolerance is compromised, inserting replacement drives does not improve the condition of the
logical volume. Instead, if the screen displays unrecoverable error messages, perform the following
procedure to recover data:
Power down the entire system, and then power it back up. In some cases, a marginal drive will work
1.
again for long enough to enable you to make copies of important files.
If a 1779 POST message is displayed, press the F2 key to re-enable the logical volumes. Remember
that data loss has probably occurred and any data on the logical volume is suspect.
Make copies of important data, if possible.
2.
Replace any failed drives.
3.
After you have replaced the failed drives, fault tolerance may again be compromised. If so, cycle the
4.
power again. If the 1779 POST message is displayed:
Press the F2 key to re-enable the logical drives.
a.
Sometimes, a drive that has previously been failed by the controller may seem to
tolerance" on page 34).
"Recovering from
Replacing, moving, or adding hard drives 34

Hide quick links:

Advertisement

Table of Contents
loading

This manual is also suitable for:

Rx2620-2 - integrity - 0 mb ramBl860c381513-b21 p800

Table of Contents