- Community Home
- >
- Storage
- >
- Entry Storage Systems
- >
- Disk Enclosures
- >
- Simultaneous drive failure
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Discussions
Forums
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-01-2008 12:01 AM
тАО03-01-2008 12:01 AM
Curly one for you. I have an ML110 G3 with two drives in a mirror.
The drives are 75GB SATA drives.
A drive in the array failed on Wednesday. I figured no worries, we'll get a new drive there Monday.
Well HP's "next business day Carepaq" allowed us to get a new drive on the following Monday. The HP tech installed the new drive and left.
Only the array never rebuilt. It didn't rebuild because, immediately after the HP tech left, the "good" disk encountered its first read error, and RAID mirrors don't build from disks with a single read error.
I confirmed the issue was not the card/cabling/motherboard by inserting the failed disk into a workstation, direct attach to SATA port, and testing with manufacturer tools, and both the disks were definitely dead.
So long story short, I have to get TWO new drives (another few days for a paid Carepaq) rebuild the array and restore the lot from backup.
At this point, I'm naturally questioning:
- Has anyone ever had such bad luck (assuming it was luck)
- Could anything occur, such as a fault on the board perhaps, which caused both attached drives to actually fail? We've been running on the two new drives for almost a month now with no issue.
- The Hp Storage Management "periodic rescan" was, according to the logs, running regularly and never finding issue. Is there anything further you can do to prevent these issues?
Please don't bring up firmware or drivers, HP offer very little in regards to updates for these low end servers, and this was definitely updated. The latest HP Storage Manager was uninstalled then reinstalled twice before HP would consider sending the parts.
The real issue is, I'm in a position where I'm going to be running larger arrays and SAN boxes, and I'm curious what plans people have, if any, to combat such scenarios.
I have noted servers such as the HP DL380 offer "RAID6" with two parity drives, but I actually can't see that option on the larger HP SAN and NAS devices.
Solved! Go to Solution.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-01-2008 12:21 AM
тАО03-01-2008 12:21 AM
SolutionHaving two drives fail simultaneously or within a short space of time is quite common, especially if both of those drives were bought at the same time.
If you can, check the serial numbers of the failed drives. I would guess they are close together, and therefore probably part of the same manufacturing batch, which would increase the chances of them failing at around the same time.
You can try and get around it by making sure the drives in your mirrorsets or raidsets are from different batches and/or different manufacturers.
Cheers,
Rob
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-01-2008 12:31 AM
тАО03-01-2008 12:31 AM
Re: Simultaneous drive failure
Makes a lot of sense. I'm not sure how I can source drives from different manufacturers if you're ultimately only quoting an HP part number and asking an HP supplier for that drive.
But this experience does concern me somewhat as I look to running much larger arrays.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-01-2008 12:53 AM
тАО03-01-2008 12:53 AM
Re: Simultaneous drive failure
Occasionally manufacturers do have glitches in their processes. I remember a few years ago there was a large batch of Fujitsu 146GB 15K rpm disks that were all faulty. We managed to get quite a few of these installed in our HSG80. Luckily because we were using three member mirrorsets, we never lost any data.
Cheers,
Rob
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-01-2008 10:55 PM
тАО03-01-2008 10:55 PM
Re: Simultaneous drive failure
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-02-2008 11:03 AM
тАО03-02-2008 11:03 AM
Re: Simultaneous drive failure
AFAIK is Cross Vraid an option which you allows to change the Vraid level from a original Vdisk to a clone or snapshot, for example original Vraid is Vraid 1, Snapshot is Vraid 5. Furthermore uses the EVA 5+1 for Vraid 5. Correct me if I'm wrong. :)
But back on topic:
Fortunately I didn't faced a non-recoverable error while rebuilding a RAID. Maybe Joshua has only bad luck. :-|
Best regards,
Patrick
Patrick
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-02-2008 11:45 PM
тАО03-02-2008 11:45 PM
Re: Simultaneous drive failure
I'll be looking into raid ADG on MSA boxes, though it's odd that the bigger EVAs don't offer it.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-03-2008 07:04 AM
тАО03-03-2008 07:04 AM
Re: Simultaneous drive failure
the MTBF of a SATA / FATA drive is much lower then the MTBF of a SCSI / FC drive. I don't think that the missing RAID ADG / 5DP support is a big disadvantage of the EVA. EVA is using RSS (Redundant Storage Sets) for protection. Each RSS is a single RAID protection domain. Vraid 5 Data is stored in a 5+1. Try searching the forum for "RSS", so you will find more informations about it. :)
Best regards,
Patrick
Patrick