- Community Home
- >
- Storage
- >
- Entry Storage Systems
- >
- Disk Enclosures
- >
- Questionable Best Practice MSA2012 disk layout (re...
Disk Enclosures
1752665
Members
5453
Online
108788
Solutions
Forums
Categories
Company
Local Language
back
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Forums
Discussions
back
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Blogs
Information
Community
Resources
Community Language
Language
Forums
Blogs
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
11-02-2010 06:10 AM
11-02-2010 06:10 AM
Questionable Best Practice MSA2012 disk layout (resilience-wise)
Having read as much as I could find on best-practice for configuring an MSA2000 with additional shelves to support a reasonably busy Exchange 2003 cluster late last year we've had a rather busy week in DR mode due to multiple PSU failures (which I agree is probably fairly uncommon) but it raises a serious question in my eyes.
Basically, the disks were configured as described in the attached JPG, striping "vertically" across enclosures rather than horizontally. Database VDisks 1 & two each had two volumes created/presented to hosts, as did TL Vdisks 1 & 2. However, as the disks were there for performance (spindle-counts) rather than capacity, these Vdisks were under 50% utilised from a capacity perspective.
We had a rather unique situation whereby we had a double PSU failure (one each in enclosures .2 & .3) for which we logged a hardware call. By the time the replacement PSUs turned up, a third had died (the redundant one in enclosure .3) resulting in enclosure.3 taking its disks offline.
On resumption of power, all of the disks in enclosure .3 (bottom in the diag) were in "LeftOver" state and all VDisks were "Critical" but still online.
From an O/S perspective (2003) all volumes appeared to be online with the exception of one hosted on DB VDisk 2.
To be fair, for these VDisks to still be online with so many physical disks offline is actually pretty impressive and can only be due to the low capacity utilisation I would imagine (i.e. plenty of spare capacity to stripe all the data).
However, the fact that all the LEFTOVER disks had to have their metadata wiped before they could be added back into the VDisks seems to be a massive issue in my opinion! Effectively, following Best Practice disk layout has left us with a "split-brain" type VDisk which would have most likely failed completely had we utilised more of it.
This makes me wonder if we're not better off (from a resilience perspective) striping *across* enclosures rather than *down* enclosures.
What experience have others had?
I have an earlier post (with no replies) describing another MSA we have which is configured with a VDisk per enclosure which I was asking about reconfiguring to BP but now I have serious doubts and wondered what others thought. The problems we had kicked in the day after I posted that message!
We've since had a 4th PSU fail in the same MSA so I've instigated a question about reliability/bad batches etc. to HP as there's no evidence of any underlying power issues in our data centre.
Any opinions appreciated...
Paul
Basically, the disks were configured as described in the attached JPG, striping "vertically" across enclosures rather than horizontally. Database VDisks 1 & two each had two volumes created/presented to hosts, as did TL Vdisks 1 & 2. However, as the disks were there for performance (spindle-counts) rather than capacity, these Vdisks were under 50% utilised from a capacity perspective.
We had a rather unique situation whereby we had a double PSU failure (one each in enclosures .2 & .3) for which we logged a hardware call. By the time the replacement PSUs turned up, a third had died (the redundant one in enclosure .3) resulting in enclosure.3 taking its disks offline.
On resumption of power, all of the disks in enclosure .3 (bottom in the diag) were in "LeftOver" state and all VDisks were "Critical" but still online.
From an O/S perspective (2003) all volumes appeared to be online with the exception of one hosted on DB VDisk 2.
To be fair, for these VDisks to still be online with so many physical disks offline is actually pretty impressive and can only be due to the low capacity utilisation I would imagine (i.e. plenty of spare capacity to stripe all the data).
However, the fact that all the LEFTOVER disks had to have their metadata wiped before they could be added back into the VDisks seems to be a massive issue in my opinion! Effectively, following Best Practice disk layout has left us with a "split-brain" type VDisk which would have most likely failed completely had we utilised more of it.
This makes me wonder if we're not better off (from a resilience perspective) striping *across* enclosures rather than *down* enclosures.
What experience have others had?
I have an earlier post (with no replies) describing another MSA we have which is configured with a VDisk per enclosure which I was asking about reconfiguring to BP but now I have serious doubts and wondered what others thought. The problems we had kicked in the day after I posted that message!
We've since had a 4th PSU fail in the same MSA so I've instigated a question about reliability/bad batches etc. to HP as there's no evidence of any underlying power issues in our data centre.
Any opinions appreciated...
Paul
The opinions expressed above are the personal opinions of the authors, not of Hewlett Packard Enterprise. By using this site, you accept the Terms of Use and Rules of Participation.
News and Events
Support
© Copyright 2024 Hewlett Packard Enterprise Development LP