- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - HP-UX
- >
- Alternate Path failover - doesn't always work
Categories
Company
Local Language
Forums
Discussions
- Integrity Servers
- Server Clustering
- HPE NonStop Compute
- HPE Apollo Systems
- High Performance Computing
Knowledge Base
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Knowledge Base
Forums
Discussions
- Cloud Mentoring and Education
- Software - General
- HPE OneView
- HPE Ezmeral Software platform
- HPE OpsRamp Software
Knowledge Base
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
04-28-2003 09:37 AM
04-28-2003 09:37 AM
Alternate Path failover - doesn't always work
Using alternate path failover on a SAN doesn't provide protection for all potential failures. For example, if I pull the fibre channel cable on the storage side of a switch, PV Links does not failover to the alternate path. If I pull the cable on the server side of the switch, it does failover.
Does anyone know of a patch that may address this, or a whitepaper documenting this? Third-party products like PowerPath do not have this issue, and it seems like a correctable problem.
Thanks
Scott Riley
Stack Computer, Inc.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
04-28-2003 10:27 AM
04-28-2003 10:27 AM
Re: Alternate Path failover - doesn't always work
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
04-28-2003 10:35 AM
04-28-2003 10:35 AM
Re: Alternate Path failover - doesn't always work
Appears to me this is a fundamental flaw in your SAN design.
Not only do you want the HBAs going to separate switch ports, but those switch ports should have *separate* paths from the switch to the array.
It sounds like you still have a SPOF (the switch -> array path in this case) that needs to be eliminated. Doesn't have to be dedicated (i.e. you can share those paths with other hosts) but MUST be a separate path from the other HBA in this system.
Rgds,
Jeff
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
04-28-2003 02:24 PM
04-28-2003 02:24 PM
Re: Alternate Path failover - doesn't always work
The TD driver does perform a login to the storage device, so there is knowledge at least of what devices it has access to on the SAN. But I can see a difference in what the driver may see between losing the HBA->switch link versus losing the Switch->storage link. I think the difference there is with a failure on the HBA side of the switch, the driver can sense this immediately. With a failure on the storage side of the switch, the HBA must not be sensing a failure in the link (it still has link -- to the switch), and therefore PV Links must rely on an I/O timeout rather than a hard error.
Jeff,
There's no SPOF in the design. Dual independent fabrics, dual HBA's, dual storage controllers.
I'm running the test with an open telnet session to each of the Brocade switches. Using a portperfshow, I can see dynamically the flow of data to the storage devices. Pull the cable on the host side, and the alternate path takes over fairly quickly, about 30 seconds or so. Pull the cable on the storage side (of one of the *two* paths to the LUNS), and it just stops. It seems that PV Links is waiting to time out when there is no hard error to act upon.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
04-29-2003 03:53 AM
04-29-2003 03:53 AM
Re: Alternate Path failover - doesn't always work
I thought about this a lot yesterday and the only reason for failure that I could come up with had to do with marketing and sales, (* buy another utility *) or legacy. (* We've always done it this way. *) For example, a point to point topology doesn't need a switch and doesn't fit into this problem and Point to point existed before fabric.
I'd be interested to know if 'AutoPath' also did this.
Good luck with this worthy endeavor!
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
04-29-2003 03:54 AM
04-29-2003 03:54 AM
Re: Alternate Path failover - doesn't always work
After two months of testing, crash dumps, vendor visits, hp found a bug in lvmkmd that has existed at least since 11.0.
They have plans to fix it in 11.23 but no plans prior to this.