SG - Node reboot upon network cables disconnections
07-01-2005 12:49 AM
I would like to have your feedback regarding this issue:
Two-node SG cluster on HP-UX 11i v1, using two rp3440 servers and EVA storage.
One NIC for heartbeat (HB); the two remaining NICs are used for data (one active, one standby).
Upon removing all three network connections from the standby node, the active node reboots by itself. Furthermore, the cluster will not start again unless it is started manually.
Your response is highly appreciated
Regards,
Farid
07-01-2005 01:10 AM
Re: SG - Node reboot upon network cables disconnections
Refer to this document. It details a lot of scenarios dealing with this:
UXSGLVKBAN00000010
(http://www2.itrc.hp.com/service/cki/docDisplay.do?docLocale=en_US&docId=200000062686681)
David
07-01-2005 01:15 AM
Re: SG - Node reboot upon network cables disconnections
can you please verify
Thank you for your fast response
Farid
07-01-2005 01:27 AM
Re: SG - Node reboot upon network cables disconnections
I wonder if there are newer updates on this issue
Thank you David
Farid
07-01-2005 01:29 AM
Solution: This is normal behavior. MC/SG will TOC (Transfer of Control, a forced halt) a node if it cannot communicate with a majority of the other cluster members. It does this to protect data integrity.
Try this link at the HP Docs site. It is a section in the Managing MC/ServiceGuard manual, which explains what you are seeing:
http://docs.hp.com/en/B3936-90073/ch03s07.html#d0e4517
JP
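As a rough illustration of the majority rule mentioned in the post above, here is a minimal Python sketch. It is not Serviceguard code: the function name `reformation_outcome` and the outcome labels are hypothetical, and the three-way split (more than half, exactly half, less than half) is my reading of how the manual describes cluster re-formation.

```python
def reformation_outcome(reachable_nodes: int, total_nodes: int) -> str:
    """Classify what a group of nodes may do after losing contact with the rest.

    reachable_nodes counts the node itself plus every peer it can still hear.
    The labels returned here are illustrative only (assumption).
    """
    if not 0 < reachable_nodes <= total_nodes:
        raise ValueError("reachable_nodes must be between 1 and total_nodes")
    if 2 * reachable_nodes > total_nodes:
        return "reform"       # strict majority: the group re-forms the cluster
    if 2 * reachable_nodes == total_nodes:
        return "tie-breaker"  # exactly half: a cluster lock must decide
    return "halt"             # minority: the node is taken down (TOC)


if __name__ == "__main__":
    # Two-node cluster with every network cable pulled on one node:
    # each node can only see itself, so each holds exactly half.
    print(reformation_outcome(1, 2))
```

In a two-node cluster a total network split always lands in the "exactly half" case, which is why the tie-breaker matters so much in this thread.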
07-01-2005 02:11 AM
Re: SG - Node reboot upon network cables disconnections
Farid
07-27-2005 11:09 PM
Re: SG - Node reboot upon network cables disconnections
When we disconnect all network connections from node1, the TOC occurs on node2, which makes the system unavailable.
One would expect the TOC to happen on node1, leaving node2 operational.
Your opinions are highly appreciated.
Regards,
Farid
07-27-2005 11:30 PM
Re: SG - Node reboot upon network cables disconnections
Here is how it works.
When there is a network disconnect between the nodes, the nodes attempt to "reform" the cluster.
Since neither node can communicate with the other, they cannot form a cluster of a majority of the nodes (i.e. they cannot form a cluster of 2 of the 2 nodes).
Serviceguard, therefore, uses a tie-breaking system to ensure that both nodes don't try to access the data (and hence corrupt it).
The tie-breaker is known as the lock disk (some systems use a quorum server).
Once the nodes realize that they cannot talk to each other, they "race" to the lock disk to try to get it.
In your case, the node with the "failed" network gets to the lock disk first.
When node 2 gets to the lock disk, it sees that it's already owned by node 1.
To ensure that there is no data corruption, node 2 kills itself with the TOC.
This is standard Serviceguard behavior.
Best regards,
Kent M. Ostby
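The race described in the post above can be sketched as a toy model. This is not Serviceguard code: `LockDisk` and `resolve_split` are hypothetical names, and the only rule modeled is "the first node to claim the lock keeps running, the other takes a TOC".

```python
class LockDisk:
    """Hypothetical stand-in for the cluster lock disk: first claimant wins."""

    def __init__(self):
        self.owner = None

    def try_claim(self, node: str) -> bool:
        # The first caller becomes the owner; later callers see it is taken.
        if self.owner is None:
            self.owner = node
        return self.owner == node


def resolve_split(arrival_order):
    """Map each node to its fate, given the order in which nodes reach the disk."""
    lock = LockDisk()
    return {
        node: "keeps running" if lock.try_claim(node) else "TOC"
        for node in arrival_order
    }


if __name__ == "__main__":
    # node1 has no network, but it still reaches the lock disk first.
    print(resolve_split(["node1", "node2"]))
```

The model makes no reference to network state at all, which mirrors the point of the explanation: the lock disk only sees who arrived first, so the node with the pulled cables can win the race.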
07-28-2005 12:12 AM
Re: SG - Node reboot upon network cables disconnections
So this is by design. Can you think of a workaround? Or do you believe that losing all network connections at the same time is not a real-life scenario? I would appreciate further comments.
On the other hand, if you disconnect all FC connections from Node1 to the SAN storage system, Node2 still works but the package does not fail over to Node2.
Do you have an explanation for this?
Best regards,
Farid
07-28-2005 12:25 AM
Re: SG - Node reboot upon network cables disconnections
Yes, you could use the serial heartbeat for a 2-node cluster, but there are issues with this.
>or you believe that losing all network connections at the same time is not a real life scenario..
Well, this is essentially a Multiple Points of Failure (MPOF) scenario, which Serviceguard is generally not designed to cater for.
The suggested option here would be to configure all LANs as HEARTBEAT_IP.
>On the other hand if you disconnect all FC connections to the SAN storage system of Node1, Node2 still works but the package does not failover to node2. Do you have an interpretation for this ?
Yes, the network manager does not monitor FC disk interfaces. To monitor these, you need to look at using EMS monitors.
But again, this is an MPOF.
07-28-2005 06:52 PM
Re: SG - Node reboot upon network cables disconnections
Do you have any official MC/SG failure test scenarios? That would be great.
We will follow your suggestions to use EMS and to configure all networks as HEARTBEAT_IP.
Again thank you very much and best regards,
Farid
07-28-2005 07:36 PM
Re: SG - Node reboot upon network cables disconnections
There are some suggested tests, and also a short discussion regarding using EMS monitoring.
08-03-2005 05:13 AM
Re: SG - Node reboot upon network cables disconnections
"or you believe that losing all network connections at the same time is not a real life scenario.. would appreciate further comments."
I would recommend looking at Quorum Server to decide which node remains up. With QS you can be sure that the node with IP connectivity is the one that stays up.
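To show why a quorum server changes the outcome compared with a lock disk, here is a small Python sketch. It is a simplified model under stated assumptions, not the Quorum Server protocol: `Node`, `network_up` and `quorum_server_arbitration` are hypothetical names, and the only behavior modeled is that the arbiter is reached over the IP network, so a node without connectivity can never win.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    network_up: bool  # assumption: True if the node can reach the quorum server


def quorum_server_arbitration(nodes):
    """Return {name: outcome}; nodes are listed in the order they ask for quorum."""
    winner = None
    outcomes = {}
    for node in nodes:
        # A request only arrives if the node still has IP connectivity.
        if node.network_up and winner is None:
            winner = node.name
        outcomes[node.name] = "keeps running" if winner == node.name else "TOC"
    return outcomes


if __name__ == "__main__":
    # node1 has all cables pulled and tries first, yet node2 is the survivor.
    print(quorum_server_arbitration([Node("node1", False), Node("node2", True)]))
```

Unlike the lock disk, which is reached over the storage path, this arbiter is unreachable for the isolated node, so the order of attempts no longer lets that node win.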