- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - HP-UX
- >
- loss of network with MCSG
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-25-2005 01:51 AM
03-25-2005 01:51 AM
loss of network with MCSG
I made the following test on one of our 2 MCSG nodes :
I've disconnected all network links (even HeartBeat) from node A. I was surprised to see that the lock was acquired by node A and that node B (the only node working well) performed a TOC.
Could you please confirm that this is a regular behaviour ?
Thanks
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-25-2005 03:23 AM
03-25-2005 03:23 AM
Re: loss of network with MCSG
If it was on node A the TOC of node B would make sense.
If it wasn't on node A then the lock race was obviously won by node A resulting in the TOC of node B.
You have to remember that the nodes are not checking themselves but are checking for the existence of their fellow node members. So when node B could no longer "see" node A it decided to TOC itself so that there would be no "split-brain" possibility.
If you had only pulled the public network and not the heartbeat the pkg would have failed over to node B. But by pulling the heartbeat as well you forced a lock contention which node A won.
Rgds,
Jeff
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-25-2005 03:59 AM
03-25-2005 03:59 AM
Re: loss of network with MCSG
Serviceguard only determines that it cannot communicate with the other server.
When both nodes experience the same breakdown in heartbeat connection, they both seek the cluster arbitration device, and whichever one gets to it first is authorized to reform a 1-node cluster... and the last arriving node if forced to TOC(dump/reboot).
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-25-2005 05:13 AM
03-25-2005 05:13 AM
Re: loss of network with MCSG
Keep in mind that all that is going on is a race to the lock disk to decide who will stay up and who will stay up.
You may want to check out this document which discusses the differences between using a lock disk and using a quorem server:
http://www2.itrc.hp.com/service/cki/docDisplay.do?docLocale=en_US&docId=UMCSGKBRC00012642
ITRC DOC ID: UMCSGKBRC00012642
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-25-2005 09:05 PM
03-25-2005 09:05 PM
Re: loss of network with MCSG
Remember, you induced a MPOF which Serviceguard is not generally designed to cater for.
There is one solution and that is to use a serial heartbeat, but this can be troublesome in it's own right.
Also, having a Quorum Server would help as node A would not have been able to get to the lock.