- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - HP-UX
- >
- Will both node halt if heartbeat fails
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
10-11-2004 10:16 PM
10-11-2004 10:16 PM
In the event of a LAN interface failure, a local switch is done to a standby LAN interface if one exists. If a heartbeat LAN interface fails and no standby is configured, the node fails with a TOC. If a data LAN interface fails without a standby, the node fails with a TOC only if Package Failfast (described further in the "Planning" chapter under "Package Configuration Planning") is enabled for the package.
This weekens we're doing some upgrade to the electrical system in one of the server room and therefor node 1 will be turned off completely and the discs.
Question is, will node 2 go into a TOC and halt (shutdown) because it looses allcontact with node 1 and the discs?
Solved! Go to Solution.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
10-11-2004 10:19 PM
10-11-2004 10:19 PM
SolutionIf 'all' heartbeats fail, then the node that cannot acquire the lock disk (or quorum) will TOC itself.
However, if you manually bring down the node using 'cmhaltnode', then you do not need to worry about other node crashing as the heartbeats to that node will no more be checked until it is put into the cluster again.
-Sri
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
10-11-2004 10:26 PM
10-11-2004 10:26 PM
Re: Will both node halt if heartbeat fails
Also, just to ponder a bit around this questions. Why does the whole node shut down, isn't it enough to just stop the cluster service? Is there a way to configure it so only the cluster stops in this case and not the whole server?
Are there any drawbacks to not shuting down the server?
I assume TOC here actually means shutting donw the server completely and not just the cluster deamon?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
10-11-2004 10:33 PM
10-11-2004 10:33 PM
Re: Will both node halt if heartbeat fails
//Just to be perfectly clear here, if i were to shut down and power down node 1 in a 2 node cluster, node 2 will also shut down since it will loose all contact with node 1 because of the heartbeats beeing lost?//
If you shutdown node1 after halting the cluster daemon (cmhaltnode), then node2 will not go down as the cluster will be reformed with only node2 as the member. This is not a failure. This is one of the common practices used by SAs to do maintenances. Halt the node, do the maintenance, move the package to that node, bring down the primary node for maintenance etc., to minimize the downtime.
//lso, just to ponder a bit around this questions. Why does the whole node shut down, isn't it enough to just stop the cluster service? Is there a way to configure it so only the cluster stops in this case and not the whole server? //
Whole node will have to go down as there is a good possibility of the shared volume groups active on the system in case of 'package failures' which may potentially corrupt the data.
// assume TOC here actually means shutting donw the server completely and not just the cluster deamon?//
TOC means 'transfer of control.' The control will be passed immediately to hardware to ensure a quick shutdown of the system to prevent further data corruption.
-Sri
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
10-11-2004 11:20 PM
10-11-2004 11:20 PM
Re: Will both node halt if heartbeat fails
as this is NOT a failure condition, there i sno requirement to check for cluster lock or to verify heartbeats to the other node, so this node should remain in operation during the maintenance period of the other node.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
10-11-2004 11:23 PM
10-11-2004 11:23 PM
Re: Will both node halt if heartbeat fails
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
10-11-2004 11:28 PM
10-11-2004 11:28 PM
Re: Will both node halt if heartbeat fails
Only one node should TOC if you lose heartbeats, or have network issues.
The only time there should be BOTH nodes TOC'ing is when neither node can get the Cluster lock disc.
You ar eplanning to do a planned outage, by shutting down the node in question.
This causes the node to leave the cluster, resulting in a cluster reformation into a single node cluster. This is normal, expected behaviour, not a failure, and once the node has reformed as a single node cluster it will not try to monitor the other node, until that node is rebooted and it rejoins the running cluster.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
10-11-2004 11:34 PM
10-11-2004 11:34 PM
Re: Will both node halt if heartbeat fails
Reason i keep asking is because we started a what if discussion here and i became a bit confused and needed some clarifications.
Someone claimed that that one of the node had a serious crash some time ago and resulted it node 1 going down hard and there was a loss of the network so the hearthbeats and lans between the nodes went down.
This again resulted in node 2 shuting down which i cant quite udnerstand. I can see how it has to halt the cluster service for the reasons mentioned earlier here.
Reason is that node 2 is also out omniback server (not in the cluster) so we really dont what the whole server shuting donw in cases like this.
So basically i'm just wondering if such a scenario could have happened? And if so, is there a safe way to avoid the server from shuting down an rather just stop the cluster deamon. I would think stopping the cluster deamon would achive the same result as shutding down the node to avoid data loss.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
10-11-2004 11:43 PM
10-11-2004 11:43 PM
Re: Will both node halt if heartbeat fails
Checking all the OLDsyslogs and package logs at the time of the incident may ahve helped.
For you rplanned outage, simply shutdown the one node, the other node should remain alive.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
10-11-2004 11:46 PM
10-11-2004 11:46 PM
Re: Will both node halt if heartbeat fails
So should i stop the cluster service on node 2 too?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
10-12-2004 12:00 AM
10-12-2004 12:00 AM
Re: Will both node halt if heartbeat fails
But this does indicate that in your previous failure, you may have lost the ability to contact the cluster lock disc in a failre scenario, and this WOULD result in the 2nd TOC'ing
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
10-12-2004 12:02 AM
10-12-2004 12:02 AM