08-25-2010 03:15 AM
Fencing Issue...RedHat Cluster Suite
Please help me.
Regards
Athar
08-25-2010 03:46 AM
Re: Fencing Issue...RedHat Cluster Suite
- ordered (the failover domain has been configured to prefer node1 over the others)
- failback enabled
In this case, the cluster will try to return the service (and all its resources) to node1 as soon as it joins the cluster again. The cluster is simply doing what it's configured to do. If this automatic failback is not desirable, disable it.
If you use Conga to configure your cluster, check the checkbox labelled "Do not fail back services in this domain" in the failover domain configuration.
If you'd rather edit the XML configuration manually, the attribute is 'nofailback="1"'. It should be added to the failoverdomain tag:
...
...
...
...
...
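As a rough illustration of the manual edit described above, here is a minimal Python sketch that sets nofailback="1" on every failoverdomain tag. The XML fragment, the domain name "example_domain" and the node names are placeholders invented for this example, not the configuration from this thread, and the helper name disable_failback is hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical cluster.conf fragment; domain and node names are placeholders.
SAMPLE = """
<rm>
  <failoverdomains>
    <failoverdomain name="example_domain" ordered="1" restricted="0">
      <failoverdomainnode name="node1" priority="1"/>
      <failoverdomainnode name="node2" priority="2"/>
    </failoverdomain>
  </failoverdomains>
</rm>
"""


def disable_failback(xml_text: str) -> str:
    """Return the XML with nofailback="1" set on every failoverdomain tag."""
    root = ET.fromstring(xml_text.strip())
    for domain in root.iter("failoverdomain"):
        domain.set("nofailback", "1")
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(disable_failback(SAMPLE))
```

The sketch only shows where the attribute goes; it does not distribute the changed configuration to the cluster nodes.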
MK
08-25-2010 04:07 AM
Re: Fencing Issue...RedHat Cluster Suite
Automatic failback is already disabled. Sir, the cluster resource fails over to the passive node when the active node is just powering on, I mean during POST. I think there is some issue with my iLO fencing configuration.
When I power off the active node and remove it from the chassis, its iLO is also offline. In this situation the resource still shows on the active node, but when I power the active node on again, the resource starts failing over to the passive node.
I am attaching my cluster.conf.
08-25-2010 10:21 AM
Re: Fencing Issue...RedHat Cluster Suite
http://forums.itrc.hp.com/service/forums/questionanswer.do?threadId=1444969
08-26-2010 05:26 AM
Re: Fencing Issue...RedHat Cluster Suite
If you physically remove the node1 blade, the cluster will notice that the blade is no longer sending heartbeats, and will attempt to fence it.
The important point is: once fencing is started, cluster operations will continue normally *only after the cluster has received confirmation of successful fencing*.
But when the blade is physically disconnected, the other node(s) won't be able to reach the iLO of the disconnected blade, so the fencing attempt will fail.
Now, node1 is unresponsive, so its status is unknown to the other nodes. Is node1 dead, or is it just on the other end of a bunch of cables destroyed by a server rack tipping over? The cluster has no way of knowing.
Because the attempt to fence node1 failed, the other node can only wait and see: "well, if node1 is alive, it will fence _us_, and then the situation will be resolved. Or perhaps node1 is rebooting and will soon be rejoining the cluster, and all will be well again."
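The rule above (no recovery until fencing is confirmed) can be sketched as a toy model in Python. This is not how the cluster software is actually implemented; the names handle_lost_node and Outcome are made up for illustration, and the fence callable stands in for a real fence agent talking to the iLO. A real cluster keeps retrying; max_attempts is only there so the sketch terminates.

```python
from enum import Enum
from typing import Callable


class Outcome(Enum):
    RECOVERED = "services recovered on surviving node"
    BLOCKED = "recovery blocked: fencing not confirmed"


def handle_lost_node(fence: Callable[[], bool], max_attempts: int = 3) -> Outcome:
    """Try to fence a vanished node; recover services only on confirmed success."""
    for _ in range(max_attempts):
        if fence():
            # Fencing confirmed: the lost node cannot touch shared storage,
            # so it is safe to start its services elsewhere.
            return Outcome.RECOVERED
    # Fencing never confirmed: the state of the lost node is unknown,
    # so the survivor must not take over its services.
    return Outcome.BLOCKED


if __name__ == "__main__":
    # Blade pulled from the chassis: the iLO cannot be reached.
    print(handle_lost_node(lambda: False))
    # Blade reinserted and powered on: the iLO answers, fencing succeeds.
    print(handle_lost_node(lambda: True))
```

In this model the failover happens only once the iLO is reachable again, which would be consistent with the resource moving at the moment the removed blade is powered back on.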
By physically disconnecting the blade, you'll simultaneously cause multiple failures:
- multiple network connection failures
- fencing connection failure
- storage connection failure
This is more failures than RedHat Cluster can deal with.
Do you have a quorum disk? If you don't, and you have only 2 nodes in your cluster, your cluster may become inquorate after you unplug the node1 blade. An inquorate segment of a cluster may not run any services, and it may not make any fencing decisions either.
In a RedHat Cluster, a two-node cluster is a very tricky special case. If the cluster configuration sets the special "two_node" parameter to 1, the quorum check is essentially overridden. But the fundamental rule is still the same: if a node vanishes, the remaining node must *successfully* fence the vanished node before failovers or other cluster processing may continue.
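To make the quorum arithmetic concrete, here is a small Python sketch under simplifying assumptions: every node and the quorum disk carry one vote each, quorum is a simple majority of the expected votes, and two_node lowers the threshold to one vote. The function names are hypothetical and the model ignores many details of the real cluster software.

```python
def quorum_threshold(expected_votes: int, two_node: bool = False) -> int:
    """Votes needed for quorum in this simplified majority model."""
    if two_node:
        return 1  # special case: a single surviving node stays quorate
    return expected_votes // 2 + 1


def is_quorate(votes_present: int, expected_votes: int, two_node: bool = False) -> bool:
    return votes_present >= quorum_threshold(expected_votes, two_node)


def may_recover(quorate: bool, fence_confirmed: bool) -> bool:
    """Being quorate is not enough: fencing must also have succeeded."""
    return quorate and fence_confirmed


if __name__ == "__main__":
    # Two nodes, no quorum disk, node1 unplugged: 1 of 2 votes remain.
    print(is_quorate(1, 2))                 # inquorate
    # Same situation with two_node set.
    print(is_quorate(1, 2, two_node=True))  # quorate
    # Two nodes plus a one-vote quorum disk: survivor + disk = 2 of 3 votes.
    print(is_quorate(2, 3))                 # quorate
    # Quorate, but the iLO is unreachable, so recovery still waits.
    print(may_recover(True, False))
```

The last line is the point of the paragraph above: with a quorum disk or two_node the survivor can stay quorate, yet recovery remains blocked while fencing is unconfirmed.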
MK
08-26-2010 06:38 AM
Re: Fencing Issue...RedHat Cluster Suite
I always thought how ServiceGuard did it was the right way: reset yourself if you are alone. Is this perhaps misguided? Or can such behaviour be 'emulated' with the linux clusters on RHEL/SLES?
08-26-2010 09:25 AM
Re: Fencing Issue...RedHat Cluster Suite
Yes, I am using a quorum disk, and there is no issue with failover: the resource relocates to the other node when I reboot or shut down the node. But when I power off the node and remove it from the chassis, I face the failover issue.
Please guide me. For example, if a node goes down due to a hardware problem, probably the motherboard, what is the status of the iLO? Is the iLO still alive? Will fencing work?
Please help me or share some HP documentation.
Regards
Athar
08-26-2010 09:28 AM
Re: Fencing Issue...RedHat Cluster Suite
Regards
Athar