- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - Linux
- >
- Re: RH Cluster Suite - Fence manual does not work
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-02-2006 02:48 AM
03-02-2006 02:48 AM
RH Cluster Suite - Fence manual does not work
I configured the fencing method as manual. The problem is that when I power off one node to test it, the fencing does not work, on the log I get:
fenced: fencing node "nodename"
fenced: fence "nodename" failed
This will show forerer and the cluster will hang, until the other node join the cluster again.
If I run fence_manual I get:
sucess: fence_manual "nodename"
In the log I get:
Waiting for "nodename" to rejoing the cluster or for manual acknowledment that it has been reset (i.e. fence_ack_manual -n "nodename")
If I run fence_ack_manual -n "nodename" I get:
can't open /tmp/fence_manual.fif: No such file or directory.
If I do strace of fence_manual, I see:
mknod("/tmp/fence_manual.fifo", S_IFIFO|0600) = 0
write (1, "sucess: ....) = 41
unlink("/tmp/fence_manual.fifo") = 0
Why I get the unlink? Is this removing the file before I run fence_ack_manual?
I'm currently just testing, I know that I should use another fencing method, but it would be nice if I could just have this working to do all other testings.
If I stop the fenced daemon (even if I souldn't), the services will relocate, because it won't try to fence the node (and then hang the cluster), but GFS won't work.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-02-2006 09:51 AM
03-02-2006 09:51 AM
Re: RH Cluster Suite - Fence manual does not work
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-02-2006 10:49 PM
03-02-2006 10:49 PM
Re: RH Cluster Suite - Fence manual does not work
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2006 01:50 AM
03-20-2006 01:50 AM
Re: RH Cluster Suite - Fence manual does not work
I have the same problem here, but wasn't yet able to keep the whole cluster from blocking if one of the nodes crashes.
While powering off one of the nodes over ILO the whole gfs blocks any access. Trying to fence that host afterwards with fence_manual or fence_ack_manual I run in the same error messages you mentioned before.
You closed this case but hopefully you might have an idea, how I can solve this...
I attached the configuration file to this reply. Please take a short look at it.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2006 01:51 AM
03-20-2006 01:51 AM
Re: RH Cluster Suite - Fence manual does not work
I have the same problem here, but wasn't yet able to keep the whole cluster from blocking if one of the nodes crashes.
While powering off one of the nodes over ILO the whole gfs blocks any access. Trying to fence that host afterwards with fence_manual or fence_ack_manual I run in the same error messages you mentioned before.
You closed this case but hopefully you might have an idea, how I can solve this...
I attached the configuration file to this post. Please take a short look at it.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2006 02:06 AM
03-20-2006 02:06 AM
Re: RH Cluster Suite - Fence manual does not work
Ensure that your /etc/hosts file is right, and the node name should point to the interconnect ip address.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-22-2006 05:04 AM
03-22-2006 05:04 AM
Re: RH Cluster Suite - Fence manual does not work
This error "can't open /tmp/fence_manual.fifo: No such file or directory." appears if fence_ack_manual is run without having run "fence_ack" before on the same node. The command fence_manual creates this file and waits for fence_ack_manual.
My problem was my cluster.conf:
Wrong:
Right:
Just if someone finds this thread and needs to know how the story finished... :-)
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-22-2006 05:29 AM
03-22-2006 05:29 AM
Re: RH Cluster Suite - Fence manual does not work
I wrote a monitor script for RH4 CL that handles this problem with a fence acknowledge command.
Works well.
Good Luck.
SEP
Owner of ISN Corporation
http://isnamerica.com
http://hpuxconsulting.com
Sponsor: http://hpux.ws
Twitter: http://twitter.com/hpuxlinux
Founder http://newdatacloud.com