01-20-2006 02:50 AM
Red Hat AS3 Update 6 Cluster Suite
Jan 20 15:49:56 ralph clusvcmgrd[4311]:
Jan 20 15:49:57 ralph clumembd[4144]:
Jan 20 15:49:58 ralph clumembd[4144]:
Jan 20 15:49:59 ralph cluquorumd[4119]:
Jan 20 15:49:59 ralph cluquorumd[4119]:
huey-c has been fenced
Jan 20 15:49:59 ralph cluquorumd[4119]:
mpromised!
Jan 20 15:50:00 ralph clusvcmgrd[4311]:
001
Jan 20 15:50:00 ralph clusvcmgrd[4311]:
Jan 20 15:50:08 ralph clumembd[4144]:
Jan 20 15:50:12 ralph clumembd[4144]:
01-20-2006 03:04 AM
Re: Red Hat AS3 Update 6 Cluster Suite
I do not have any power switches, and the external disks are on an MSA1000 attached via Fibre Channel.
01-21-2006 08:29 AM
Re: Red Hat AS3 Update 6 Cluster Suite
I don't think you've fully configured the cluster.
STONITH: Falsely claiming that huey-c has been fenced
STONITH stands for Shoot The Other Node In The Head.
It's trying to shut down the other node because it thinks it's down or there is a risk of data corruption.
Checklist:
MSA1000 firmware is up to date.
SANsurfer package is installed on both servers to check the state of the shared storage.
Shared storage is configured so that the sd# devices are the same on both nodes.
Firmware on the QLogic cards is the same on all cards in all servers, and reasonably up to date.
Cluster configuration files.
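For the sd# item in the checklist, a minimal sketch of how the comparison could be scripted, assuming you have already collected a device-to-LUN-identifier mapping from each node by hand. The function name `diff_device_maps` and the identifiers such as `lun-0001` are hypothetical and used only for illustration.

```python
def diff_device_maps(node_a, node_b):
    """Return {device: (id_on_a, id_on_b)} for every device that differs.

    A device missing on one node is reported with None on that side.
    """
    mismatches = {}
    for dev in sorted(set(node_a) | set(node_b)):
        id_a = node_a.get(dev)
        id_b = node_b.get(dev)
        if id_a != id_b:
            mismatches[dev] = (id_a, id_b)
    return mismatches


# Hypothetical data: the two nodes see the same LUNs in swapped order.
ralph = {"sda": "lun-0001", "sdb": "lun-0002"}
huey = {"sda": "lun-0002", "sdb": "lun-0001"}

for dev, (on_ralph, on_huey) in diff_device_maps(ralph, huey).items():
    print(f"{dev}: ralph sees {on_ralph}, huey sees {on_huey}")
```

An empty result would mean both nodes map every sd# device to the same LUN.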
SEP
Owner of ISN Corporation
http://isnamerica.com
http://hpuxconsulting.com
Sponsor: http://hpux.ws
Twitter: http://twitter.com/hpuxlinux
Founder http://newdatacloud.com
01-21-2006 11:03 PM
Re: Red Hat AS3 Update 6 Cluster Suite
(Chapter 3)
Rgds,
Vitaly
01-21-2006 11:27 PM
Re: Red Hat AS3 Update 6 Cluster Suite
Thanks for the advice, but we found the problem. The STONITH errors were a red herring. This cluster has no power switches, so it cannot STONITH a node that it perceives as having gone "down".
The cause of "huey" dropping in and out of the cluster every few seconds turned out to be a clash between two Red Hat clusters on the same network using the same 225.0.0.11 multicast address. We changed our multicast address to a unique one, reloaded the config, and restarted the cluster, and the problem has gone away. The cluster is stable now.
It would have been nice for Red Hat to have reported this somewhere. We only discovered what was going on after pinging the multicast address and seeing more DUP responses than we expected, some of them from IP addresses belonging to the other Red Hat cluster.
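For anyone wanting to repeat that check, here is a rough sketch of how the ping output could be screened for responders that are not members of your own cluster. It assumes the usual Linux `ping` reply format ("64 bytes from <ip>: ..."); the helper names and the 192.0.2.x addresses are hypothetical placeholders, not values from this cluster. The second helper only confirms that a candidate address lies in the IPv4 multicast range (224.0.0.0/4).

```python
import ipaddress
import re

# Matches the source address in a ping reply line such as
# "64 bytes from 192.0.2.10: icmp_seq=1 ttl=64 time=0.2 ms (DUP!)"
REPLY_RE = re.compile(r"bytes from (\d{1,3}(?:\.\d{1,3}){3})")


def unexpected_responders(ping_output, expected_members):
    """Return the sorted responder IPs that are not expected cluster members."""
    responders = set(REPLY_RE.findall(ping_output))
    return sorted(responders - set(expected_members))


def is_multicast(address):
    """True if the address lies in the multicast range."""
    return ipaddress.ip_address(address).is_multicast


# Hypothetical captured output from pinging the cluster's multicast address.
sample_output = """\
64 bytes from 192.0.2.10: icmp_seq=1 ttl=64 time=0.2 ms
64 bytes from 192.0.2.11: icmp_seq=1 ttl=64 time=0.3 ms (DUP!)
64 bytes from 192.0.2.50: icmp_seq=1 ttl=64 time=0.4 ms (DUP!)
"""
own_cluster = {"192.0.2.10", "192.0.2.11"}

print(unexpected_responders(sample_output, own_cluster))
```

Any address printed here answered on the multicast group without being one of your own nodes, which is the symptom described above.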