after disk replacement, node won't join cluster
09-29-2005 03:31 AM
B3936AA_APZ A.10.06 MC / Service Guard
B5125AA_APZ A.10.05 MC/ServiceGuard NFS Toolkit
PHSS_10340 B.10.00.00.AA MC/ServiceGuard NFS Toolkit cumulative patch
PHSS_20577 B.10.00.00.AA MC/ServiceGuard and MC/LockManager A.10.06 patch
When I run cmrunnode on the system that just had to have a disk replaced (/usr), I get:
{porky:root}# cmrunnode
Unable to receive message from configuration daemon on porky: Software caused connection abort
There apparently was some disk corruption. I have already ftp'd over the cmcld executable from one of the working nodes:
{froggy:root}# file cmcld
cmcld: s800 shared executable dynamically linked -not stripped
{porky:root}# file cmcld
cmcld: commands text
I'm filling in for someone and my Serviceguard knowledge is quite rusty. I see the following running on the two other nodes:
froggy:
{froggy:root}# ps -ef |grep cm
root 11444 1 0 Sep 23 ? 56:33 /usr/lbin/cmcld -j
root 11469 11444 0 Sep 23 ? 0:00 /usr/lbin/cmlvmd
I don't seem to have a man page for cmcld, so I don't know what the -j option (join?) does. I see that cmcld is run with different options on the two running nodes.
Can someone point me in the right direction?
Thanks
butch:
{butch:root}# ps -ef |grep cm
root 2842 2836 0 Sep 20 ? 0:00 /usr/lbin/cmlvmd
root 2922 2836 0 Sep 20 ?
root 2836 1 0 Sep 20 ? 109:31 /usr/lbin/cmcld -m -n froggy -n butch
09-29-2005 03:41 AM
Re: after disk replacement, node won't join cluster
Which Serviceguard-related files have you ftp'd from the other system? If they include binaries, you need to check and apply the configuration. Were there any problems restoring /usr, or did the restore complete cleanly?
HTH,
Devender
09-29-2005 03:45 AM
Re: after disk replacement, node won't join cluster
What do you mean by:
"There apparently was some disk corruption"?
Please tell us more about the failed disk and the way you replaced it. Are the disks mirrored? Your system has to be stable to run production. If there is any corruption, restore from backup.
Hope this helps!
Regards
Torsten.
__________________________________________________
There are only 10 types of people in the world -
those who understand binary, and those who don't.
__________________________________________________
No support by private messages. Please ask the forum!
If you feel this was helpful please click the KUDOS! thumb below!
09-29-2005 03:46 AM
Solution
The bad news is that the cmcld binary on the suspect system has been corrupted. What else may have been corrupted?
You may be better off recovering a known good backup to this system.
As an aside, you are running a totally obsolete, unsupported version of Serviceguard on an unsupported OS version.
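One way to get a rough answer to "what else may have been corrupted" before deciding on a full restore is to compare checksums against a known good node. The sketch below is only an illustration, not a Serviceguard tool: the function name `compare_manifest` and the manifest path are made up for this example, and it assumes the good node runs the same OS and patch level so that identical files produce identical `cksum` output.

```shell
#!/bin/sh
# Sketch: check local files against a checksum manifest from a known good node.
# Assumption: the manifest is plain `cksum` output, one "<crc> <size> <path>" per line,
# generated on the good node with something like:
#   cksum /usr/lbin/cm* > /tmp/sg.manifest      (hypothetical manifest path)

compare_manifest() {
    manifest=$1
    status=0
    while read -r crc size path; do
        # Skip blank or malformed lines.
        [ -n "$path" ] || continue
        if [ ! -f "$path" ]; then
            echo "MISSING  $path"
            status=1
            continue
        fi
        # Reading from stdin makes cksum print only "<crc> <size>".
        local_sum=$(cksum < "$path")
        if [ "$local_sum" != "$crc $size" ]; then
            echo "DIFFERS  $path"
            status=1
        fi
    done < "$manifest"
    return $status
}
```

After copying the manifest to the suspect node, `compare_manifest /tmp/sg.manifest` prints one line per missing or differing file and returns non-zero if it found any. A clean result only covers the files listed in the manifest, so it does not replace restoring from a known good backup.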
09-29-2005 03:59 AM
Re: after disk replacement, node won't join cluster
That is the only file I have recovered so far (just ftp'd from a known good, working node).
Any thoughts on what to recover?
/usr/lbin/cm*?
/etc/cmcluster
??
The disk was mirrored, but there were apparently some stale extents even before the one disk failed. As you can surmise, this is an old cluster and we are moving away from it soon. We can live without this node, but as long as we are using the cluster I want all three nodes up and running if I can.
thanks
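Since stale extents on the mirror came up, here is a minimal sketch for spotting them. It assumes that `lvdisplay -v` marks out-of-date extents with the word "stale" in its status column; the helper name `count_stale` and the logical volume path in the usage note are hypothetical.

```shell
#!/bin/sh
# Sketch: count lines mentioning "stale" in `lvdisplay -v` output read from stdin.
# Assumption: stale extents are flagged with the word "stale" in the status column.

count_stale() {
    # grep -c prints 0 but exits non-zero when nothing matches,
    # so mask the exit status to keep callers simple.
    grep -ci 'stale' || true
}
```

For example, `lvdisplay -v /dev/vg00/lvol7 | count_stale` would print the number of matching lines for that one logical volume (the lvol name is only a placeholder). A non-zero count on a mirror that is supposed to be in sync is a reason to resynchronize and recheck before trusting the data on it.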
09-29-2005 04:05 AM
Re: after disk replacement, node won't join cluster
Recovery here means restoring the complete OS from Ignite backups, on the assumption that, like this file, other files in /usr may also have been corrupted.
HTH,
Devender
09-29-2005 04:24 AM
Re: after disk replacement, node won't join cluster
09-29-2005 05:07 AM
Re: after disk replacement, node won't join cluster
If it's vg00, you probably need to replace the disk and restore your make_net_recovery or make_sys_recovery from Ignite.
Otherwise, you probably have to use vgreduce -f to force the reduction of the volume group in question, and then rebuild it after following the mandatory steps that the vgreduce -f command displays after runtime.
SEP
Owner of ISN Corporation
http://isnamerica.com
http://hpuxconsulting.com
Sponsor: http://hpux.ws
Twitter: http://twitter.com/hpuxlinux
Founder http://newdatacloud.com
09-29-2005 05:09 AM
Re: after disk replacement, node won't join cluster
Thanks again!
09-29-2005 05:10 AM