- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - HP-UX
- >
- Re: host fails to join cluster after reboot
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-08-2002 02:29 AM
05-08-2002 02:29 AM
host fails to join cluster after reboot
I have attached syslog note the "permission denied for root user" and the filename it shows (this file lists the node and root in it). This node is called "saturn"
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-08-2002 02:43 AM
05-08-2002 02:43 AM
Re: host fails to join cluster after reboot
It is ok the .rhosts files in all the nodes?
Regards,
Justo.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-08-2002 03:00 AM
05-08-2002 03:00 AM
Re: host fails to join cluster after reboot
Check
1)That saturn is in
/etc/cmcluster/cmclnodelist
2)The reason why some home directories are not found.
3)Question
Is there another cluster on this subnet.
Steve Steel
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-08-2002 03:08 AM
05-08-2002 03:08 AM
Re: host fails to join cluster after reboot
not sure what you mean about the home dirs (if its /home then yeah they seem ok).
This is a 5 node cluster.
any other ideas?
cheers,
mark.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-08-2002 03:17 AM
05-08-2002 03:17 AM
Re: host fails to join cluster after reboot
Regards,
John
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-08-2002 03:32 AM
05-08-2002 03:32 AM
Re: host fails to join cluster after reboot
Also, have you run cmscancl to check things out that way?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-08-2002 04:03 AM
05-08-2002 04:03 AM
Re: host fails to join cluster after reboot
(1) You will need user root in the .rhosts and cmnodelist files for the LOCAL server as well as the remote servers.
(2) Did you attempt to rebuild the cluster? Your syslog file shows several
vgchange -c n
followed by
vgchange -a y
which is potentially disasterous if your volume group can be activated on another node at the same time. i/o diags related? Why do you want to remove cluster-awareness of VGs when activating your package? Normally you keep vgchange -c y (permanent) and activate the VG using vgchange -a e (exclusive access to the node) or vgchange -a S for mc/lockmanager.
(3) Have you implemented NIS, DNS or other potential hostname-losing network tools?
(4) Check the network interface IP/s vs. hostnames in hosts, DNS or wherever it is held.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-08-2002 04:20 AM
05-08-2002 04:20 AM
Re: host fails to join cluster after reboot
As stated
For the ServiceGuard commands to work properly each host in the
node must have its own name as well as the other nodes in its
own .rhosts file.
You need to add this and try cmquerycl again. Then run cmruncl,
and it should work fine this time.
Maybe your name resolution is bad.
Check .rhosts and name resolution for saturn.
steve Steel
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-08-2002 04:23 AM
05-08-2002 04:23 AM
Re: host fails to join cluster after reboot
Check this as well
fully qualified hostnames in .rhosts
If found then reduce to simple hostname
Steve steel
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-08-2002 05:15 AM
05-08-2002 05:15 AM
Re: host fails to join cluster after reboot
1. yes cmclnodelist has entries for all 5 servers.
It looks like:
saturn root
saturn_h root
saturn_100bt root
Entries like these above for the other 4 nodes
2. version A.11.13 with patch PHSS_25915
3. The cluster was not re-built
4. No network tools have been implemented ie. DNS/NIS etc...
Will double check the .rhosts files are all the same (i will attach a copy)
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-08-2002 07:40 AM
05-08-2002 07:40 AM
Re: host fails to join cluster after reboot
This configuration file must be set if you want the cluster to automatically start after the reboot.
/etc/rc.config.d/cmcluster
#*************************** CMCLUSTER *************************
# Highly Available Cluster configuration
#
# @(#) $Revision: 81.2 $
#
# AUTOSTART_CMCLD: If set to 1, the node will attempt to
# join its CM cluster automatically when
# the system boots.
# If set to 0, the node will not attempt
# to join its CM cluster.
#
AUTOSTART_CMCLD=0
If this is not the problem I would ask for some sample info from the /etc/hosts file.
Good luck!
jad
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-09-2002 04:59 AM
05-09-2002 04:59 AM
Re: host fails to join cluster after reboot
Well, many others have taken a shot at it, so I might as well.
The initial trouble messages in syslog.log are:
May 5 10:30:11 saturn CM-CMD[5797]: /usr/sbin/cmrunnode -v
May 5 10:30:14 saturn cmclconfd[5820]: Permission denied for user root on node saturn (/etc/cmcluster/cmclnodelist)
This suggests a permissions issue allowing root to access the local system via networking services.
ServiceGuard utilizes "hacl" network services listed in /etc/services (9 lines)and /etc/inetd.conf (3 lines for 11.12 and later) when performing ANY command.
If network services render "permission denied", first check for the existence of the primary permission file that SG seeks - the /etc/cmcluster/cmclnodelist file. Make certain it is on all servers and that each permits root access to EVERY node including itself.
If that file is not used, inspect ~/.rhosts to insure it allows root priviledges to ALL nodes (including self).
If this is not the problem, begin to suspect hostname services (/etc/hosts, /etc/nsswitch.conf, /etc/resolv.conf, or even more fundamentally, a configuration problem with the hostname (fully qualified domain names vs. simple hostnames (preferred).
This issue has many generation points, so call the Response Center if you can't get to the bottom of it.
-s.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-09-2002 05:46 AM
05-09-2002 05:46 AM
Re: host fails to join cluster after reboot
I dont know how much will this help you.
But in your control.sh script you are specifying exit 0 for the package startup script. This is a problem which I also had faced earlier.
I used to invoke a shell script in the control.sh like
. ./etc/cmcluster/pkg1/pkg_script.sh
In the pkg_script.sh file, I used to return with exit 0. As this script is executed in the same shell, it used to fail.
And, it depends on the failover policy too. Have you configured that the cluster must halt the node if the package fails on the node saturn. This is what the log file indicates
May 8 01:45:41 saturn cmcld: Service PKG*62990 terminated due to an exit(0).
May 8 01:45:41 saturn cmcld: Halted package bcv on node saturn.
Just my two cents.
-Sukant
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-10-2002 05:38 AM
05-10-2002 05:38 AM
Re: host fails to join cluster after reboot
jad - How do i check if the rc files are not starting the serviceguard daemons? I presumed this would be in syslog?
I also check that the "cmcld" process was running and it was.
I had to manually execute the "cmrunnode" and the node joined the cluster 1st time with no problems.
AUTOSTART_CMCLD=1 !
stephen - i presume root can access the local system via networking because my swlist commands etc... work and I am sure these processes access the local system via the networking!
The cmclnodelist is the same on all 5 nodes and does include root access for each machine. Also the /.rhosts is the same. The /etc/hosts file seems fine (attached)
nsswitch.conf is all set to "files"
there is no /etc/resolv.conf file
Sukant - No packages were attempting to run. We start them manually plus no scripts are executed within the same shell! The message you were viewing for the bcv package was part of our backups and Im sure hasnt impacted on the node joining the cluster.
cheers, :-)
mark.