cmhaltpkg hangs the system
02-16-2009 07:53 PM
We were trying to fail over a package to node2, but the command hung, so we had to reboot the system.
After the reboot, failing over the package works fine.
# cmhaltpkg -v pkg1
We noticed the following in syslog:
Feb 16 14:30:07 node1 cmcld: Request from node node2 to halt package pkg1 on node node1.
Feb 16 14:30:07 node1 cmcld: Executing '/etc/cmcluster/pkg1/pkg1.cntl stop' for package pkg1, as service PKG*54273.
Feb 16 14:30:07 node1 cmsrvassistd[20952]: Unable to communicate with ServiceGuard main daemon (cmcld): Can't assign requested address
Feb 16 14:30:07 node1 cmcld: Service PKG*54273 terminated due to an exit(118).
Feb 16 14:30:07 node1 cmcld: Halted package pkg1 on node node1.
Feb 16 14:30:07 node1 cmcld: Package pkg1 halt script exited abnormally.
Feb 16 14:30:07 node1 cmcld: Examine the file /etc/cmcluster/pkg1/pkg1.cntl.log for more details.
Feb 16 14:30:07 node1 cmcld: Switching disabled on package pkg1.
After rebooting, the system is up and running fine, and cmhaltpkg fails the package over successfully.
Any ideas why the hang happened?
Thank you in advance.
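
For anyone triaging a similar log, here is a minimal Python sketch that scans syslog lines for the symptoms visible in the excerpt above: the cmsrvassistd communication failure, the service exit code, the abnormal halt script exit, and the disabled switching. The regular expressions are inferred only from the lines quoted in this post, not from any Serviceguard documentation of the log format, and the function name find_halt_symptoms is hypothetical.

```python
import re

# Patterns inferred from the syslog excerpt in this thread only (assumption:
# other Serviceguard releases may word these messages differently).
PATTERNS = {
    "daemon_comm_failure": re.compile(
        r"cmsrvassistd\[\d+\]: Unable to communicate with ServiceGuard main daemon"),
    "service_exit": re.compile(
        r"cmcld: Service (\S+) terminated due to an exit\((\d+)\)"),
    "halt_script_abnormal": re.compile(
        r"cmcld: Package (\S+) halt script exited abnormally"),
    "switching_disabled": re.compile(
        r"cmcld: Switching disabled on package ([^\s.]+)"),
}


def find_halt_symptoms(lines):
    """Return (symptom_name, captured_groups, line) for every matching line."""
    hits = []
    for line in lines:
        for name, pattern in PATTERNS.items():
            match = pattern.search(line)
            if match:
                hits.append((name, match.groups(), line.rstrip()))
    return hits


# Sample input: lines copied from the syslog excerpt in this post.
SAMPLE = """\
Feb 16 14:30:07 node1 cmsrvassistd[20952]: Unable to communicate with ServiceGuard main daemon (cmcld): Can't assign requested address
Feb 16 14:30:07 node1 cmcld: Service PKG*54273 terminated due to an exit(118).
Feb 16 14:30:07 node1 cmcld: Halted package pkg1 on node node1.
Feb 16 14:30:07 node1 cmcld: Package pkg1 halt script exited abnormally.
Feb 16 14:30:07 node1 cmcld: Switching disabled on package pkg1.
""".splitlines()

if __name__ == "__main__":
    for name, groups, _ in find_halt_symptoms(SAMPLE):
        print(name, groups)
```

The script only classifies lines; it does not explain the cause. The package control log named in the syslog (/etc/cmcluster/pkg1/pkg1.cntl.log) is still the place to look for details.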
02-16-2009 09:06 PM
Solution:
The cmcld daemon sets a safety timer in the kernel, which is used to detect kernel hangs. If cmcld does not reset this timer periodically, the kernel causes a system TOC (Transfer of Control), which means a CPU reset. This can happen because cmcld could not communicate with the majority of the cluster's members, or because cmcld exited unexpectedly, aborted, or was unable to run for a significant amount of time and so could not update the kernel timer, which indicates a kernel hang. Before a TOC caused by expiration of the safety timer, messages are written to the syslog file and the kernel's message buffer.
Regards
Prashant
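
As a rough illustration of the mechanism described above, here is a toy Python model of a safety timer: a daemon has to keep pushing a deadline forward, and a check that runs on every clock tick declares a "TOC" once the deadline has passed. This is not Serviceguard or HP-UX code. The class name, the tick-based timeout, and the run() driver are all hypothetical and exist only to show the watchdog idea.

```python
class SafetyTimer:
    """Toy model of a kernel safety timer (hypothetical, not HP-UX code)."""

    def __init__(self, timeout_ticks):
        self.timeout_ticks = timeout_ticks
        self.now = 0
        self.deadline = timeout_ticks
        self.toc = False

    def reset(self):
        """Called by the daemon (cmcld in the real system) while it is healthy."""
        self.deadline = self.now + self.timeout_ticks

    def clock_tick(self):
        """Called on every simulated clock interrupt; checks the deadline."""
        self.now += 1
        if self.now > self.deadline:
            self.toc = True  # the real kernel would reset the CPU here
        return self.toc


def run(timer, total_ticks, daemon_stalls_at):
    """Advance the clock; the daemon resets the timer on each tick until it
    stalls. Returns the tick at which the TOC fires, or None if it never does."""
    for tick in range(1, total_ticks + 1):
        if tick < daemon_stalls_at:
            timer.reset()
        if timer.clock_tick():
            return timer.now
    return None


if __name__ == "__main__":
    # Daemon stops resetting the timer at tick 8; timeout is 5 ticks.
    print(run(SafetyTimer(timeout_ticks=5), total_ticks=20, daemon_stalls_at=8))
    # Daemon never stalls within the simulated window.
    print(run(SafetyTimer(timeout_ticks=5), total_ticks=20, daemon_stalls_at=21))
```

The point of the model is that the timer does not care why the resets stopped: a daemon that exited, one that lost contact with the cluster, and one that was starved of CPU all look the same to the check.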
02-16-2009 09:10 PM
Please paste the output of the file
/etc/cmcluster/pkg1/pkg1.cntl.log so we can see more details.
Regards
Prashant
02-16-2009 09:20 PM
See the third line of your syslog output:
>>Feb 16 14:30:07 node1 cmsrvassistd[20952]: Unable to communicate with ServiceGuard main daemon (cmcld): Can't assign requested address
Please check whether your cmcld daemon is running.
Are you able to run cmviewcl?
Suraj
02-16-2009 09:54 PM
A crash file was also created.
02-16-2009 10:59 PM
Yes, cmcld is running, and yes, we can run cmviewcl.
Thanks.
02-16-2009 11:47 PM
It's just that this sounds like a Serviceguard internal issue.
02-17-2009 01:03 AM
I agree with Wim about the SG patches.
There should be a cmcld core file in /var/adm/cmcluster. It is advisable to install a current Serviceguard patch.
02-17-2009 03:29 PM
#cmgetconf |grep E_T /etc/cmcluster/*
/etc/cmcluster/edlrp.tar:# Enter the package type for this package. PACKAGE_TYPE indicates
/etc/cmcluster/edlrp.tar:# NOTE: Packages which have a PACKAGE_TYPE of SYSTEM_MULTI_NODE are
/etc/cmcluster/edlrp.tar:# Examples : PACKAGE_TYPE FAILOVER (default)
/etc/cmcluster/edlrp.tar:# PACKAGE_TYPE SYSTEM_MULTI_NODE
/etc/cmcluster/edlrp.tar:PACKAGE_TYPE FAILOVER
/etc/cmcluster/gifcl.conf:# ServiceGuard cluster parameters, including NODE_TIMEOUT and
/etc/cmcluster/gifcl.conf:# The NODE_TIMEOUT parameter defaults to 2000000 (2 seconds).
/etc/cmcluster/gifcl.conf:# The maximum value recommended for NODE_TIMEOUT is 30000000
/etc/cmcluster/gifcl.conf:NODE_TIMEOUT 8000000
/etc/cmcluster/gifcl.conf.new:# ServiceGuard cluster parameters, including NODE_TIMEOUT and
/etc/cmcluster/gifcl.conf.new:# The NODE_TIMEOUT parameter defaults to 2000000 (2 seconds).
/etc/cmcluster/gifcl.conf.new:# The maximum value recommended for NODE_TIMEOUT is 30000000
/etc/cmcluster/gifcl.conf.new:NODE_TIMEOUT 8000000
The q4 stack trace shows:
stack trace for event 0
crash event was a TOC
wait_for_lock+0x144
sl_retry+0x1c
safety_time_check+0xfc
per_spu_hardclock+0xc4
clock_int+0x94
mp_ext_interrupt+0x3ec
ivti_patch_to_nop3+0x0
pset_idle_loop+0x120
idle+0x738
swidle_exit+0x0
Thanks in advance.
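
To read the two outputs in this post together, here is a small Python sketch. It converts NODE_TIMEOUT to seconds (the config comment "defaults to 2000000 (2 seconds)" implies the unit is microseconds) and checks whether the TOC stack contains a safety_time_check frame. The helper names are hypothetical, and treating a safety_time_check frame as a sign that the safety timer fired is an assumption based on the earlier reply in this thread, not on q4 documentation.

```python
def node_timeout_seconds(grep_lines):
    """Return NODE_TIMEOUT in seconds from the first non-comment setting found.

    Expects lines shaped like 'path:NODE_TIMEOUT 8000000', as in the grep
    output in this post. The value is taken to be in microseconds.
    """
    for line in grep_lines:
        _, _, text = line.partition(":")
        fields = text.split()
        if len(fields) == 2 and fields[0] == "NODE_TIMEOUT" and fields[1].isdigit():
            return int(fields[1]) / 1_000_000
    return None


def frames(trace_lines):
    """Extract function names from 'symbol+0xOFFSET' stack trace lines."""
    names = []
    for line in trace_lines:
        symbol, sep, _offset = line.strip().partition("+0x")
        if sep and symbol:
            names.append(symbol)
    return names


def toc_from_safety_timer(trace_lines):
    """True if the trace contains a safety_time_check frame (assumed indicator)."""
    return "safety_time_check" in frames(trace_lines)


# Sample input copied from this post.
GREP = [
    "/etc/cmcluster/gifcl.conf:# The NODE_TIMEOUT parameter defaults to 2000000 (2 seconds).",
    "/etc/cmcluster/gifcl.conf:NODE_TIMEOUT 8000000",
]
TRACE = [
    "stack trace for event 0",
    "crash event was a TOC",
    "wait_for_lock+0x144",
    "sl_retry+0x1c",
    "safety_time_check+0xfc",
    "per_spu_hardclock+0xc4",
]

if __name__ == "__main__":
    print(node_timeout_seconds(GREP))
    print(toc_from_safety_timer(TRACE))
```

With the values shown here, the configured NODE_TIMEOUT works out to 8 seconds, and the trace does include safety_time_check.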
02-18-2009 12:53 AM
5. Defect: JAGag28374 SR: 8606473752
Serviceguard on uniprocessor systems can lead to
cmcld consuming 100% of cpu resulting in a hang or system
TOC. This does not apply to multi-processor systems.
If this applies to you, you need to upgrade: there is no patch for 11.15, because that release is no longer supported.