12-06-2005 07:39 AM
Node keeps crashing
Two-node cluster: one node runs a prod db and the other runs test.
The test node crashed for no apparent reason, other than this in the syslog of the prod node:
Dec 6 13:09:24 svr3001 cmcld: New node svr3000 is joining the cluster
Dec 6 13:09:24 svr3001 cmcld: Attempting to adjust cluster membership
Dec 6 13:09:24 svr3001 cmcld: Beginning standard partial election
Dec 6 13:09:28 svr3001 cmcld: Enabling safety time protection
Dec 6 13:09:28 svr3001 cmcld: Clearing Cluster Lock
Dec 6 13:09:30 svr3001 cmcld: 2 nodes have formed a new cluster, sequence #18
Dec 6 13:09:30 svr3001 cmcld: The new active cluster membership is: svr3001(id=2), svr3000(id=1)
Dec 6 13:09:30 svr3001 cmcld: Package ilogtest cannot run on this node because switching has been disabled for this node
Dec 6 13:09:31 svr3001 cmcld: One or more packages is not currently running because AUTO_RUN is disabled so that it cannot start automatically. To start these packages, enable AUTO_RUN via cmmodpkg -e
Dec 6 13:11:20 svr3001 cmcld: Timed out node svr3000. It may have failed.
Dec 6 13:11:20 svr3001 cmcld: Attempting to adjust cluster membership
Dec 6 13:11:20 svr3001 cmcld: Beginning standard partial election
Dec 6 13:11:22 svr3001 cmclconfd[13667]: Updated file /var/adm/cmcluster/frdump.cmcld.3 for node svr3001 (length = 512096).
Dec 6 13:11:30 svr3001 cmcld: Obtaining Cluster Lock
Dec 6 13:11:31 svr3001 cmcld: Turning off safety time protection since the cluster
Dec 6 13:11:31 svr3001 cmcld: may now consist of a single node. If Serviceguard
Dec 6 13:11:31 svr3001 cmcld: fails, this node will not automatically halt
Dec 6 13:11:31 svr3001 cmcld: This will not affect the behavior of Package Failfast
Dec 6 13:11:31 svr3001 cmcld: or Service Failfast. If such a package or service fails,
Dec 6 13:11:31 svr3001 cmcld: safety timer will be re-enabled and this node will
Dec 6 13:11:31 svr3001 cmcld: automatically halt.
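For anyone reading along, here is a rough Python sketch (not a Serviceguard tool) that pulls the cmcld lines out of an excerpt like the one above and measures how long the node stayed in the cluster between joining and being timed out. The function names, the regular expression, and the hard-coded year are assumptions made for illustration; the syslog lines carry no year, so one has to be supplied.

```python
import re
from datetime import datetime

# Matches lines such as:
#   Dec 6 13:09:24 svr3001 cmcld: New node svr3000 is joining the cluster
LINE_RE = re.compile(
    r"^(?P<ts>\w{3}\s+\d+\s+\d{2}:\d{2}:\d{2})\s+(?P<host>\S+)\s+cmcld:\s+(?P<msg>.*)$"
)


def parse_cmcld(lines, year=2005):
    """Return (timestamp, host, message) tuples for cmcld lines only.

    The year is an assumption passed in by the caller, since syslog omits it.
    """
    events = []
    for line in lines:
        m = LINE_RE.match(line.strip())
        if not m:
            continue  # skip non-cmcld lines (e.g. cmclconfd)
        ts = datetime.strptime(f"{year} {m.group('ts')}", "%Y %b %d %H:%M:%S")
        events.append((ts, m.group("host"), m.group("msg")))
    return events


def seconds_between_join_and_timeout(events, node):
    """Seconds from 'New node <node> is joining' to 'Timed out node <node>'."""
    joined = None
    for ts, _host, msg in events:
        if f"New node {node} is joining" in msg:
            joined = ts
        elif f"Timed out node {node}" in msg and joined is not None:
            return (ts - joined).total_seconds()
    return None


SAMPLE = [
    "Dec 6 13:09:24 svr3001 cmcld: New node svr3000 is joining the cluster",
    "Dec 6 13:09:30 svr3001 cmcld: 2 nodes have formed a new cluster, sequence #18",
    "Dec 6 13:11:20 svr3001 cmcld: Timed out node svr3000. It may have failed.",
    "Dec 6 13:11:22 svr3001 cmclconfd[13667]: Updated file /var/adm/cmcluster/frdump.cmcld.3 for node svr3001 (length = 512096).",
]

events = parse_cmcld(SAMPLE)
gap = seconds_between_join_and_timeout(events, "svr3000")
print(gap)
```

On the excerpt above this puts the join at 13:09:24 and the timeout at 13:11:20, so svr3000 was a cluster member for a little under two minutes before svr3001 declared it gone.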
The server crashed (the first time) on its own.
Since then, every time I try a cmrunpkg -n svr3000 packtst, it crashes.
And the kicker - NO CRASHDUMP!
/var/adm/crash is configured, as well as dump:
# lvlnboot -v
Boot Definitions for Volume Group /dev/vg00:
Physical Volumes belonging in Root Volume Group:
/dev/dsk/c1t2d0 (0/0/1/1.2.0) -- Boot Disk
/dev/dsk/c2t2d0 (0/0/2/0.2.0) --Boot
Boot: lvol1 on: /dev/dsk/c1t2d0
/dev/dsk/c2t2d0
Root: lvol3 on: /dev/dsk/c1t2d0
/dev/dsk/c2t2d0
Swap: lvol2 on: /dev/dsk/c1t2d0
/dev/dsk/c2t2d0
Dump: lvol2 on: /dev/dsk/c1t2d0, 0
in /etc/rc.config.d/savecrash
SAVECRASH=1
SAVECRASH_DIR=/var/adm/crash
and in /etc/rc.config.d/crashconf
CRASHCONF_ENABLED=1
Last line in svr3000 (test) syslog before crash:
Dec 6 13:10:56 svr3000 CM-packtest[9869]: cmmodnet -a -i 192.44.162.196 192.44.160.0
Last line of package log file - shows it calling another script to startup Oracle...
In all my years of ServiceGuard I have never seen something like this before...
Rgds...Geoff
12-06-2005 09:59 AM
Re: Node keeps crashing
12-06-2005 10:02 AM
Re: Node keeps crashing
Rgds...Geoff
12-06-2005 11:33 AM
Re: Node keeps crashing
It seems that the test node initiated the TOC on its own.
I would just start the node using cmrunnode.
Then further debugging could be done at the package level, i.e. cluster services, application startup and monitoring.
I am wondering about the cmmodnet line in syslog. The command may not be completing; it may have hung.
I would verify that there is no package IP conflict. Since the node crashed earlier, it may be necessary
to clean up the package IP using cmmodnet -r before starting the package.
Did you check /etc/shutdownlog on this node?
You can also check the SGFR using cmfmtfr.
12-06-2005 11:46 AM
Re: Node keeps crashing
Rgds...Geoff
12-06-2005 03:59 PM
Re: Node keeps crashing
Rgds...Geoff