- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - OpenVMS
- >
- Re: Self Restart on One VMS Cluster Member
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Forums
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-09-2009 02:06 AM
тАО03-09-2009 02:06 AM
I have an OpenVMS cluster that consists of 3 nodes.
1 node just did self reboot.
I tried to find the root cause, but still not successfull.
Operator.log only tells the moment when the node dissapears and rejoin the cluster.
Please suggests me to check the root cause.
Many Thanks,
Ricky Pardede
Solved! Go to Solution.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-09-2009 02:37 AM
тАО03-09-2009 02:37 AM
Re: Self Restart on One VMS Cluster Member
the node may have just crashed and rebooted automatically. You should at least find a bugcheck entry in ERRLOG.SYS.
Depending on OpenVMS version and architecture, there is other crash information to be found.
Does $ ANAL/CRASH SYS$SYSTEM: on that node report any crash information ?
Volker.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-09-2009 04:01 AM
тАО03-09-2009 04:01 AM
Re: Self Restart on One VMS Cluster Member
My machine use OpenVMS V7.3-2.
As your suggestion :
ID72:SYSTEM> ANAL/CRASH SYS$SYSTEM:
%SDA-I-SINGLEMEM, single member shadow set; accessing dump file via _DSA0:
OpenVMS (TM) system dump analyzer
...analyzing an Alpha compressed selective memory dump...
Dump taken on 9-MAR-2009 16:22:25.69
MACHINECHK, Machine check while in kernel mode
Now I still check how to use this utility.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-09-2009 04:07 AM
тАО03-09-2009 04:07 AM
Re: Self Restart on One VMS Cluster Member
for OpenVMS Alpha V7.3-2, you can easily get a crash summary by typing:
$ TYPE CLUE$HISTORY
This will show one line for each crash. There is also a more detailled CLUE summary file for each crash: CLUE$COLLECT:CLUE$NODE_ddmmyy_hhm.LIS
A MACHINECHK crash is most likely caused by a hardware problem. You need to examine the errorlog (with DECevent or WEBES/SEA, depending on the maschine type).
You can easily extract the most recent errlog entries from the crashdump itself:
$ ANAL/CRASH SYS$SYSTEM:
SDA> CLUE ERRLOG
This will show the most recent errors and also extract them to SYS$SCRATCH:CLUE$ERRLOG.SYS. You can then use this file for detailled analysis of the error leading to the crash.
Volker.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-09-2009 04:27 AM
тАО03-09-2009 04:27 AM
Re: Self Restart on One VMS Cluster Member
Thanks for the great help.
$ TYPE CLUE$HISTORY
=> I don't find entry for today crash.
SDA> CLUE ERRLOG
------------------------------------
Sequence Date Time Error Message Type
-------- ----------- ----------- --------------------------------
13818 9-MAR-2009 16:22:25.29 unknown entry
13819 9-MAR-2009 16:22:25.69 Machine Check 670
13820 9-MAR-2009 16:22:25.69 * Crash Entry
I think I need the WEBES tool for further investigation.
Can I get WEBES license for free ?
Thanks,
Ricky Pardede
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-09-2009 04:34 AM
тАО03-09-2009 04:34 AM
Solutionthe CLUE$SDA process should be run automatically during startup and produce both the entry in CLUE$HISTORY and the CLUE$COLLECT:CLUE$node_ddmmyy_hhmm.LIS file. If this does not work, check SYS$MANAGER:CLUE$STARTUP_node.LOG for errors.
You can freely download WEBES. I would suggest, that you download and install the Windows variant. It can also analyze OpenVMS ERRLOG.SYS files.
http://h18023.www1.hp.com/support/svctools/
Form older system types, use DECevent.
Volker.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-09-2009 05:22 AM
тАО03-09-2009 05:22 AM
Re: Self Restart on One VMS Cluster Member
Machine Checks can be fairly simple to fix (re-seating DIMMs or swapping), or can be more involved.
Most any services organization is familiar with the steps involved here, and with decoding the machine check information that will be produced by DECevent or WBEM/WEBES, etc.
The sooner services is on-line for a diagnostic pass, the sooner the box gets fixed.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-09-2009 10:01 AM
тАО03-09-2009 10:01 AM
Re: Self Restart on One VMS Cluster Member
Thanks a lot for the great help.
I will call HP asap to help analyze the root cause.
Thanks for the new knowledge.
Rergars,
Ricky Pardede
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-09-2009 10:19 AM
тАО03-09-2009 10:19 AM
Re: Self Restart on One VMS Cluster Member
It seems the CLUE$SDA process not running in all nodes.
Can you suggest to turn on CLUE$SDA properly ?
ID72:SMSC> PIPE SHO SYS /CLUSTER | SEARCH SYS$INPUT NODE, sda, clue
OpenVMS V7.3-2 on node SMID71 10-MAR-2009 01:18:19.95 Uptime 84 08:25:06
OpenVMS V7.3-2 on node SMID72 10-MAR-2009 01:18:19.96 Uptime 0 08:51:31
OpenVMS V7.3-2 on node SMID73 10-MAR-2009 01:18:19.97 Uptime 35 07:32:42
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-09-2009 10:21 AM
тАО03-09-2009 10:21 AM
Re: Self Restart on One VMS Cluster Member
the CLUE$SDA process only runs temporarily during startup. It exits, after diagnosing the dump file. Look at SYS$MANAGER:CLUE$STARTUP_node.LOG for possible error messages.
Volker.