- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - HP-UX
- >
- Re: unexpected system shutdown
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-24-2003 12:41 AM
тАО03-24-2003 12:41 AM
unexpected system shutdown
A HP-UX server (rp 5430) running HP-UX i had an unexpected system shutdown. This is an extract of the OLDsyslog.log file:
Mar 22 17:37:48 servername rpcbind: terminate: rpcbind terminating on signal. Restart with "rpcbind -w"
Mar 22 17:37:48 servername diagmond[1211]: Exit due to receipt of unexpected signal (3)
Mar 22 17:37:49 servername /usr/lbin/ups_mond[1497]: /usr/lbin/ups_mond: reboot -halt invoked due to UPS error cited in previous syslog message
Mar 22 17:37:49 servername /usr/lbin/ups_mond[1497]: /usr/lbin/ups_mond: UPS /dev/tty0p2 could not execute command S120
Mar 22 17:37:53 servername vmunix:
Mar 22 17:37:53 servername vmunix: sync'ing disks (1 buffer to flush): 1
Mar 22 17:37:53 servername vmunix: 0 buffers not flushed
Mar 22 17:37:53 servername vmunix: 0 buffers still dirty
Does anybody knows what causes this problem? The UPS seems to have received some kind of error.
Kurt
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-24-2003 02:00 AM
тАО03-24-2003 02:00 AM
Re: unexpected system shutdown
It is hard to be sure with the extract here but it looks like the server was already on the way down when you got the UPS message. The ups_mond has obviously detected something, if you could maybe attach the OLDsyslog.log I might be able to tell you more.
Also check the last entry in /etc/shutdownlog, it will tell you who invoked the reboot. I would also check the ts99 file in /var/tombstones for a valid timestamp, attach this too if you are unsure how to read this file.
One last thing - I can't believe you got those last few entries in the syslog - hpux is not running at this point so I don't understand how syslog can log them!?
Regards,
James.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-24-2003 02:19 AM
тАО03-24-2003 02:19 AM
Re: unexpected system shutdown
There are no relevant messages prior to those I've copy/pasted in the post.
What do you mean with checking if there is a valid time stamp in /var/tombstones/ts99?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-24-2003 02:26 AM
тАО03-24-2003 02:26 AM
Re: unexpected system shutdown
I was just checking in case you had hardware problems. In the ts99 file there should be a line (near the top) either saying "No valid timestamp" or a line with the date and a row of chassis codes in hex.
Does the shutdownlog not have an entry for the reboot? Was a crash dump produced?
Regards,
James.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-24-2003 02:41 AM
тАО03-24-2003 02:41 AM
Re: unexpected system shutdown
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-24-2003 03:02 AM
тАО03-24-2003 03:02 AM
Re: unexpected system shutdown
When a server reboots for any reason you only have a few places to look for evidence if the default configuration is in place:
1) /etc/shutdownlog --> reboot/panic message is logged here
2) /var/tombstones --> The ts99 will indicate if there were hardware problems
3) /var/adm/crash --> dumps are defaulted to be produced here
4) /var/adm/syslog/OLDsyslog.log --> this gives an indication of what was happening beforehand, most use in S/G clusters
You mentioned no message in the shutdownlog. The few times I have seen this there was a complete loss of power to the server on each occasion. This would certainly be a reason for the UPS to kick in. However, my first point was based on the fact that rpcbind and diagmond are located suspiciously close in the shutdown sequence that to see both of them being sent kill signals in the same second suggested a shutdown was already in progress. The next thing after these processes to go down too is syslogd.
If there is no dump and no (valid) ts99 entry then you have little evidence of what happened.
Regards,
James.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-24-2003 04:00 AM
тАО03-24-2003 04:00 AM
Re: unexpected system shutdown
Also check this link
http://forums.itrc.hp.com/cm/QuestionAnswer/1,,0xc67084534efbd5118ff40090279cd0f9,00.html
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-24-2003 04:27 AM
тАО03-24-2003 04:27 AM
Re: unexpected system shutdown
http://www1.itrc.hp.com/service/cki/cache/200000059144896.pdf
"reboot -halt invoked due to UPS error cited in previous syslog message "
What are the previous messages reg ups_mond
Thanks
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-24-2003 04:58 AM
тАО03-24-2003 04:58 AM
Re: unexpected system shutdown
I had this a year ago. The Current went down for ten minutes. It happened in the working hours, so I could observe the behaviour. The system was still working on the UPS. The UPS-manager made the system to go down, while the current was allready up.
Did not find a way to tune this. Now we have a different UPS system, which handles this a bit easier.
HTH
Donald
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-24-2003 06:06 AM
тАО03-24-2003 06:06 AM
Re: unexpected system shutdown
No core dumps in /var/adm/crash and the file /var/tombstones/ts99 looks normal as well.
In the OLDsyslog.log file are no previous ups_mond messages found.
We'll look further in the Unify database first.
regards,
Kurt
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-24-2003 10:13 AM
тАО03-24-2003 10:13 AM
Re: unexpected system shutdown
GSP should pick this up at the console, Ctrl-b up there.
SEP
Owner of ISN Corporation
http://isnamerica.com
http://hpuxconsulting.com
Sponsor: http://hpux.ws
Twitter: http://twitter.com/hpuxlinux
Founder http://newdatacloud.com