Operating System - HP-UX
1833875 Members
1809 Online
110063 Solutions
New Discussion

script to check for system board/CPU failure

 
Arun Kumar Rajamari
Frequent Advisor

script to check for system board/CPU failure

Hi,

In one of our servers while restarting there was System Board failure.
While we were rebooting we could not get noticed one of the CPU failures.
Is there any existing script(command, which we can make into script) which monitors for SB failure checking for numbers of running CPU against expected?
We need to integrate with HP OVO either this failure logging an error for the respective component in any of the log files(using logfile template of HP OVO unix) or any script(command, monitor template of HP OVO Unix) to check for the availability as above.

Thanks,
Arun
2 REPLIES 2
Steven E. Protter
Exalted Contributor

Re: script to check for system board/CPU failure

Shalom Arun,

If your CPU fails while the system is running you will get a High Priority Machine Check HPMC, your system will crash, even if you lose one of eight CPU's.

You don't need a script to detect that, its obvious and will be logged in your GSP card.

There are a number of good scripts that meet these needs in the following thread:

http://forums12.itrc.hp.com/service/forums/questionanswer.do?threadId=178514

There are two other threads and hundreds of scripts in there.

SEP
Steven E Protter
Owner of ISN Corporation
http://isnamerica.com
http://hpuxconsulting.com
Sponsor: http://hpux.ws
Twitter: http://twitter.com/hpuxlinux
Founder http://newdatacloud.com
Tim Nelson
Honored Contributor

Re: script to check for system board/CPU failure

Have you looked at EMS ? This is what it is there for.

Can be reviewd by /etc/opt/resmon/lbin/monconfig

Can even send trap events to the server of your choice.