Operating System - HP-UX

Event Monitor Notification

 
jamshed
Frequent Advisor


How can I fix this?

--------------------------------
From root@hp03 Mon Dec 17 05:43:51 PST 2007
Received: (from root@localhost)
by hp03 (8.9.3 (PHNE_29774)/8.9.3) id FAA17769;
Mon, 17 Dec 2007 05:43:51 +0500 (PST)
Date: Mon, 17 Dec 2007 05:43:51 +0500 (PST)
Message-Id: <200712170043.FAA17769@hp03>
To: root@hp03
From: root@hp03
Subject: hp03: Event Monitor Notification
Content-Length: 3017
Status: RO

>------------ Event Monitoring Service Event Notification ------------<

Notification Time: Mon Dec 17 05:43:51 2007

hp03 sent Event Monitor notification information:

/system/events/ipmi_fpl/ipmi_fpl is >= 3.
Its current value is CRITICAL(5).



Event data from monitor:

Event Time..........: Mon Dec 17 05:43:51 2007
Severity............: CRITICAL
Monitor.............: fpl_em
Event #.............: 646
System..............: hp03

Summary:
Partition being reset due to watchdog timeout expiring


Description of Error:

The partition is being reset because its watchdog timer expired and automatic restart is enabled.

Probable Cause / Recommended Action:


The watchdog mechanism triggers the MP to reset a partition if its OS becomes unresponsive. An unresponsive OS is detected when the OS fails to refresh the watchdog timer before it expires. PA systems refresh the watchdog timer by emitting an event with data field set to activity level/timeout, and the timeout field specifies the desired timeout. IPF systems refresh the watchdog timer using the IPMI clear watchdog command. The MP emits this event when timer expiration triggers resetting the partition. OS-specific and platform-specific procedures are used to enable/disable the watchdog timer from resetting the partition. See platform and OS documentation for details.
Find out why the partition's OS had hung. The cause could be bad HW that crashed the partition, or in rare cases, a combination of events that caused the OS to be unable to refresh the watchdog timer. Look for other events preceeding the timeout for clues to the root cause of the partition being unresponsive.


Additional Event Data:
System IP Address...: 1.1.17.112
Event Id............: 0x4765c64700000000
Monitor Version.....: A.01.00
Event Class.........: System
Client Configuration File...........:
/var/stm/config/tools/monitor/default_fpl_em.clcfg
Client Configuration File Version...: A.01.00
Qualification criteria met.
Number of events..: 1
Associated OS error log entry id(s):
None
Additional System Data:
System Model Number.............: 9000/800/rp7420
EMS Version.....................: A.04.20
STM Version.....................: A.52.00
System Serial Number............: DEH44537T4
Latest information on this event:
http://docs.hp.com/hpux/content/hardware/ems/fpl_em.htm#646

v-v-v-v-v-v-v-v-v-v-v-v-v D E T A I L S v-v-v-v-v-v-v-v-v-v-v-v-v


IPMI event hex: 0xf6800ad500e00000 000000000000000000
Time Stamp: Mon Dec 17 00:10:58 2007
Event keyword: WATCHDOG_RESET_PARTITION
Alert level name: Fatal
Reporting vers: 1
Data field type: Implementation dependent data field
Decoded data field:
Reporting entity ID: 0 ( Cab 0 )
Reporting entity Full Name: Service Processor
IPMI Event ID : 2773 (0xad5)


>---------- End Event Monitoring Service Event Notification ----------<
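
For readers unfamiliar with the watchdog mechanism described under "Probable Cause / Recommended Action" in the notification, here is a minimal sketch of the idea in Python. It is only a toy simulation, not how the MP or HP-UX implements it: the `Watchdog` class, the `first_reset_time` helper, and the timeout values are all hypothetical names and numbers chosen for illustration.

```python
class Watchdog:
    """Toy watchdog timer: expires if not refreshed within `timeout` time units."""

    def __init__(self, timeout, now=0):
        self.timeout = timeout
        self.deadline = now + timeout

    def refresh(self, now):
        # The OS signals that it is still alive; the deadline moves forward.
        self.deadline = now + self.timeout

    def expired(self, now):
        return now >= self.deadline


def first_reset_time(refresh_times, timeout, end):
    """Return the time at which the timer expires (the point where a monitor
    would reset the partition), or None if it never expires up to `end`."""
    wd = Watchdog(timeout, now=0)
    for t in sorted(refresh_times):
        if t > end:
            break
        if wd.expired(t):
            # The refresh arrived too late; the timer had already run out.
            return wd.deadline
        wd.refresh(t)
    return wd.deadline if wd.expired(end) else None
```

With refreshes at times 10, 20 and 30 and a timeout of 15, the OS "hangs" after time 30 and the timer runs out at time 45. If the refreshes keep arriving every 10 units, the timer never expires.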
SUDHAKAR_18
Trusted Contributor

Re: Event Monitor Notification

Hi ,
Does the system produce this error continuously, or did it occur only once?

Regards,
Sudha
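
One way to answer that question is to count how often each event number appears in the notification mails. The following is a rough sketch, assuming the notifications all use the same "Event #....: NNN" line format as the one posted above; `count_event_numbers` is a hypothetical helper name, not part of EMS or STM.

```python
import re
from collections import Counter

# Matches lines such as "Event #.............: 646" in an EMS notification.
EVENT_RE = re.compile(r"^Event #\.+:\s*(\d+)", re.MULTILINE)


def count_event_numbers(messages):
    """Count occurrences of each EMS event number across notification texts."""
    counts = Counter()
    for body in messages:
        counts.update(int(n) for n in EVENT_RE.findall(body))
    return counts
```

Passing the saved text of the notification mails (for example, the contents of root's mailbox read as one string) returns a mapping from event number to count, so a repeated event 646 would stand out immediately.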
whiteknight
Honored Contributor

Re: Event Monitor Notification

Hi,

I recommend checking the following information:

MP:CM>FPL/SEL
MP:CM> ps
MP:CM> cp
MP:CM> sysrev
MP:CM> DF DE
Parstatus command
console log

Also keep the firmware up to date.

WK
Problem never ends, you must know how to fix it
jamshed
Frequent Advisor

Re: Event Monitor Notification

The main problem at the moment is that I cannot log on to the web console, so I cannot run any of the advised commands on the MP.
Andrew Merritt_2
Honored Contributor

Re: Event Monitor Notification

Hi Jamshed,
I would recommend calling HP support. This problem is sometimes caused by the MP failing to communicate with the FRU bus, and can be cleared by a reset of the MP, but HP support should be able to advise if this is what is happening.

You should also look at updating your OnlineDiags. A.52.00 was released in June 2006. The currently supported version is A.59.00, released in December 2007 (see http://www.docs.hp.com/en/diag/stm/stm_upd.htm#table ). To download the latest version, see http://www.software.hp.com/portal/swdepot/displayProductInfo.do?productNumber=B6191AAE

Andrew
Ramesh S
Esteemed Contributor