Integrity Servers

The system has detected that one of the power supplies has failed.

 
Aamir14
Regular Advisor


Hi,

 

I found error messages in the syslog file reporting a power supply failure:

 

appsdev2.mpmkvvcl.com#tail -1000000 /var/adm/syslog/syslog.log |grep CRITICAL
Sep 15 19:29:52 appsdev2 EMS [2010]: ------ EMS Event Notification ------   Value: "CRITICAL (5)" for Resource: "/system/events/ia64_corehw/core_hw"     (Threshold:  >= " 3")    Execute the following command to obtain event details:   /opt/resmon/bin/resdata -R 131727371 -r /system/events/ia64_corehw/core_hw -n 131727361 -a
Sep 15 19:29:53 appsdev2 EMS [2010]: ------ EMS Event Notification ------   Value: "CRITICAL (5)" for Resource: "/system/events/ia64_corehw/core_hw"     (Threshold:  >= " 3")    Execute the following command to obtain event details:   /opt/resmon/bin/resdata -R 131727371 -r /system/events/ia64_corehw/core_hw -n 131727362 -a
Sep 15 19:29:53 appsdev2 EMS [2010]: ------ EMS Event Notification ------   Value: "CRITICAL (5)" for Resource: "/system/events/ia64_corehw/core_hw"     (Threshold:  >= " 3")    Execute the following command to obtain event details:   /opt/resmon/bin/resdata -R 131727371 -r /system/events/ia64_corehw/core_hw -n 131727363 -a
Sep 16 03:32:54 appsdev2 EMS [2010]: ------ EMS Event Notification ------   Value: "CRITICAL (5)" for Resource: "/system/events/ia64_corehw/core_hw"     (Threshold:  >= " 3")    Execute the following command to obtain event details:   /opt/resmon/bin/resdata -R 131727371 -r /system/events/ia64_corehw/core_hw -n 131727364 -a
Sep 17 03:32:57 appsdev2 EMS [2010]: ------ EMS Event Notification ------   Value: "CRITICAL (5)" for Resource: "/system/events/ia64_corehw/core_hw"     (Threshold:  >= " 3")    Execute the following command to obtain event details:   /opt/resmon/bin/resdata -R 131727371 -r /system/events/ia64_corehw/core_hw -n 131727365 -a
Sep 18 03:33:01 appsdev2 EMS [2010]: ------ EMS Event Notification ------   Value: "CRITICAL (5)" for Resource: "/system/events/ia64_corehw/core_hw"     (Threshold:  >= " 3")    Execute the following command to obtain event details:   /opt/resmon/bin/resdata -R 131727371 -r /system/events/ia64_corehw/core_hw -n 131727366 -a
appsdev2.mpmkvvcl.com#
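
For reference, the resdata commands embedded in these EMS lines can be pulled out of the log instead of being copied by hand. This is only a minimal sketch: the function name extract_resdata_cmds is made up for illustration, and it assumes every CRITICAL line carries a command of the form shown above, starting with /opt/resmon/bin/resdata and ending with -a.

```shell
# Hypothetical helper: print each distinct resdata command found on
# CRITICAL lines read from standard input.
extract_resdata_cmds() {
    grep 'CRITICAL' |
        # Keep only the "/opt/resmon/bin/resdata ... -a" part of each line.
        sed -n 's|.*\(/opt/resmon/bin/resdata .* -a\).*|\1|p' |
        # Drop repeated commands.
        sort -u
}

# Usage (reads the same log as the tail command above):
#   extract_resdata_cmds < /var/adm/syslog/syslog.log
```

Each printed line can then be run as-is to get the event details for that notification.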
 
I also ran the command suggested in the log for the latest event:
 
appsdev2.mpmkvvcl.com#/opt/resmon/bin/resdata -R 131727371 -r /system/events/ia64_corehw/core_hw -n 131727366 -a
 
CURRENT MONITOR DATA:
 
Event Time..........: Tue Sep 18 03:33:00 2012
Severity............: CRITICAL
Monitor.............: ia64_corehw
Event #.............: 103001
System..............: appsdev2.mpmkvvcl.com
 
Summary:
 
     Power Supply : Failure is detected.
 
Description of Error:
 
     The system has detected that one of the power supplies has failed.
 
Probable Cause / Recommended Action:
 
     The power supply has failed. Contact your HP support representative to
     check the power supply.
 
          For information on the sensor that generated this event, refer to
          FRU ID in Event Details section.
 
Additional Event Data:
     System IP Address...: 192.168.2.3
     Event Id............: 103001620120918033258
     Monitor Version.....: C.04.00.05
     Event Class.........: System
     Client Configuration File............:
     /var/stm/config/tools/monitor/default_ia64_corehw.clcfg
     Client Configuration File Version....: A.01.00
          Qualification criteria met.
               Number of events: 1
     Associated OS error log entry id(s)
          None
     Additional System Data:
          System Model Number.............: ia64 hp server rx3600
          EMS Version.....................: A.04.20
          STM Version.....................: NA
          System Serial Number............: SGH4913M4C
     Latest information on this event:
 
 
v-v-v-v-v-v-v-v-v-v-v-v-v    D  E  T  A  I  L  S     v-v-v-v-v-v-v-v-v-v-v-v-v
 
 
Event Details :
 
     Event Date  ...................: Sat Sep 15 19:29:12 2012
     Sensor Number .................: 0x40
     Sensor Type ...................: Power Supply
     Sensor Class ..................: Sensor specific
     Sensor Reading/Offset .........: 0x1 (Sensor Reading)
     Event  Type ...................: Assertion
     Entity ID .....................: 0xa
     Generic Message ...............:
       Power Supply Failure detected
     Entity FRU Id Info ............: Power Supply 0(Sensor ID Power Supply 00(ff-ff-ff-ff-ff-0-ff-44))
 
Error Details:
 
     Additional information on this event can be obtained from evweb
     logviewer (Refer SFM User Guide) with the following log id: 3589
 
 
appsdev2.mpmkvvcl.com#
 
My server has two power supplies, and both LEDs are green.
How can I resolve this problem?
Please help.
 
Thanks 
Aamir 
 
 
P.S. This thread has been moved from HP-UX > System Administration to Servers > Integrity Servers - HP Forums Moderator
 
2 REPLIES
S_Logan
HPE Pro

Re: The system has detected that one of the power supplies has failed.

Hi Aamir,

 

Although the notification you queried is dated Sep 18, the Event Details section shows that the actual event occurred on:

Event Date  : Sat Sep 15 19:29:12 2012

 

 

If an event keeps being reported after the repair has been made (usually a hot repair), there is usually a reminder file in SFM that keeps triggering it.

 

There are two files that can trigger a persistent event:

 

  • /var/opt/sfm/data/reminderEvent.dat - This is the more common file and needs to be removed.
  • /var/opt/sfm/data/MemoryErrorCache.dat - If this file is present, it should also be removed.

The SFM provider module must be disabled before the files are removed and re-enabled afterwards, or the events will continue to be reported. The suggested sequence is:

 

Disable the SFM provider module:

 

  • #cimprovider -d -m SFMProviderModule 

Remove /var/opt/sfm/data/reminderEvent.dat and/or /var/opt/sfm/data/MemoryErrorCache.dat, whichever is present.

 

Enable the SFM provider module:

 

  • #cimprovider -e -m SFMProviderModule
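
The three steps above can be sketched as one small script. This is only an illustration under stated assumptions: the function names, the DRY_RUN switch and the SFM_DATA_DIR variable are made up for this sketch, and only the two cimprovider commands and the two file paths come from the steps above. With DRY_RUN=1 (the default here) it only prints what it would do.

```shell
#!/bin/sh
# Sketch of the disable / remove / re-enable sequence described above.
# DRY_RUN and SFM_DATA_DIR are assumptions of this sketch, not SFM settings.
DRY_RUN="${DRY_RUN:-1}"
SFM_DATA_DIR="${SFM_DATA_DIR:-/var/opt/sfm/data}"

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "would run: $*"
    else
        "$@"
    fi
}

reset_sfm_reminders() {
    # Step 1: disable the SFM provider module.
    run cimprovider -d -m SFMProviderModule || return 1

    # Step 2: remove whichever reminder files are present.
    for f in reminderEvent.dat MemoryErrorCache.dat; do
        if [ -f "$SFM_DATA_DIR/$f" ]; then
            run rm "$SFM_DATA_DIR/$f"
        fi
    done

    # Step 3: re-enable the SFM provider module.
    run cimprovider -e -m SFMProviderModule
}

# Usage (as root, after reviewing the dry-run output):
#   DRY_RUN=0; reset_sfm_reminders
```

Running it once in dry-run mode first shows which of the two files actually exist on the system before anything is deleted.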

    HTH,

    Surendar

    Was the post useful?

    Say Thank you by Clicking the WHITE KUDOS! Star.

    For HP Guided Troubleshooting; http://www.hp.com/support/hpgt

I work for HPE


S_Logan
HPE Pro

Re: The system has detected that one of the power supplies has failed.

Hi Aamir,

 

Please note that it could be an intermittent hardware issue with Power Supply 0.

 

If required, swap the power supplies between their slots and check whether the error follows the unit and is then reported against Power Supply 1.

 

HTH,

Surendar
