Enclosure hardware failure

 
Jaroslav_Spanko

Hello guys,
I am getting EMS errors from our HP-UX servers:

From the first server:

Mar 20 03:57:02 a2502s01 EMS [2628]: ------ EMS Event Notification ------   Value: "CRITICAL (5)" for Resource: "/storage/events/enclosures/msamon_sas/64000_0xfa00_0x7"     (Threshold:  >= " 3")    Execute the following command to obtain event details:   /opt/resmon/bin/resdata -R 172228610 -r /storage/events/enclosures/msamon_sas/64000_0xfa00_0x7 -n 172228633 -a

Mar 20 03:57:02 a2502s01 EMS [2628]: ------ EMS Event Notification ------   Value: "CRITICAL (5)" for Resource: "/storage/events/enclosures/msamon_sas/64000_0xfa00_0x7"     (Threshold:  >= " 3")    Execute the following command to obtain event details:   /opt/resmon/bin/resdata -R 172228610 -r /storage/events/enclosures/msamon_sas/64000_0xfa00_0x7 -n 172228634 -a

And from the second server:

Feb 22 13:16:38 a2502s02 EMS [3491]: ------ EMS Event Notification ------   Value: "CRITICAL (5)" for Resource: "/storage/events/enclosures/msamon_sas/64000_0xfa00_0x7"     (Threshold:  >= " 3")    Execute the following command to obtain event details:   /opt/resmon/bin/resdata -R 228786178 -r /storage/events/enclosures/msamon_sas/64000_0xfa00_0x7 -n 228786185 -a

Feb 22 13:16:39 a2502s02 EMS [3491]: ------ EMS Event Notification ------   Value: "CRITICAL (5)" for Resource: "/storage/events/enclosures/msamon_sas/64000_0xfa00_0x7"     (Threshold:  >= " 3")    Execute the following command to obtain event details:   /opt/resmon/bin/resdata -R 228786178 -r /storage/events/enclosures/msamon_sas/64000_0xfa00_0x7 -n 228786186 -a
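To compare when each server raised the event, the notification lines above can be split into their fields. The following is only a minimal sketch, assuming the syslog layout is exactly as pasted here; the names `EMS_LINE`, `parse_ems_line` and `SAMPLE` are hypothetical helpers, not part of EMS. Note that syslog timestamps carry no year, so the timestamp is kept as a plain string.

```python
import re

# Pattern for the syslog-forwarded EMS notification lines quoted above.
# Assumption: the layout matches these samples; other EMS versions may differ.
EMS_LINE = re.compile(
    r'^(?P<timestamp>\w{3}\s+\d+\s+[\d:]{8})\s+(?P<host>\S+)\s+EMS\s+\[\d+\]:'
    r'.*?Value:\s+"(?P<severity>[^"]+)"'
    r'\s+for Resource:\s+"(?P<resource>[^"]+)"'
    r'.*?(?P<command>/opt/resmon/bin/resdata\s.*?-a)\s*$'
)

def parse_ems_line(line):
    """Return the fields of one EMS notification line, or None if it does not match."""
    match = EMS_LINE.search(line.strip())
    return match.groupdict() if match else None

# One of the lines from the first server, copied from the log above.
SAMPLE = (
    'Mar 20 03:57:02 a2502s01 EMS [2628]: ------ EMS Event Notification ------   '
    'Value: "CRITICAL (5)" for Resource: '
    '"/storage/events/enclosures/msamon_sas/64000_0xfa00_0x7"     '
    '(Threshold:  >= " 3")    Execute the following command to obtain event details:   '
    '/opt/resmon/bin/resdata -R 172228610 '
    '-r /storage/events/enclosures/msamon_sas/64000_0xfa00_0x7 -n 172228634 -a'
)

if __name__ == "__main__":
    event = parse_ems_line(SAMPLE)
    print(event["host"], event["timestamp"], event["severity"])
    print(event["command"])
```

Running this over the syslog of both servers and sorting by host and timestamp makes it easy to see whether the two hosts ever report the enclosure at the same moment.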

 

A general fault in the MSA60 should be reported on both servers at the same time, but the events above occurred on different dates (Feb 22 and Mar 20).

It is always the MSA60 controller device (64000_0xfa00_0x7) that is reported, never a disk.

All disks in the shared disk cabinet, as well as the controller device, are reported as connected (claimed).

The MSA60 is cabled correctly: each server is connected to a dedicated MSA60 controller.

 

/opt/resmon/bin/resdata -R 172228610 -r /storage/events/enclosures/msamon_sas/64000_0xfa00_0x7 -n 172228634 -a

ARCHIVED MONITOR DATA:

Event Time..........: Fri Mar 20 03:57:02 2015
Severity............: CRITICAL
Monitor.............: msamon_sas
Event #.............: 404
System..............: a2502s01

Summary:
     Enclosure at hardware path 64000/0xfa00/0x7 : Hardware Failure


Description of Error:

     The enclosure services device is not responding.

Probable Cause / Recommended Action:

     The enclosure services controller has failed or is no longer connected to
     the host. Check the connection, cables and controller card.

Additional Event Data:
     System IP Address...: 10.164.176.121
     Event Id............: 0x550b9a8e00000002
     Monitor Version.....: B.01.00
     Event Class.........: I/O
     Client Configuration File...........:
     /var/stm/config/tools/monitor/wbem_default_msamon_sas.clcfg
     Client Configuration File Version...: A.01.00
          Qualification criteria met.
               Number of events..: 1
     Associated OS error log entry id(s):
          None
     Additional System Data:
          System Model Number.............: ia64 hp server rx2660
          OS Version......................: B.11.31
          System Serial Number............: DEH5015DAE
          System Software ID..............: 3436665520
          EMS Version.....................: A.04.20.31.04
          STM Version.....................: D.06.00
     Latest information on this event:
          http://docs.hp.com/hpux/content/hardware/ems/msamon_sas.htm#404

v-v-v-v-v-v-v-v-v-v-v-v-v    D  E  T  A  I  L  S    v-v-v-v-v-v-v-v-v-v-v-v-v



Component Data:
     Physical Device Path...: 64000/0xfa00/0x7
     Firmware Version.......: 2.18
     Serial Number..........:                              SGA93603NX
     Inquiry Product ID.....: MSA60
     Inquiry Vendor ID......: HP

Product/Device Identification Information:

     Logger ID.........: sctl
     Product Identifier: JBOD
     Product Qualifier.: HPMSA60
     SCSI Target ID....: 0x00
     SCSI LUN..........: 0x00

Do you have any ideas or experience with this?

Thanks a lot