HPE 9000 and HPE e3000 Servers
cancel
Showing results for 
Search instead for 
Did you mean: 

syslog report fan failure

 
SOLVED
Go to solution
Gary Yu
Super Advisor

syslog report fan failure

Hi all,

I hava an old A500 (9000/800/A500-44) server, recently has such entries in syslog:
Sep 10 10:04:07 typeR1 EMS [1542]: ------ EMS Event Notification ------ Value: "MAJORWARNING (3)" for Resource:
"/system/events/core_hw/core_hw" (Threshold: >= " 3") Execute the following command to obtain event details: /opt/resmon/bin/resdata -R 101056526 -r /system/events/core_hw/core_hw -n 101056539 -a

and when I ran the command, it told me fan failure:
at the bottom of the output it said:
Device identification information:

Number of fans present.: 4
Location of fan(s).....: Cabinet
Present fan number(s)..: 0,1,2,3


I'm confused, from the document(http://docs.hp.com/hpux/content/hardware/ems/dm_core_hw.htm#7), it should also report number of fans failed, I don't have A500's sepc on hand, does anybody know how many fans it shuld be on A500?
or is there any command I can ran to check fan status on a normal server just to compare?

thanks,
Gary
9 REPLIES 9
Paula J Frazer-Campbell
Honored Contributor

Re: syslog report fan failure

Garry

Please post full output of:-

/opt/resmon/bin/resdata -R 101056526 -r /system/events/core_hw/core_hw -n 101056539 -a


also a500 fan location included.

Paula
If you can spell SysAdmin then you is one - anon
Gary Yu
Super Advisor

Re: syslog report fan failure

Thanks Paula,

for some reason, I can't open your attachment. here's the output of the command
Event Time..........: Wed Sep 10 10:04:07 2003
Severity............: MAJORWARNING
Monitor.............: dm_core_hw
Event #.............: 7
System..............: typeR1

Summary:
Insufficient cabinet fans


Description of Error:

An insufficient number of fans are installed in the cabinet to cool the
unit.

Probable Cause / Recommended Action:

One or more fans have been removed from the cabinet. Replace the fan(s).

Additional Event Data:
System IP Address...: 10.42.60.11
Event Id............: 0x3f5ef71700000000
Monitor Version.....: B.01.00
Event Class.........: System
Client Configuration File...........:
/var/stm/config/tools/monitor/default_dm_core_hw.clcfg
Client Configuration File Version...: A.01.00
Qualification criteria met.
Number of events..: 1
Associated OS error log entry id(s):
None
Additional System Data:
System Model Number.............: 9000/800/A500-44
EMS Version.....................: A.03.20
STM Version.....................: A.22.00
Latest information on this event:
http://docs.hp.com/hpux/content/hardware/ems/dm_core_hw.htm#7

v-v-v-v-v-v-v-v-v-v-v-v-v D E T A I L S v-v-v-v-v-v-v-v-v-v-v-v-v


Device identification information:

Number of fans present.: 4
Location of fan(s).....: Cabinet
Present fan number(s)..: 0,1,2,3
ha@

Paula J Frazer-Campbell
Honored Contributor

Re: syslog report fan failure

Gary

The pdf file appears not to work so :-

Attached:_
If you can spell SysAdmin then you is one - anon
Steven E. Protter
Exalted Contributor

Re: syslog report fan failure

When you are done checking the software status of the fans, go look at the box.

Visually verify that very fan is actually working.

The last time I got a fan warning on my rp5450 server, there was indeed a fan not running. It had been unplugged from power.

SEP
Steven E Protter
Owner of ISN Corporation
http://isnamerica.com
http://hpuxconsulting.com
Sponsor: http://hpux.ws
Twitter: http://twitter.com/hpuxlinux
Founder http://newdatacloud.com
Paula J Frazer-Campbell
Honored Contributor
Solution

Re: syslog report fan failure

Gary

This error is I believe a temperature warning and not a fan failure direct warning.

Check that all the fans are running and air vents are clear, also check ambient air temp, I would say 80 degrees max in server location.

The jpeg shows the location of the fans.

A way to test each fan is to use a straw and insert it slowly into the fan from the rear and listen for the clicking as the straw contacts the fan - no clicking = no working.

Paula
If you can spell SysAdmin then you is one - anon
Robert_Jewell
Honored Contributor

Re: syslog report fan failure

The A500 has a GSP that you can log into and check the status of the temp and the fans:

GSP> PS

You may find some GSP logs relating to this whole thing as well. GSP> SL

-Bob
----------------
Was this helpful? Like this post by giving me a thumbs up below!
vofsky
Frequent Advisor

Re: syslog report fan failure

Hi Gary,
In my opion,I agree that the cause is the high temperature .You could change the speed of the fan under the maintenance mode.
Scot Bean
Honored Contributor

Re: syslog report fan failure

In my experience, diagnostics reporting fan failures can results from several causes:

- fan really is failed (can visually check), needs replacement
- diags software needs upgrade
- PDC / firmware need an upgrade
- HW board that monitors the fan speed / voltage need replacement

What version of PDC / firmware are you running? If less than 40.50 this document may help you:

http://www2.itrc.hp.com/service/cki/patchDocDisplay.do?patchId=PF_CHAW4324

It describes symptoms similar to yours. Older firmware thought there were more fans than there really are. Hence the false EMS alarm about a fan failure. A firmware upgrade MAY be a cure.
Andrew Merritt_2
Honored Contributor

Re: syslog report fan failure

You are using A.22.00 of OnlineDiags, which is a very old version (September 2000), so I'd recommend upgrading to a much more recent version plus any patches for that release of OnlineDiags.

In addition, check the GSP as others have suggested, or use the STM info tool to check the state of the fans.

Andrew