Operating System - Linux
1748111 Members
3868 Online
108758 Solutions
New Discussion юеВ

DL585 G1 System Overheating Issue

 
digriz_1
New Member

DL585 G1 System Overheating Issue

We recently upgraded a bunch of DL585 G1 (running RH Linux 4.7) to PSP 8.30 (hp-health-8.3.0.43-30.x86_64)

Now several of them are reporting System overheating problems, i.e.


messages:May 10 17:03:29 xxxxxxxx hpasmd[12590]: WARNING: hpasmd: System Overheating (Zone 5, Location CPU, Temperature 111C)
messages:May 10 17:03:29 xxxxxxxx hpasmd[12590]: CRITICAL: hpasmd: Automatic Operating System Shutdown Initiated Due to Overheat Condition

Temperature seems to high to be real to me, assume its either a software or hardware bug

Anyone got any ideas ?
And will upgrading to PSP 8.40 solve this issue, theres nothing in the 8.40 release notes related to this
15 REPLIES 15
Michal Kapalka (mikap)
Honored Contributor

Re: DL585 G1 System Overheating Issue

hi,

if this is the same issue on all servers, it could be some bud on PSP layer, but if its only on one machine, i would recomend to make HW healt check.

maybe the upgrade to the neves version of PSP it could be help, sometimes not all bugs will be reported in the release notes.

mikap
digriz_1
New Member

Re: DL585 G1 System Overheating Issue

We have 64 DL 585 G1, 46 have been upgraded to PSP 8.30 and of those 6 servers have had this overheating problem, in 2 different data centers and we never had an over heating problem before the upgrade
SERKAN AK├ЗIN
Occasional Contributor

Re: DL585 G1 System Overheating Issue

Hi,

We have 4 dl585 servers.
2 of them rhel 4.7 and have no problem.
But
2 of them rhel 5.4 and giving temp errors.

We don't have temp problems before upgrade.

HP says, you must change the mainboard but, I think it is occur after the firmware update and psp updates.

Do you find a solution?
Steven E. Protter
Exalted Contributor

Re: DL585 G1 System Overheating Issue

Shalom,

Use the web based PSP interface ( http://hostname:2301 ) to check actual temp.

These servers are pretty old, and you may have poor airflow or bad fans. All of these issues can cause overheating.

Clearly the software is more sensitive.

If this were a very severe problem, the system would have failed long ago. Still, run through the checklist, give them the eyeball check, make sure the fan openings are not covered with dust accumulation.

SEP
Steven E Protter
Owner of ISN Corporation
http://isnamerica.com
http://hpuxconsulting.com
Sponsor: http://hpux.ws
Twitter: http://twitter.com/hpuxlinux
Founder http://newdatacloud.com
Gerardo Arceri
Trusted Contributor

Re: DL585 G1 System Overheating Issue

Please upgrade firmware on the servers, we hit a similar bug couple of years ago and Firmware fixed it.
IF YOU ARE SURE THAT THE SYSTEM IS NOT OVERHEATING, use RBSU (Bios Setup) to disable Thermal Shutdown.
kiheiman
New Member

Re: DL585 G1 System Overheating Issue

We are seeing the CPU over-heating events on a number of DL585G1 Linux servers. The problem started about 6 months ago and seems to come in spurts. After the server powers down, it continues to generate the over-heating events in the IML - but the CPU heat sinks are not hot. The fix is to pull both AC power cords and boot the server back up. We have tried replacing motherboards, moving CPU modules around, etc. The problem appears to happen with both iLO driver versions 8.40 and 8.50 (not HP supported). It is also happening with Linux version 4.7 and 4.8. A number of the servers are on iLO firmware version 1.8x, but I do not see anything in the fix info that older versions would cause an over-heating event.
Alzhy
Honored Contributor

Re: DL585 G1 System Overheating Issue

Update your Firmware.
Use the hpsum to check and update the various firmware of you G5.
Hakuna Matata.
kiheiman
New Member

Re: DL585 G1 System Overheating Issue

Here is some more info on our situation.

Servers with the latest version of the motherboard firmware and iLO firmware are crashing. Servers with old versions of iLO firmware are not crashing. Servers running RHEL 4.7 are not crashing. The server crashes did not start until we upgraded some servers to RHEL 4.8. We are not running a PSP other than the 8.40 iLO driver. The smoking gun, based upon forum comments, seems to point to any server running a RHEL release newer than 4.7.
SERKAN AK├ЗIN
Occasional Contributor

Re: DL585 G1 System Overheating Issue

We changed the heatsink of the 4 processors and the problem is gone.

HP says that in the production of G1 servers in the years 2004 and 2005, there is a heatsink metal alloy problem. We changed them and the craches are gone.