ProLiant Servers (ML,DL,SL)
1752785 Members
5926 Online
108789 Solutions
New Discussion

Re: DL320e v2 Performance with P222 thermal shutdown

 
albal
Occasional Advisor

DL320e v2 Performance with P222 thermal shutdown

Hi,


Since the get go (December 2013) the PCI Slot 1 temperature for the P222 reporting has been off and states 80+ degreesC.  Is it is reporting in F and DL ILO not converting to C?

Anyway I thought no bother, until last night I got 3 alerts from ILO (Critical, Info, Repaired):

EVENT (04 Feb 21:20): Automatic Operating System Shutdown Initiated Due to Overheat Condition

EVENT (04 Feb 21:21): Automatic Operating System Shutdown Due to Overheat Aborted

EVENT (04 Feb 21:21): Automatic Operating System Shutdown Initiated Due to Overheat Condition

 

ilo, 1.32 Nov 05 2013 | ProLiant DL320e Gen8 v2, P80 09/01/2013

 

Controller in Slot 1

  • Controller Status  OK Serial Number PDSXH0ARH5M1SF Model HP Smart Array P222 Controller Firmware Version 4.68 Cache Module Status  OKCache Module Serial Number PBKUA0BRH5N0YE Cache Module Memory

    524288 KB

     

     

     

Sensor Location X Y Status Reading Thresholds

01-Inlet Ambient  Ambient  11  1   OK  19C  Caution: 42C; Critical: 46C 

02-CPU 1  CPU  13  6   OK  40C  Caution: 70C; Critical: N/A 

03-P1 DIMM 1-4  Memory  11  6   OK  25C  Caution: 87C; Critical: N/A 

04-HD Max  System  5  3   OK  35C  Caution: 60C; Critical: N/A 

05-Chipset  System  11  10   OK  43C  Caution: 105C; Critical: N/A 

07-VR P1  System  14  3   OK  30C  Caution: 115C; Critical: 120C 

08-Supercap Max  System  7  14   OK  24C  Caution: 65C; Critical: N/A 

09-iLO Zone  System  14  14   OK  41C  Caution: 77C; Critical: 82C 

11-LOM Zone  System  11  15   OK  36C  Caution: 68C; Critical: 73C 

12-PCI 1  I/O Board  10  12   OK  84C  Caution: 100C; Critical: N/A 

14-PCI 1 Zone  I/O Board  11  14   OK  36C  Caution: 69C; Critical: 74C 

15-PCI 2 Zone  I/O Board  12  14   OK  34C  Caution: 73C; Critical: 78C 

16-System Board  System  13  8   OK  25C  Caution: 68C; Critical: 73C 

17-Sys Exhaust  Chassis  14  15   OK  32C  Caution: 67C; Critical: 72C 

 

 

And in fahrenheit:

 

Sensor Location X Y Status Reading Thresholds

01-Inlet Ambient  Ambient  11  1   OK  64F  Caution: 108F; Critical: 115F 

02-CPU 1  CPU  13  6   OK  104F  Caution: 158F; Critical: N/A 

03-P1 DIMM 1-4  Memory  11  6   OK  77F  Caution: 189F; Critical: N/A 

04-HD Max  System  5  3   OK  95F  Caution: 140F; Critical: N/A 

05-Chipset  System  11  10   OK  109F  Caution: 221F; Critical: N/A 

07-VR P1  System  14  3   OK  86F  Caution: 239F; Critical: 248F 

08-Supercap Max  System  7  14   OK  75F  Caution: 149F; Critical: N/A 

09-iLO Zone  System  14  14   OK  106F  Caution: 171F; Critical: 180F 

11-LOM Zone  System  11  15   OK  97F  Caution: 154F; Critical: 163F 

12-PCI 1  I/O Board  10  12   OK  185F  Caution: 212F; Critical: N/A 

14-PCI 1 Zone  I/O Board  11  14   OK  97F  Caution: 156F; Critical: 165F 

15-PCI 2 Zone  I/O Board  12  14   OK  93F  Caution: 163F; Critical: 172F 

16-System Board  System  13  8   OK  77F  Caution: 154F; Critical: 163F 

17-Sys Exhaust  Chassis  14  15   OK  90F  Caution: 153F; Critical: 162F 

 

Have checked with my fingers and it's not above 50c (generally one can't hold one's finger on something much hotter than 50C).  Is this a firmware issue with the P222?

 

Thanks,

Al

 

p.s. the formatting of the message was much nice before but HP forums won't let you copy and past their own HTML from ILO into this forum :-/

3 REPLIES 3
Oscar A. Perez
Honored Contributor

Re: DL320e v2 Performance with P222 thermal shutdown

The P42x cards do run very hot.  Their setpoint in iLO is 85ºC and iLO uses its PID (Proportional Integral Derivative) algorithm to control the system FANs and keeps the card at or below 85ºC all the time.

I'm wondering if the card is truly heating up or this is just a false reporting.  Have you ever seen the card above 85ºC ?    




__________________________________________________
If you feel this was helpful please click the KUDOS! thumb below!
albal
Occasional Advisor

Re: DL320e v2 Performance with P222 thermal shutdown

Okay so it is RAID5 but it's sitting there mostly idle.  All other components are running at around 30-40 in an ambient of 19 - first time last night the ILO email alert came through.  T-Op to T-Max is quite little.

I've got another P222 on order and hopefully that one has a low profile backplate and I can swap this over to the other slot which will be more in line with the CPU HS/F exit flow.


For now I might ramp up the FAN speeds - power profile is low power so it might not be spining up fast enough to clear the air past the P222.

 

I wrote into tech support but he said unsupport drives was the issue :-/   If you can point me to 2TB SFF and 1TB SSDs then I'll take those instead.  I'm using HP Caddies.

Oscar A. Perez
Honored Contributor

Re: DL320e v2 Performance with P222 thermal shutdown

Unsupported drives could be an issue.  

The P222 card performs surface scan on the drives while the array is in idle and this process is enough to keep the card warm. The Slot-2 on that system is PCIe x4 only so, your P420 card will be definetely running cooler due to lower bandwidth but, the storage performance will be impacted.

 




__________________________________________________
If you feel this was helpful please click the KUDOS! thumb below!