iLO2 System Health Status Incorrect

 
SOLVED
Melissa O'Brien
Frequent Advisor

iLO2 System Health Status Incorrect

BL465c with iLO2 1.42 and OA 2.04

I have a "Correctable Memory Error Threshold Exceeded" error showing in my IML and SMH, but it hasn't updated the iLO2 System Health Status or the Onboard Administrator alerts.

Shouldn't the health status show as degraded?

Thanks for your help!
10 REPLIES
Mi6t0
Trusted Contributor

Re: iLO2 System Health Status Incorrect

Which versions of iLO and OA are you running?
The recommended versions are 1.42 for iLO and 2.04 for OA.
Melissa O'Brien
Frequent Advisor

Re: iLO2 System Health Status Incorrect

My iLO2 is 1.42
My OA is 2.04

I'm wondering whether the Correctable Memory Error is not "critical" enough to change the System Health Status?
Raghuarch
Honored Contributor

Re: iLO2 System Health Status Incorrect

Hi Melissa,

If the correctable memory error is a false alarm, the following message appears in the Windows event log and a corresponding SNMP trap (type 6029) is forwarded to the connected service center; however, no memory event is logged in the Integrated Management Log (IML):
Event ID 1071
System Information Agent: Health: Correctable memory error detected.
The errors have been corrected, but the memory module should be replaced.
[SNMP TRAP: 6029 in CPQHLTH.MIB]

Please refer to the link below for more details.
http://h20000.www2.hp.com/bizsupport/TechSupport/Document.jsp?lang=en&cc=us&objectID=c01202059&jumpid=reg_R1002_USEN
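
For anyone who wants to check for this event in a script, here is a minimal Python sketch. It only parses the message text quoted above; the function name `parse_health_event` and the returned keys are made up for illustration, and how you get the text out of the Windows event log is left open.

```python
import re

# Event text as quoted in this post.
EVENT_TEXT = """\
Event ID 1071
System Information Agent: Health: Correctable memory error detected.
The errors have been corrected, but the memory module should be replaced.
[SNMP TRAP: 6029 in CPQHLTH.MIB]
"""

def parse_health_event(text):
    """Pull the event ID, SNMP trap number, and MIB name out of the message text.

    Hypothetical helper: the field names below are illustrative only.
    """
    event = re.search(r"Event ID\s+(\d+)", text)
    trap = re.search(r"\[SNMP TRAP:\s*(\d+)\s+in\s+([\w.]+)\]", text)
    return {
        "event_id": int(event.group(1)) if event else None,
        "trap": int(trap.group(1)) if trap else None,
        "mib": trap.group(2) if trap else None,
        # The message itself recommends replacing the module.
        "replace_module": "should be replaced" in text,
    }

info = parse_health_event(EVENT_TEXT)
print(info)
```

If `event_id` is 1071 and `trap` is 6029, the message is the one described here; whether the IML also holds a matching entry still has to be checked separately.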

Regards,
Raghuarch
Melissa O'Brien
Frequent Advisor

Re: iLO2 System Health Status Incorrect

Thanks Raghuarch.

I checked the event viewer and found the matching Event ID - the errors are valid. I actually swapped the memory into a different blade and the same error occurred.

All the more reason for this to be reflected in the iLO2 System Health and Blade Enclosure system status!
Raghuarch
Honored Contributor

Re: iLO2 System Health Status Incorrect

Hi Melissa,

I know that the OA doesn't report the correctable memory error.

I think the iLO will degrade the system health status to Failed or Degraded.

What is the current system health status in iLO?

Regards,
Raghuarch
Melissa O'Brien
Frequent Advisor

Re: iLO2 System Health Status Incorrect

The current system status is showing as OK (see attached screenshot). The IML has not been cleared; it still shows the following entry:

Caution Main Memory 12/08/2007 06:42 12/08/2007 06:42 1 Corrected Memory Error threshold exceeded (System Memory, Memory Module 6)
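
Since the health summary can read OK while the IML still holds this entry, it may help to check IML rows directly. Below is a rough Python sketch that splits a row like the one above into fields. The column order (severity, class, last update, initial update, count, description) is an assumption based on how the row reads, and `ImlEntry`, `parse_iml_row`, and `is_memory_threshold_event` are hypothetical names, not part of any HP tool.

```python
import re
from typing import NamedTuple, Optional

class ImlEntry(NamedTuple):
    severity: str
    event_class: str
    last_update: str
    initial_update: str
    count: int
    description: str

# Assumed layout: <severity> <class> <date time> <date time> <count> <description>
IML_ROW = re.compile(
    r"^(?P<severity>\S+)\s+"
    r"(?P<event_class>.+?)\s+"
    r"(?P<last_update>\d{2}/\d{2}/\d{4} \d{2}:\d{2})\s+"
    r"(?P<initial_update>\d{2}/\d{2}/\d{4} \d{2}:\d{2})\s+"
    r"(?P<count>\d+)\s+"
    r"(?P<description>.+)$"
)

def parse_iml_row(line: str) -> Optional[ImlEntry]:
    """Split one IML text row into named fields, or return None if it does not match."""
    match = IML_ROW.match(line.strip())
    if match is None:
        return None
    fields = match.groupdict()
    fields["count"] = int(fields["count"])
    return ImlEntry(**fields)

def is_memory_threshold_event(entry: ImlEntry) -> bool:
    """True for memory-class rows that mention an exceeded error threshold."""
    return ("memory" in entry.event_class.lower()
            and "threshold exceeded" in entry.description.lower())

ROW = ("Caution Main Memory 12/08/2007 06:42 12/08/2007 06:42 1 "
       "Corrected Memory Error threshold exceeded (System Memory, Memory Module 6)")
entry = parse_iml_row(ROW)
print(entry.severity, "|", entry.event_class, "|", entry.count)
print(is_memory_threshold_event(entry))
```

A check like this flags the DIMM from the log entry alone, without relying on the System Health Status summary.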
Raghuarch
Honored Contributor

Re: iLO2 System Health Status Incorrect

Hi Melissa,

The error is "Memory Error Threshold exceeded"; however, it was corrected, and hence it is shown as "Corrected Memory Error Threshold exceeded".

Since the error no longer exists (that is, it has been corrected), the iLO doesn't reflect it in the System Health Status.

If the error is not corrected, you will see a Failed or Degraded state.

Regards,
Raghuarch
Melissa O'Brien
Frequent Advisor

Re: iLO2 System Health Status Incorrect

Thanks!

That is so confusing. The Online Diagnostics is showing "Correctable Error Threshold Exceeded" and actually HP is shipping me a new memory DIMM, so isn't the system actually degraded?

What's the difference between Correctable Error Threshold Exceeded and Corrected Error Threshold Exceeded? I'm assuming there is an actual problem with the DIMM seeing as it has happened in different slots in different blades.

Raghuarch
Honored Contributor
Solution

Re: iLO2 System Health Status Incorrect

Hi Melissa,

You wrote: The Online Diagnostics is showing "Correctable Error Threshold Exceeded"
This is correct, since it indicates that the error can be corrected.

After the correction is done, you will see "Corrected Error Threshold Exceeded".

You wrote: I'm assuming there is an actual problem with the DIMM seeing as it has happened in different slots in different blades.
You are right :-)
The memory module clearly has a problem.

Regards,
Raghuarch
Terence Tsao
Frequent Advisor

Re: iLO2 System Health Status Incorrect

" Corrected Memory Error threshold exceeded (System Memory, Memory Module 6)".
This was the last IML entry, and it was repaired. As long as the system health status is OK, the blade server hardware is OK.
Regards.