Integrity Servers
cancel
Showing results for 
Search instead for 
Did you mean: 

An uncorrectable double bit error (DBE) has been detected

 
ESS IITC
Occasional Contributor

An uncorrectable double bit error (DBE) has been detected

Hi,
I have on erx6600 server with HPUX 11i31.For two months getting error message in even.log file with general message which is " An uncorrectable double bit error (DBE) has been detected".Server is working but with regiular message.Can anybody provide solution of this error message.Is it a bug??..
Can anybody help me out to sort out this issue.

Thanks in advance.
upen
3 REPLIES 3
sangilak
Trusted Contributor

Re: An uncorrectable double bit error (DBE) has been detected

Hi,


Uncorrectable double bit errors means that you have faulty memory. These kinds of errors are CRITICAL so they should be actioned as soon as possible.

If you have a support agreement with HP, log a case with the hardware team who then will verify which dimms are faulty and organize the replacement of it...

See following website for IA64 core hardware events:
http://h71000.www7.hp.com/doc/83final/wbem/wbemproviders_ia64corehw.html


sangilak
Prashanth.D.S
Honored Contributor

Re: An uncorrectable double bit error (DBE) has been detected

Hi There,

Looks like you had a Double Bit Error on one of the DIMM(Hardware Fault)sometime back in Sept 2010, have you replaced any memory module recently ?? If not check for errors in MP logs Capture and attach the following logs from MP.

MP>SL ==> Select SEL and D to dump on screen
MP>SL ==> Select FPL and D to dump.

As i mentioned this is a old event check this..

Event Time..........: Wed Jan 19 06:23:57 2011 <======
Severity............: CRITICAL
Monitor.............: ia64_corehw
Event #.............: 105100
System..............: RX6600DB.mrmewr.local


Event Details :

Event Date .............: Mon Sep 27 15:38:44 2010 <========
Sensor Number ..........: 0x81
Sensor Type ............: Memory
Sensor Class ...........: Unknown
Sensor Reading/Offset...: 0x01 (Offset)
Event Type.............: Assertion
Entity ID ..............: Unknown
Generic Message.........:
Unknown
Entity FRU Id Info......:
Unknown

Best Regards,
Prashanth
ESS IITC
Occasional Contributor

Re: An uncorrectable double bit error (DBE) has been detected

Hi Prashanth,
you are right.This problem started after memory upgrade.
How to know which DIMM is faulty ?
Server is working smoothly but with these error message time to time.

Pls. let me know how to confirm which DIMM is faulty.

Thanks and Regards.