ProLiant Servers - Netservers
1820592 Members
1684 Online
109626 Solutions
New Discussion

"Multiple-bit error in memory" LH3000

 
SOLVED
Go to solution
CA1151268
New Member

"Multiple-bit error in memory" LH3000

This 4-year old LH3000, previously 100% reliable, has stopped three times in the last month, each time with the front panel event log listing a Memory event as follows: "Multiple-bit error in memory Slot 2". (There is nothing about it in the Windows event log, - it's the LCD on the front of the case that reports this.)

After two incidents both referencing Slot 2, I moved the RAM modules to new slots, recording which one went where. At the next crash, the front-panel event still referenced Slot 2 even though a different stick of RAM was in that slot. So I took that DIMM out and ran the server with nothing in Slot 2 - this time it crashed and the front panel gave the same error, only referencing Slot 1 this time.

I'm thinking the processor board that has the DIMM slots is going bad, or some component that affects memory usage. (Ran two different brands of in-depth memory tests, all my DIMMS passed, no errors found).

I got no hits searching on "Multiple-bit error in memory", so it looks like a rare error. Anyone know what my next step should be?

Also, what are my tech support options on an out-of-warranty server? Can I purchase a single-incident contract?

Grateful for any advice, TIA.
1 REPLY 1
CA859951
Honored Contributor
Solution

Re: "Multiple-bit error in memory" LH3000

Mark:

ECC Multi-bit errors are rare and are usually the result of a hardware problem.

Here is a document with the steps to resolve the errors:
http://h10025.www1.hp.com/ewfrf/wc/genericDocument?lc=en&cc=us&docname=tis12262

"ECC Multiple-bit errors (two bits or more). Multiple-bit errors, which are extremely rare, may cause the system to halt or display a memory parity error. If this occurs, take the following steps:

Cycle the system power.

During the power-on self test (POST), press the D key to test memory.

If the memory test passes, then the memory failure could be a soft ECC error.
Reboot the system and reactivate your applications.
If the memory test fails, please contact your support representative for additional assistance to isolate the problem."


It is likely that the mainboard will need to be replaced, but you should get a certified HP tech on-site to give the server a once over.
You might be able to get a service contract for the server through CarePAQ: http://www.hp.com/hps/carepack/
or you may find a local shop with a good reputation to do the work. Make sure that if you do get a local shop that they have a partner relationship with HP and their techs are certified. HP servers require HP parts and not every body shop can get the right parts.

You did not tell me what speed the system was, so, here is a link to the HP Parts Store: http://h20141.www2.hp.com/hpparts/Search_Multi_Product.asp?mscssid=3NDEFDMTNML29M909VEPPBSW1AK660A1&SearchCriteria=LH~3000

I hope this helps! G'luck
"Now is the only thing that's real!"