ProLiant Servers (ML,DL,SL)

DL585 G7 Uncorrectable Machine Check Exception

 
travis b
Occasional Contributor

The server rebooted unexpectedly this morning. It is running Windows Server 2008 R2.

From the system management homepage:
Uncorrectable Machine Check Exception (Board 0, Processor 1, APIC ID 0x00000010, Bank 0x00000004, Status 0xBA000000?F0F, Address 0x00000000?, Misc 0xC0040FFE?)

Followed by 300 of these:
An Unrecoverable System Error (NMI) has occurred (System error code 0x00000032, 0xA122088A)

Then the server rebooted.

Any ideas or suggestions?
Thanks,
Travis
3 REPLIES
Pogumirskiy Nikolay
Frequent Advisor

Re: DL585 G7 Uncorrectable Machine Check Exception

Try reducing the memory to a minimal configuration, and also remove the BBWC cache modules.
Kai-Uwe Schurig
Valued Contributor

Re: DL585 G7 Uncorrectable Machine Check Exception

Please update to the latest System ROM, 2011.01.29 (15 Apr 2011). It contains some fixes that may help:

http://h20000.www2.hp.com/bizsupport/TechSupport/SoftwareDescription.jsp?lang=en&cc=us&prodTypeId=15351&prodSeriesId=4194641&prodNameId=4194642&swEnvOID=4004&swLang=8&mode=2&taskId=135&swItem=MTX-cf0f933a5dc64170bcb968354e

Resolved an extremely intermittent issue that can result in a system reset or operating system crash (such as a Windows Blue Screen, Linux Kernel Panic, or VMware ESX PSoD) when the system is under heavy I/O load. This issue is typically seen with systems configured with a large amount of optional PCI-express expansion cards.
Johan Guldmyr
Honored Contributor

Re: DL585 G7 Uncorrectable Machine Check Exception

Does the DL585 G7 use the same architecture as the G5? Not completely, but in terms of memory.

That is, was the memory controller (or whatever it was called) in the CPU, so that memory errors could also come from a faulty CPU?
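
As a rough aid for reading the Status field in the original post, here is a minimal Python sketch that decodes the flag bits in the top byte of a machine-check status value. It assumes the standard x86 MCi_STATUS layout for bits 63 to 57. The names `MCI_STATUS_FLAGS` and `decode_mci_status_flags` are made up for illustration, and because the posted value is truncated, only its leading byte 0xBA is used.

```python
# Flag bits in the upper part of an x86 machine-check MCi_STATUS register.
# Assumption: the standard architectural layout applies to this platform.
MCI_STATUS_FLAGS = {
    63: "VAL (register holds a valid error)",
    62: "OVER (an earlier error was overwritten)",
    61: "UC (uncorrected error)",
    60: "EN (error reporting was enabled)",
    59: "MISCV (MISC register is valid)",
    58: "ADDRV (ADDR register is valid)",
    57: "PCC (processor context may be corrupt)",
}


def decode_mci_status_flags(status):
    """Return descriptions of the flag bits (63..57) set in a 64-bit status value."""
    return [
        name
        for bit, name in sorted(MCI_STATUS_FLAGS.items(), reverse=True)
        if (status >> bit) & 1
    ]


# Only the top byte (0xBA) of the posted status is used here;
# the remaining digits are truncated in the post and are left as zero.
top_byte_only = 0xBA << 56

for flag in decode_mci_status_flags(top_byte_only):
    print(flag)
```

If that layout applies, 0xBA has VAL, UC, EN, MISCV and PCC set with ADDRV clear, which would mean an uncorrected error was logged without a valid address. That may help narrow things down, but it does not by itself identify the failing component.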