<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: DL380G7 Uncorrectable Machine Check Exception in ProLiant Servers (ML,DL,SL)</title>
    <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/5374751#M124589</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I guess its processor or VRM for processor causing the problem. Try replacing it with a new working one. I would suggest to follow a step by step HW troubleshooting flow chart.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Replace the VRM for Processor 1 and re check if itw orks&lt;/P&gt;&lt;P&gt;Replace Processor with a good one and see.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Repeat it for both processros.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hope this will help. Please keep posted with results.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 28 Oct 2011 06:00:41 GMT</pubDate>
    <dc:creator>SFHR</dc:creator>
    <dc:date>2011-10-28T06:00:41Z</dc:date>
    <item>
      <title>DL380G7 Uncorrectable Machine Check Exception</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/4766324#M110517</link>
      <description>Hi there,&lt;BR /&gt;&lt;BR /&gt;i deployed a new DL380 G7 with the following specs:&lt;BR /&gt;&lt;BR /&gt;2x Xeon X5650 CPU (2.66 MHz), 6/6 cores; 12 threads&lt;BR /&gt;8x 8192 MB RAM 1333 MHz&lt;BR /&gt;1x Embedded P410i with 1GB FBWC&lt;BR /&gt;&lt;BR /&gt;Firmware:&lt;BR /&gt;&lt;BR /&gt;BIOS: 12/01/2010&lt;BR /&gt;iLo3: 1.16&lt;BR /&gt;P410i: 3.66&lt;BR /&gt;&lt;BR /&gt;OS: Debian Squeeze&lt;BR /&gt;Kernel: 2.6.32-5-amd64&lt;BR /&gt;&lt;BR /&gt;BIOS Setting for Power-Saving was set to "OS Control mode" and on Debian the package cpufrequtils was installed (which sets the CPU scheduler to "ondemand" for all CPUs).&lt;BR /&gt;&lt;BR /&gt;While running some tests the box suddenly crashed hard:&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://test.thermoman.de/images/hp/dl380g7.kernel.panic.png" target="_blank"&gt;http://test.thermoman.de/images/hp/dl380g7.kernel.panic.png&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Integrated Management Log says:&lt;BR /&gt;&lt;BR /&gt;Class: System Error&lt;BR /&gt;Description: An Unrecoverable System Error (NMI) has occurred (System error code 0x00000000, 0x00000000)&lt;BR /&gt;&lt;BR /&gt;Class: CPU&lt;BR /&gt;Description: Uncorrectable Machine Check Exception (Board 0, Processor 2, APIC ID 0x00000020, Bank 0x00000005, Status 0xBA000000'00400405, Address 0x00000000'00000000, Misc 0x00000000'00004100)&lt;BR /&gt;&lt;BR /&gt;Class: CPU&lt;BR /&gt;Description: Uncorrectable Machine Check Exception (Board 0, Processor 2, APIC ID 0x00000021, Bank 0x00000005, Status 0xBA000000'00400405, Address 0x00000000'00000000, Misc 0x00000000'00004100)&lt;BR /&gt;&lt;BR /&gt;See &lt;A href="http://test.thermoman.de/images/hp/dl380g7.ilo.iml.png" target="_blank"&gt;http://test.thermoman.de/images/hp/dl380g7.ilo.iml.png&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;I googled this error and found some threads here on HP IT Resource Center regarding a bug with 2 NICs being enabled for PXE (not the case) and others suggesting problem with system board or CPU.&lt;BR /&gt;&lt;BR /&gt;Since i didn't find the mentioned Numbers (Status 0xBA000000'00400405) anywhere on the web i thought post it here for other lost souls :)&lt;BR /&gt;&lt;BR /&gt;Solution?&lt;BR /&gt;&lt;BR /&gt;1. upgraded BIOS Firmware to version 01/30/2011&lt;BR /&gt;2. memtest86+ - Result: no errors found&lt;BR /&gt;3. disabled cpufrequtils on Debian so CPUs don't get clocked down for power saving&lt;BR /&gt;4. running stress test at the moment, no definite results yet.&lt;BR /&gt;&lt;BR /&gt;Can someone tell me what part is being referenced by the IML status codes above? Is it CPU #2 that is detected as being faulty?&lt;BR /&gt;&lt;BR /&gt;Thanks in advance!&lt;BR /&gt;&lt;BR /&gt;Greetings,&lt;BR /&gt;Marcel.</description>
      <pubDate>Wed, 16 Mar 2011 21:53:29 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/4766324#M110517</guid>
      <dc:creator>M. Meckel</dc:creator>
      <dc:date>2011-03-16T21:53:29Z</dc:date>
    </item>
    <item>
      <title>Re: DL380G7 Uncorrectable Machine Check Exception</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/4766325#M110518</link>
      <description>We are having this same issue with one of our DL380 G7s.  I read through the fixes on the other thread as well, but none of them resolved the problem.&lt;BR /&gt;&lt;BR /&gt;All firmware and drivers are up to date.&lt;BR /&gt;&lt;BR /&gt;It appears to be a hardware problem though, as not all of our DL380 G7s have this issue.&lt;BR /&gt;&lt;BR /&gt;I'll let you know if I come upon a valid fix.</description>
      <pubDate>Thu, 17 Mar 2011 17:47:02 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/4766325#M110518</guid>
      <dc:creator>James Kennedy_4</dc:creator>
      <dc:date>2011-03-17T17:47:02Z</dc:date>
    </item>
    <item>
      <title>Re: DL380G7 Uncorrectable Machine Check Exception</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/4766326#M110519</link>
      <description>The System Board now gets replaced after the machine hung itself again even with newest BIOS installed (01/30/2011).&lt;BR /&gt;&lt;BR /&gt;I'll let you know if the swap fixes the problem.</description>
      <pubDate>Thu, 24 Mar 2011 13:45:47 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/4766326#M110519</guid>
      <dc:creator>M. Meckel</dc:creator>
      <dc:date>2011-03-24T13:45:47Z</dc:date>
    </item>
    <item>
      <title>Re: DL380G7 Uncorrectable Machine Check Exception</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/4766327#M110520</link>
      <description>System board got replaced. I upgraded BIOS firmware to the latest available (01/30/2011) and did run my stress tests again.&lt;BR /&gt;&lt;BR /&gt;Result after 24 hours: machine crashed again.&lt;BR /&gt;&lt;BR /&gt;Integrated Management Log says:&lt;BR /&gt;&lt;BR /&gt;Class: System Error&lt;BR /&gt;Description: An Unrecoverable System Error (NMI) has occurred (System error code 0x00000000, 0x00000000)&lt;BR /&gt;&lt;BR /&gt;Kernel Panic output looks the same as the above linked image.&lt;BR /&gt;&lt;BR /&gt;BIOS Setting for Power-Saving was set to "OS Control mode" and the package cpufrequtils this time was NOT installed.&lt;BR /&gt;&lt;BR /&gt;I'll now for the rest of the weekend try with "HP Static High Performance Mode" (as suggested in some thread as workaround from HP).</description>
      <pubDate>Sat, 26 Mar 2011 17:22:29 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/4766327#M110520</guid>
      <dc:creator>M. Meckel</dc:creator>
      <dc:date>2011-03-26T17:22:29Z</dc:date>
    </item>
    <item>
      <title>Re: DL380G7 Uncorrectable Machine Check Exception</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/4766328#M110521</link>
      <description>Having same issues here in Australia with multiple DL 380 G7's running WIndows Server 2008 R2 SP1 with all latest bios fixes. Have logged a support case with HP. Will reply back with outcome.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 28 Mar 2011 21:49:24 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/4766328#M110521</guid>
      <dc:creator>Glen Coghlan</dc:creator>
      <dc:date>2011-03-28T21:49:24Z</dc:date>
    </item>
    <item>
      <title>Re: DL380G7 Uncorrectable Machine Check Exception</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/4766329#M110522</link>
      <description>In the BIOS, change the  Power Regulator mode to "Static High Performance".  Seems to be a good fix so far.</description>
      <pubDate>Tue, 29 Mar 2011 10:10:54 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/4766329#M110522</guid>
      <dc:creator>James Kennedy_4</dc:creator>
      <dc:date>2011-03-29T10:10:54Z</dc:date>
    </item>
    <item>
      <title>Re: DL380G7 Uncorrectable Machine Check Exception</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/4766330#M110523</link>
      <description>&lt;P&gt;Hi James,&lt;BR /&gt;&lt;BR /&gt;yes, this seems to be a temporary fix for this issue.&lt;BR /&gt;&lt;BR /&gt;In Server BIOS set:&lt;BR /&gt;&lt;BR /&gt;- Advance Power option -&amp;gt; change to = HP Static High Performance Mode.&lt;BR /&gt;&lt;BR /&gt;- Minimum Processor Idle Power State -&amp;gt; No C-state&lt;BR /&gt;&lt;BR /&gt;I found this workaround here:&lt;BR /&gt;&lt;BR /&gt;"Absolute nightmare of a DL380 G7"&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://h30499.www3.hp.com/t5/ProLiant-Servers-ML-DL-SL/Absolute-nightmare-of-a-DL380-G7/m-p/4709685#M106891" target="_blank"&gt;http://h30499.www3.hp.com/t5/ProLiant-Servers-ML-DL-SL/Absolute-nightmare-of-a-DL380-G7/m-p/4709685#M106891&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;So far no more MCEs. I keep my fingers crossed.&lt;BR /&gt;&lt;BR /&gt;BUT: The green IT and power saving HP advertised its G7 line with is a big fat ridicule if you have to disable power saving get a stable machine.&lt;/P&gt;</description>
      <pubDate>Thu, 04 Aug 2011 16:55:47 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/4766330#M110523</guid>
      <dc:creator>M. Meckel</dc:creator>
      <dc:date>2011-08-04T16:55:47Z</dc:date>
    </item>
    <item>
      <title>Re: DL380G7 Uncorrectable Machine Check Exception</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/4830293#M112542</link>
      <description>&lt;P&gt;Having the same issue myself with a DL380 G7 but I've only gone for the C-State option to start with as a friend of mine had an issue with the Intel CPU and this resolved his issue.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I'm hoping it's this as I don't really want to impact the power usage as noticed it jump from 95 watts to 125 with the other setting.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for the help.&lt;/P&gt;&lt;P&gt;Jase&lt;/P&gt;</description>
      <pubDate>Mon, 18 Jul 2011 12:28:29 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/4830293#M112542</guid>
      <dc:creator>Jase4772</dc:creator>
      <dc:date>2011-07-18T12:28:29Z</dc:date>
    </item>
    <item>
      <title>Re: DL380G7 Uncorrectable Machine Check Exception</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/5374397#M124582</link>
      <description>&lt;P&gt;Same issue...&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;10/27/2011 15:52&lt;/P&gt;&lt;P&gt;Uncorrectable Machine Check Exception (Board 0, Processor 1, APIC ID 0x00000001, Bank 0x00000005, Status 0xB2000000'00800400, Address 0x00000000'00000000, Misc 0x00000000'00000000)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Machine went down hard over the weekend and I troubleshot it down to system board yesterday and had HP come out and replace the motherboard today and now I can't even get the machine to boot to a smartstart CD, let alone the OS, it just keeps cycling power when it comes time to load an OS.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I implemented high performance power and have also put the processors in no C-states mode.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any other troubleshooting advice would be great, we have 9 other DL380 G7's and haven't had issue with them.&lt;/P&gt;</description>
      <pubDate>Thu, 27 Oct 2011 20:24:08 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/5374397#M124582</guid>
      <dc:creator>Systems Engineer_1</dc:creator>
      <dc:date>2011-10-27T20:24:08Z</dc:date>
    </item>
    <item>
      <title>Re: DL380G7 Uncorrectable Machine Check Exception</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/5374751#M124589</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I guess its processor or VRM for processor causing the problem. Try replacing it with a new working one. I would suggest to follow a step by step HW troubleshooting flow chart.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Replace the VRM for Processor 1 and re check if itw orks&lt;/P&gt;&lt;P&gt;Replace Processor with a good one and see.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Repeat it for both processros.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hope this will help. Please keep posted with results.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 28 Oct 2011 06:00:41 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/5374751#M124589</guid>
      <dc:creator>SFHR</dc:creator>
      <dc:date>2011-10-28T06:00:41Z</dc:date>
    </item>
    <item>
      <title>Re: DL380G7 Uncorrectable Machine Check Exception</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/5374945#M124596</link>
      <description>&lt;P&gt;The C-State issue was resolved in the last BIOS firmware release for G7's. If your error is a result of the same problem then you should update as soon as possible.&lt;/P&gt;</description>
      <pubDate>Fri, 28 Oct 2011 08:27:37 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/5374945#M124596</guid>
      <dc:creator>Jase4772</dc:creator>
      <dc:date>2011-10-28T08:27:37Z</dc:date>
    </item>
    <item>
      <title>Re: DL380G7 Uncorrectable Machine Check Exception</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/5499507#M126199</link>
      <description>&lt;P&gt;Appears that HP has now fixed this with the May 5, 2011 BIOS update&lt;/P&gt;
&lt;P&gt;&lt;A href="https://support.hpe.com/hpesc/public/docDisplay?docId=emr_na-c02914393" target="_blank" rel="noopener"&gt;https://support.hpe.com/hpesc/public/docDisplay?docId=emr_na-c02914393&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Problems Fixed:&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Resolved an issue that may result in any of the following conditions: operating system stops responding, unexpected system reset, Blue Screen when using a Microsoft Windows operating system, kernel panic when using a Linux operating system, or Purple Screen when using VMware ESX. A message may be displayed by the operating system or logged in the HP Integrated Management Log (IML) when this issue occurs indicating an "Uncorrectable Machine Check Exception." However, there are instances where the system resets before the operating system displays an error message and instances where the IML contains no log entry when this issue occurs. This issue does not occur if the Minimum Processor Idle State is configured for No C-states or C1E-state. The system is susceptible to this issue in the default Minimum Processor Idle State configuration.&lt;/P&gt;
&lt;P&gt;Resolved an issue where PCI-Express Gen 3 option cards would run at PCI-Express Gen 1 speeds rather than the appropriate behavior of running at PCI-Express Gen 2 speeds. This server supports a maximum PCI-Express speed of Gen 2.&lt;/P&gt;
&lt;P&gt;Resolved an issue in which uncorrectable memory errors (or other fatal system errors) will not be logged to the HP Integrated Management Log (IML) when using some revisions of VMware ESX Server. These errors will result in a fatal error (Purple Screen of Death - PSoD) under VMware ESX, but there will not be any indication of the error type (including no indication of an uncorrectable memory error or what DIMM has failed). A VMware ESX Server issue which can result in uncorrectable memory errors this is addressed in VMware ESX 4.1 U1 and VMware ESX 4.0 U3. This System ROM revision addresses the logging of errors to the IML.&lt;/P&gt;
&lt;P&gt;&lt;EM&gt;[Note: broken link updated by Mod]&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 30 Oct 2020 14:16:24 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/5499507#M126199</guid>
      <dc:creator>Doug Herlovitch_1</dc:creator>
      <dc:date>2020-10-30T14:16:24Z</dc:date>
    </item>
    <item>
      <title>Re: DL380G7 Uncorrectable Machine Check Exception</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/5610173#M128360</link>
      <description>&lt;P&gt;Running a Dl380g7 w2k8r2&lt;/P&gt;&lt;P&gt;Ihave had this &lt;STRONG&gt;Uncorrectable Machine Check Exception&lt;/STRONG&gt; in ILM , updated to bios&amp;nbsp;05/05/2011.&lt;/P&gt;&lt;P&gt;now the system resets with the following ILM msg logged:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Operating System failure (Windows bug check, STOP: 0x00000080 (0x00000000004F4454, 0x0000000000000000, 0x0000000000000000, 0x0000000000000000))&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;Uncorrectable PCI Express Error (Embedded device, Bus 0, Device 7, Function 0, Error status 0x00000000)&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;minidump tells &lt;STRONG&gt;This is typically due to a hardware malfunction.&amp;nbsp; The hardware supplier should&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;be called.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Installed HP Servicepack from 27.3.2012 the Server reboots twice a day.&lt;/P&gt;&lt;P&gt;anyone has same issues? any ideas?&lt;/P&gt;&lt;P&gt;shoud i exchange MB?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;thx&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 05 Apr 2012 10:51:52 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/5610173#M128360</guid>
      <dc:creator>Robert Hawle</dc:creator>
      <dc:date>2012-04-05T10:51:52Z</dc:date>
    </item>
    <item>
      <title>Re: DL380G7 Uncorrectable Machine Check Exception</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/5830085#M132654</link>
      <description>&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We are running Windows 2008 R2 SP1 Data Center&lt;STRONG&gt; Core&lt;/STRONG&gt; Edition for our 4 node Hyper-V farm utilising HP Proliant DL385 G7's (Performance edition with extra RAM + Fiber HBAs), we've been suffering terrible stability problems since upgrading our nodes and had a Microsoft Premier call open. After setting the host OS up to support NMI Crash Dump we experienced a crash and tried to "Generate NMI to System" this failed and according to Microsoft this clarifies that the issue is with the hardware since the Non Maskable Interrupt is the highest level of interaction and should bypass any soft hangs. The only difference I can see is that following the NMI dump (which apparently didn't work) we did get a bug stop in the IML post-reboot which is more than we got before (so maybe it did work but just didn't force the blue screen and reboot).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This Bug stop lead me here...so for the record we run in Static High Power since we don't want to risk any performance glitches one our hosts (we figure since we've reduced physical foot print through virtualisation we dont need to justify additional power saving).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We are running all the very latest Windows patches (including the KB2568088 for Bulldozer to even get VM's to boot), I just checked the BIOS (2012.05.08 A18) and the despite Static High Performance we have a default Minimum Processor Idle Power State of "Core C6 (CC6) State" I've now changed this on the most recent node to crash to "No C-States"&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Our biggest issue here is that the hangs are not following a pattern we've had anything from 1 week apart to 1 month and its happened on 3 out of the 4 nodes at different points so I find a physical hardware issue unlikely.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;We have a ticket open with HP and we're escalating so I will post if we get any updates, but I am interested to know if you guys stayed on No C-States to avoid the issue or if that earlier BIOS resolved the issue in your CPU's?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Also for the record we are running the 16 Core AMD's with Bulldozer (6282SE's).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Interesting times ahead gaining business confidence back!&lt;/P&gt;</description>
      <pubDate>Thu, 11 Oct 2012 08:19:17 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/5830085#M132654</guid>
      <dc:creator>Dan Gough</dc:creator>
      <dc:date>2012-10-11T08:19:17Z</dc:date>
    </item>
    <item>
      <title>Re: DL380G7 Uncorrectable Machine Check Exception</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/5830093#M132655</link>
      <description>Seems this might be a curve-ball having refreshed my memory on the events around our last crash the Bug Stop was generated when we tested the NMI feature post reboot, as stated during the hang the NMI failed to do anything...guess we will have to wait for our HP ticket to escalate!</description>
      <pubDate>Thu, 11 Oct 2012 08:30:34 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/5830093#M132655</guid>
      <dc:creator>Dan Gough</dc:creator>
      <dc:date>2012-10-11T08:30:34Z</dc:date>
    </item>
    <item>
      <title>Re: DL380G7 Uncorrectable Machine Check Exception</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/6420132#M141606</link>
      <description>&lt;P&gt;This workaround works for me too, this is the scenario:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;- HP Proliant DL 160 Gen 8 E5-2603&lt;/P&gt;&lt;P&gt;- HP 1 TB 6GB SAS 7.2K 3.5in SC MDL HDD&lt;/P&gt;&lt;P&gt;- HP Smart Array P420/1GB FBWC Controller&lt;/P&gt;&lt;P&gt;- Sangoma PCI Wildcard A102D (PCI Express 2.0)&lt;/P&gt;&lt;P&gt;- Elastix 2.4&lt;/P&gt;&lt;P&gt;- CentOS release 5.10&lt;/P&gt;&lt;P&gt;- Kernel 2.6.18-371.3.1.el5&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Rigth now i can count 40 day without MCEs and... I keep my finger crossed too..&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I attach the screen shot of iLo&lt;/P&gt;</description>
      <pubDate>Fri, 21 Mar 2014 00:57:21 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/6420132#M141606</guid>
      <dc:creator>cgchavero</dc:creator>
      <dc:date>2014-03-21T00:57:21Z</dc:date>
    </item>
    <item>
      <title>Re: DL380G7 Uncorrectable Machine Check Exception</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/6822036#M152330</link>
      <description>&lt;P&gt;&lt;a href="https://community.hpe.com/t5/user/viewprofilepage/user-id/1186440"&gt;@Glen Coghlan﻿&lt;/a&gt;&amp;nbsp;yes, same here. my HP BL 465c G7 blade servers which was running for more than 2 years has just rebooted today during the business hours.&lt;/P&gt;&lt;P&gt;Here's the IML logs:&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;Uncorrectable Machine Check Exception (Board 0, Processor 1, APIC ID 0x00000010, Bank 0x00000004, Status 0xF2000000'00070F0F, Address 0x00000000'00000000, Misc 0x00000000'00000000)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;Uncorrectable Chipset Error (Error status 1 0x0018C154, Error status 2 0x00244000)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;Uncorrectable Chipset Error (Error status 1 0x0018C160, Error status 2 0x00002040)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;Uncorrectable Chipset Error (Error status 1 0x0018C16C, Error status 2 0x20000080)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;Uncorrectable Chipset Error (Error status 1 0x0018C170, Error status 2 0x040406FF)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;Uncorrectable Chipset Error (Error status 1 0x0018C174, Error status 2 0x00000003)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;Uncorrectable Chipset Error (Error status 1 0x0018C178, Error status 2 0x9452EA00)&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;My Server ROM is on &lt;STRONG&gt;A19 12/08/2012&lt;/STRONG&gt; but according to &lt;A href="http://h20565.www2.hpe.com/hpsc/doc/public/display?docId=emr_na-c03250482" target="_blank"&gt;http://h20565.www2.hpe.com/hpsc/doc/public/display?docId=emr_na-c03250482&lt;/A&gt; The system ROM dated &lt;STRONG&gt;12.31.2011&lt;/STRONG&gt; corrects this issue which is older ?&lt;/P&gt;</description>
      <pubDate>Wed, 06 Jan 2016 00:40:06 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/6822036#M152330</guid>
      <dc:creator>Server-Support</dc:creator>
      <dc:date>2016-01-06T00:40:06Z</dc:date>
    </item>
    <item>
      <title>Re: DL380G7 Uncorrectable Machine Check Exception</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/6833738#M152760</link>
      <description>&lt;P&gt;&lt;a href="https://community.hpe.com/t5/user/viewprofilepage/user-id/1181651"&gt;@Server-Support﻿&lt;/a&gt;, did your issue get resolved?&amp;nbsp; We have now experienced the reboot and&amp;nbsp;"Uncorrectable Machine Check Exception" IML entry on two separate production servers, both DL385 G7.&lt;/P&gt;</description>
      <pubDate>Wed, 17 Feb 2016 16:09:17 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl380g7-uncorrectable-machine-check-exception/m-p/6833738#M152760</guid>
      <dc:creator>teojaimes</dc:creator>
      <dc:date>2016-02-17T16:09:17Z</dc:date>
    </item>
  </channel>
</rss>

