<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: DL320Gen11 with disconnecting NVMe devices in ProLiant Servers (ML,DL,SL)</title>
    <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl320gen11-with-disconnecting-nvme-devices/m-p/7206241#M185338</link>
    <description>&lt;P style="margin: 0;"&gt;Hi There,&amp;nbsp;&lt;BR /&gt;Thank you for reaching out.&lt;BR /&gt;May we have the case # or the serial # via Private message on which the issue is being handled so we may check the progress?&lt;/P&gt;</description>
    <pubDate>Thu, 08 Feb 2024 05:03:44 GMT</pubDate>
    <dc:creator>ngnear</dc:creator>
    <dc:date>2024-02-08T05:03:44Z</dc:date>
    <item>
      <title>DL320Gen11 with disconnecting NVMe devices</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl320gen11-with-disconnecting-nvme-devices/m-p/7206000#M185267</link>
      <description>&lt;P&gt;We have a new vSAN cluster with 4 x DL320Gen11, each with 4 x 3.84 TB NVMe drives. One of the hosts shows device errors for the NVMe drives in vSphere/ESXi, but there is no error in iLO and, according to HPE, nothing in the AHS logs. The server has had the problem since the beginning of this week; before that it ran fine for a couple of weeks, including some HCIBench runs.&lt;/P&gt;&lt;P&gt;When the issues start, it's not always the same NVMe. Most of the time, after the first device disconnects in the OS (it is no longer shown on the PCI bus), at least a second one follows, immediately or later. The frustrating part is that HPE support is pointing at VMware: VMware support should identify the broken device/part. VMware support checked the logs and the server in a remote session, and the outcome is that some hardware is broken and responsible for the NVMe disconnects. But from the OS side it's not possible to see whether it's a broken NVMe or the backplane or....&lt;/P&gt;&lt;P&gt;So there is no real progress in resolving this. Does anyone have any idea how to narrow down the issue? I'm already powering off single NVMes from within iLO to see if the error reoccurs (funny that HPE support did not suggest that; I'm not on-site), but I don't have a result yet. Are there any tests for the NVMes that can be triggered somewhere in RBSU? And how/where can I disable NVMes in RBSU?&lt;/P&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;P&gt;Update: with the drive in bay 3 powered off, the one in bay 2 still failed a few hours later. Now I've disabled both. But I'm not happy at all with how HPE support is handling this. Somehow I'm supposed to prove which device has failed. If it's the backplane, that's nearly impossible. 
We are paying _a_lot_ of money for HPE support contracts, and in the end nobody moves or tries to fix this on-site.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="image.png" style="width: 1949px;"&gt;&lt;img src="https://community.hpe.com/t5/image/serverpage/image-id/139510iC937A09E9812A82B/image-size/large?v=v2&amp;amp;px=2000" role="button" title="image.png" alt="image.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="image.png" style="width: 1009px;"&gt;&lt;img src="https://community.hpe.com/t5/image/serverpage/image-id/139511i175155410D705BFB/image-size/large?v=v2&amp;amp;px=2000" role="button" title="image.png" alt="image.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 09 Feb 2024 01:29:52 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl320gen11-with-disconnecting-nvme-devices/m-p/7206000#M185267</guid>
      <dc:creator>pirx</dc:creator>
      <dc:date>2024-02-09T01:29:52Z</dc:date>
    </item>
    <item>
      <title>Re: DL320Gen11 with disconnecting NVMe devices</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl320gen11-with-disconnecting-nvme-devices/m-p/7206039#M185279</link>
      <description>&lt;P&gt;With 2 NVMes powered down via iLO, there has been no error in 36 hours. So what does this mean: 2 faulty NVMes? A broken backplane?&amp;nbsp;&lt;/P&gt;&lt;BLOCKQUOTE&gt;&lt;P&gt;Embedded:Port=3A:Box=1:Bay=4 Enabled 3.84 TB NVMe SSD&lt;BR /&gt;Embedded:Port=3A:Box=1:Bay=3 &lt;STRONG&gt;Disabled&lt;/STRONG&gt; 3.84 TB NVMe SSD&lt;BR /&gt;Embedded:Port=4A:Box=1:Bay=1 Enabled 3.84 TB NVMe SSD&lt;BR /&gt;Embedded:Port=4A:Box=1:Bay=2 &lt;STRONG&gt;Disabled&lt;/STRONG&gt; 3.84 TB NVMe SSD&lt;/P&gt;&lt;/BLOCKQUOTE&gt;</description>
      <pubDate>Sun, 04 Feb 2024 16:03:44 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl320gen11-with-disconnecting-nvme-devices/m-p/7206039#M185279</guid>
      <dc:creator>pirx</dc:creator>
      <dc:date>2024-02-04T16:03:44Z</dc:date>
    </item>
    <item>
      <title>Re: DL320Gen11 with disconnecting NVMe devices</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl320gen11-with-disconnecting-nvme-devices/m-p/7206241#M185338</link>
      <description>&lt;P style="margin: 0;"&gt;Hi There,&amp;nbsp;&lt;BR /&gt;Thank you for reaching out.&lt;BR /&gt;May we have the case # or the serial # via Private message on which the issue is being handled so we may check the progress?&lt;/P&gt;</description>
      <pubDate>Thu, 08 Feb 2024 05:03:44 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl320gen11-with-disconnecting-nvme-devices/m-p/7206241#M185338</guid>
      <dc:creator>ngnear</dc:creator>
      <dc:date>2024-02-08T05:03:44Z</dc:date>
    </item>
    <item>
      <title>Re: DL320Gen11 with disconnecting NVMe devices</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl320gen11-with-disconnecting-nvme-devices/m-p/7207121#M185549</link>
      <description>&lt;P&gt;After a week of issues we removed all NVMes and reconnected the cables. The issue has been fixed since then (2 weeks now). A bit surprising, as the server had already been running OK for 4 weeks, and then a connection problem seems to be the root cause.&lt;/P&gt;</description>
      <pubDate>Thu, 22 Feb 2024 08:42:00 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl320gen11-with-disconnecting-nvme-devices/m-p/7207121#M185549</guid>
      <dc:creator>pirx</dc:creator>
      <dc:date>2024-02-22T08:42:00Z</dc:date>
    </item>
    <item>
      <title>Re: DL320Gen11 with disconnecting NVMe devices</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl320gen11-with-disconnecting-nvme-devices/m-p/7207590#M185638</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.hpe.com/t5/user/viewprofilepage/user-id/2042567"&gt;@pirx&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;Perfect!&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We are glad to know the issue has been resolved, and we appreciate you keeping us posted.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 28 Feb 2024 15:34:11 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/dl320gen11-with-disconnecting-nvme-devices/m-p/7207590#M185638</guid>
      <dc:creator>Sunitha_Mod</dc:creator>
      <dc:date>2024-02-28T15:34:11Z</dc:date>
    </item>
  </channel>
</rss>

