- Community Home
- >
- Servers and Operating Systems
- >
- HPE ProLiant
- >
- ProLiant Servers (ML,DL,SL)
- >
- VMware v4, failure, ESX hosts disconnecting from v...
ProLiant Servers (ML,DL,SL)
1821414
Members
2933
Online
109633
Solutions
Forums
Categories
Company
Local Language
back
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Discussions
Forums
Discussions
back
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Blogs
Information
Community
Resources
Community Language
Language
Forums
Blogs
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
10-20-2010 11:11 AM
10-20-2010 11:11 AM
VMware v4, failure, ESX hosts disconnecting from vCenter and EMC SAN, SCSI Reservation issues also,
We have a multi cluster VMware environment consisting of HP BL680’s, DL580’s and DL585’s that has been very stable for over a year and we have made no recent changes to storage, switches, hosts or HBAs.
Recently we experienced two incidents where ESX hosts on different clusters have disconnected from vCenter and from their shared EMC Clariion arrays (three separate frames). EMC said the Cisco fiber switch saw them as disconnected at the ESX host port.
The incident did not happen all at once but started with one host disconnecting followed by other hosts over a period of two to three hours. In some cases both HBA paths lost connection to the SAN and in some cases only one HBA disconnected from the SAN. Re-booting the ESX host reestablished connection to vCenter and to the SAN but in some cases specific LUNs were still not accessible. VMware support found SCSI Reservations on multiple hosts and those host all were unable to see the same 3 LUNs. They had us trespass these LUNs after which the hosts could access their data.
In a second incident two days later, one ESX host (one not involved in the previous incident) disconnected from vCenter but did not lose connection to the SAN. Within an hour two other hosts from the same cluster also disconnected from vCenter but not from the SAN. Three of two hosts were re-booted and re-connected to vCenter . The third restored itself without re-booting. Again a specific LUN was inaccessible not appearing to the host. The hosts vmkernel logs on the affected hosts were showing SCSI reservations. The LUN was trespassed, after which we could browse the LUN but the VM’s would not start. The LUN was then trespassed back and the VM’s were able to start and access the data.
VMware has recommended a firmware upgrade to our HP and Emulex HBAs which they say resolved a similar issue with an environment similar to ours. However they do not know what the condition is that is causing the problem. Our environment has been very stable for over a year and we have made no changes to storage, switches, hosts or HBAs.
Looking to hear from anyone with a similar experience who might have a handle on the root cause of this issue.
Recently we experienced two incidents where ESX hosts on different clusters have disconnected from vCenter and from their shared EMC Clariion arrays (three separate frames). EMC said the Cisco fiber switch saw them as disconnected at the ESX host port.
The incident did not happen all at once but started with one host disconnecting followed by other hosts over a period of two to three hours. In some cases both HBA paths lost connection to the SAN and in some cases only one HBA disconnected from the SAN. Re-booting the ESX host reestablished connection to vCenter and to the SAN but in some cases specific LUNs were still not accessible. VMware support found SCSI Reservations on multiple hosts and those host all were unable to see the same 3 LUNs. They had us trespass these LUNs after which the hosts could access their data.
In a second incident two days later, one ESX host (one not involved in the previous incident) disconnected from vCenter but did not lose connection to the SAN. Within an hour two other hosts from the same cluster also disconnected from vCenter but not from the SAN. Three of two hosts were re-booted and re-connected to vCenter . The third restored itself without re-booting. Again a specific LUN was inaccessible not appearing to the host. The hosts vmkernel logs on the affected hosts were showing SCSI reservations. The LUN was trespassed, after which we could browse the LUN but the VM’s would not start. The LUN was then trespassed back and the VM’s were able to start and access the data.
VMware has recommended a firmware upgrade to our HP and Emulex HBAs which they say resolved a similar issue with an environment similar to ours. However they do not know what the condition is that is causing the problem. Our environment has been very stable for over a year and we have made no changes to storage, switches, hosts or HBAs.
Looking to hear from anyone with a similar experience who might have a handle on the root cause of this issue.
The opinions expressed above are the personal opinions of the authors, not of Hewlett Packard Enterprise. By using this site, you accept the Terms of Use and Rules of Participation.
Company
Learn About
News and Events
Support
© Copyright 2025 Hewlett Packard Enterprise Development LP