- Community Home
- >
- Servers and Operating Systems
- >
- HPE BladeSystem
- >
- BladeSystem - General
- >
- Re: strange NCU behavior
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Forums
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО06-22-2009 10:42 AM
тАО06-22-2009 10:42 AM
strange NCU behavior
1) Server A is restarted
2) Server B NCU primary network link fails over to standby link when serverA is restarted.
In more detail we had a maintenance the other night which was to configure a virtual server for a new windows 2003 cluster. The server had been restarted multiple times during this install but on a random restart we noticed that at the exact same time the NCU on our production cluster failed and all connections to its passive node were lost. The production cluster server did not restart but only the NCU appeared to break when the non production cluster server was restarted. All IP info is different between clusters. I have also had this happen to another production cluster server where the same behavior was noted. ServerA would be restarted but then ServerB NCU would fail on primary link.
Has anyone seen behavior like this at all. We are getting quite afraid to restart any HP servers considering the damage it caused the last time.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО06-24-2009 09:34 AM
тАО06-24-2009 09:34 AM
Re: strange NCU behavior
The first question to ask is what type of failover occurs on Server B in the example above? Is it link loss occurring on the primary network link or does it fail over because of RX or TX path Failed (heartbeats)? You can gather this info by looking at the CPQTeam Log Entries in the System Log.
If it was link loss, ensure that the NICs actually attach to the switch ports that you think they do. Make sure the switch isn't misconfigured for some kind of link state tracking or uplink failure detection.
*** If it was due to RX or TX Path Validation, make sure that both the primary and standby network links are indeed connecting to switch ports that are in the same VLAN. It could be that they are not and that the primary link has been receiving path validation frames (heartbeats) from server A all along and when server A was rebooted, Server B's primary NIC no longer saw path validation frames and failed over to its standby NIC which was in the wrong VLAN resulting in loss of connectivity.
Path Validation frames are L2 multicast and will be heard by all NIC teams in a broadcast domain/VLAN.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО06-24-2009 12:29 PM
тАО06-24-2009 12:29 PM
Re: strange NCU behavior
1)On the night of the issue ServerA was undergoing maintenance and several restarts occurred
2) The vlan that serverA and B reside in is heavily populated.
3) Broadcast traffic within that vlan could have been high given the restarts and high poplulation of other production servers in vlan.
I have seen that enabling PortFast on a ciscoswitch will resolve such issues. This is not the first time we have encountered this issue. It has occurred between two other servers that also reside in the same vlan, same event ID 434.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО06-25-2009 06:13 AM
тАО06-25-2009 06:13 AM
Re: strange NCU behavior
Portfast is highly recommended, especially on a heavily populated VLAN.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО06-25-2009 09:08 AM
тАО06-25-2009 09:08 AM