09-02-2011 03:09 PM
Storageworks Issue
About a month ago, I updated all firmware and drivers on my two clusters.
Fast forward to now: I had a strange glitch.
I thought a power outage had taken everything offline, but found out that wasn't the case.
Neither cluster rebooted, but both storage controllers (port 1 and port 2) on both clusters went offline at the same time, in about the same fashion.
Was this caused by a bad UPS, or a bad switch/ports?
The event log shows the private interface (crossover heartbeat) went down first, then the public interface, then the slot 1 storage controller, then the slot 2 storage controller:
Event Type: Warning
Event Source: ClusSvc
Event Category: Node Mgr
Event ID: 1123
Date: 9/1/2011
Time: 12:54:53 AM
User: N/A
Computer: SERVER
Description:
The node lost communication with cluster node 'Active node' on network 'Private'.
For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
Event Type: Warning
Event Source: ClusSvc
Event Category: Node Mgr
Event ID: 1123
Date: 9/1/2011
Time: 12:54:53 AM
User: N/A
Computer: SERVER
Description:
The node lost communication with cluster node 'Active node' on network 'Public'.
For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
Event Type: Error
Event Source: Storage Agents
Event Category: Events
Event ID: 1151
Date: 9/1/2011
Time: 12:55:30 AM
User: N/A
Computer: SERVER
Description:
External Array Controller Status Change. The external controller in I/O slot 1 of array "ZWVTMT423R" has a new status of 4.
(Controller status values: 1=other, 2=ok, 3=failed, 4=offline, 5=redundantPathOffline, 6=notConnected)
[SNMP TRAP: 16020 in CPQFCA.MIB]
Data:
Event Type: Error
Event Source: Storage Agents
Event Category: Events
Event ID: 1151
Date: 9/1/2011
Time: 12:55:30 AM
User: N/A
Computer: SERVER
Description:
External Array Controller Status Change. The external controller in I/O slot 2 of array "ZWVTMT423R" has a new status of 4.
(Controller status values: 1=other, 2=ok, 3=failed, 4=offline, 5=redundantPathOffline, 6=notConnected)
[SNMP TRAP: 16020 in CPQFCA.MIB]
Data:
The other errors that bother me are:
Event Type: Warning
Event Source: Ftdisk
Event Category: Disk
Event ID: 57
Date: 9/1/2011
Time: 12:54:55 AM
User: N/A
Computer: SERVER
Description:
The system failed to flush data to the transaction log. Corruption may occur.
For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
Data:
0000: 00 00 00 00 01 00 be 00 ......¾.
0008: 02 00 00 00 39 00 04 80 ....9..¿
0010: 00 00 00 00 10 00 00 80 .......¿
0018: 00 00 00 00 00 00 00 00 ........
0020: 00 00 00 00 00 00 00 00 ........
Event Type: Warning
Event Source: Srv
Event Category: None
Event ID: 2012
Date: 9/1/2011
Time: 12:54:57 AM
User: N/A
Computer: Server
Description:
While transmitting or receiving data, the server encountered a network error. Occassional errors are expected, but large amounts of these indicate a possible error in your network configuration. The error status code is contained within the returned data (formatted as Words) and may point you towards the problem.
For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
Data:
0000: 00 00 04 00 01 00 54 00 ......T.
0008: 00 00 00 00 dc 07 00 80 ....Ü..¿
0010: 00 00 00 00 84 01 00 c0 ....¿..À
0018: 00 00 00 00 00 00 00 00 ........
0020: 00 00 00 00 00 00 00 00 ........
0028: 7b 09 00 00 {...
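For anyone reading the Data blocks above: the Srv 2012 description says the error status code is in the returned data, formatted as words. The sketch below is one way to turn the two hex dumps into little-endian 32-bit words so the status values are easier to spot. The helper name `dump_to_dwords` and the small `KNOWN_STATUS` table are illustrative, not part of any HP or Microsoft tool, and the two names in the table are my reading of the standard NTSTATUS list, so treat them as an assumption to verify rather than a diagnosis.

```python
import struct

# Data blocks copied from the Ftdisk 57 and Srv 2012 events above
# (ASCII column dropped; the parser would skip it anyway).
FTDISK_DUMP = """
0000: 00 00 00 00 01 00 be 00
0008: 02 00 00 00 39 00 04 80
0010: 00 00 00 00 10 00 00 80
0018: 00 00 00 00 00 00 00 00
0020: 00 00 00 00 00 00 00 00
"""

SRV_DUMP = """
0000: 00 00 04 00 01 00 54 00
0008: 00 00 00 00 dc 07 00 80
0010: 00 00 00 00 84 01 00 c0
0018: 00 00 00 00 00 00 00 00
0020: 00 00 00 00 00 00 00 00
0028: 7b 09 00 00
"""

# Assumption: names as I understand the standard NTSTATUS table.
# Only the two values seen in these dumps are listed.
KNOWN_STATUS = {
    0x80000010: "STATUS_DEVICE_OFF_LINE",
    0xC0000184: "STATUS_INVALID_DEVICE_STATE",
}

HEX_DIGITS = set("0123456789abcdefABCDEF")


def dump_to_dwords(dump):
    """Parse 'offset: b0 b1 ...' lines into little-endian 32-bit words."""
    raw = bytearray()
    for line in dump.strip().splitlines():
        _, sep, rest = line.partition(":")
        if not sep:
            continue  # not a dump line
        # At most 8 byte tokens per line; stop at the ASCII column.
        for token in rest.split()[:8]:
            if len(token) != 2 or not set(token) <= HEX_DIGITS:
                break
            raw.append(int(token, 16))
    usable = len(raw) - len(raw) % 4  # ignore any trailing partial word
    return [v for (v,) in struct.iter_unpack("<I", bytes(raw[:usable]))]


if __name__ == "__main__":
    for name, dump in (("Ftdisk 57", FTDISK_DUMP), ("Srv 2012", SRV_DUMP)):
        print(name)
        for index, value in enumerate(dump_to_dwords(dump)):
            if value:  # skip all-zero words
                label = KNOWN_STATUS.get(value, "")
                print(f"  offset 0x{index * 4:02x}: 0x{value:08X} {label}")
```

In both dumps the word at offset 0x14 looks like an NTSTATUS value (0x80000010 for Ftdisk, 0xC0000184 for Srv). If the names above are right, both point at the storage device being unavailable rather than at the network itself, but that reading should be confirmed against Microsoft's NTSTATUS reference.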
After rebooting both clusters, everything came online fine.
I am extremely bothered by this. Everything was stable for about a month, and then both clusters puked for no apparent reason.
So, back to my original question: What do you think caused this? Bad network switch/ports, a bad UPS, or bad drivers?
The curve ball for me is that no other servers show that their network interfaces were disconnected during the above times.
Thanks in advance for your help!
Cluster 1 is a file cluster: StorageWorks MSA 500 G1 and two DL380 G3s.
Cluster 2 is an Exchange cluster: StorageWorks MSA 500 G2 and two DL380 G4s.
Both are running Windows Server 2003 Enterprise, SP2.
09-02-2011 10:43 PM