BladeSystem - General
cancel
Showing results for 
Search instead for 
Did you mean: 

"Twin" Qlogic QMH4062 total failure!

Luca
Advisor

"Twin" Qlogic QMH4062 total failure!

We have 4 BL490c G6 servers running with sphere with 4 Qlogic 4062 adapters booting off a MSA2012i (now P2000). Two servers, one after the other with a delta time of a couple of hours max failed together; the Qlogic adapter stopped communicating and logged firmware error repeatedly on the console. Luckily our infrastructure could keep the restarted VMs running on the remaining 2 blades and we weren't faced with any particular heavy downtime thanks to VMware HA.
However, it is incredible that the sole single point of failure (the Qlogic iSCSI HBA) failed, not once but TWICE on 2 seperate servers. One of the two adapters will not even enter their BIOS simply reporting corrupted firmware while the other is unable to save any settings in nvram.
Just wanted to document this pretty heavy misshap and to keep an eye out on Qlogic HBAs. Needless to say that our c3000 is under an XR5500 and networked with twin procurve 2810 switches for redundancy so no, it wasn't due to an external "current transient" or ground loop. Nothing was being done around the rack or network. They just died, one after the other. Makes me wonder about the other 2 now.
Just a bad day or is something else lurking under the mat?