Operating System - HP-UX

Gene_33
Occasional Advisor

fcmsutil comes back fine, and the LED on the HBA is fine, but why not the LED on the switch?

Hi all,

First time posting here, so I don't know what everyone needs in order to help find a solution, but I will try to provide as much as possible.

The system is running 11i, with a Tachyon 2XL card. The driver came with the system, and I've downloaded the necessary patch for the HBA (I don't remember the exact number, but I believe it was 27***, preceding the 23696 patch). Anyway, after everything was properly installed, configured, and zoned (three switches: the server is on one, three Linux boxes are on the second, and the HP-UX box is on the third; all five are in one zone), the server could not detect the WWPN of the HP-UX box.
Here is what I have at the moment:

The lights on the HBA are on, one green, one yellow/amber, so I know it's fine.

The port light on switch three, which is where my HP-UX box is connected, is NOT on. I've checked the GBIC, tried different cables, and rebooted the switch, all to no avail.

When I run "ioscan -funC fc", I get the following:
Class  I  H/W Path  Driver  S/W State  H/W Type   Description
=================================================================
fc     1  10/6/7/0  td      CLAIMED    INTERFACE  HP Tachyon TL/TS Fibre Channel Mass Storage Adapter
                            /dev/td1

As you can see, /dev/td1 is claimed.

Now, when I run "fcmsutil /dev/td1", I get the following:
Vendor ID is = 0x00103C
Device ID is = 0x001028
PCI Sub-system Vendor ID is = 0x00103C
PCI Sub-system ID is = 0x000006
Topology = IN_LOOP
Local N_Port_id is = 0x000001
Local Loop_id is = 125
N_Port Node World Wide Name = 0x50060B000007AD5B
N_Port Port World Wide Name = 0x50060B000007AD5A
Driver state = ONLINE
Hardware Path is = 10/6/7/0
Number of Assisted IOs = 0
Number of Active Login Sessions = 0

The driver state is ONLINE, so it seems fine to me...

Lastly, here is what I get for running "fcmsutil /dev/td1 stat":
Channel Statistics

Statistics From Link Status Registers ...
Loss of signal 0 Bad Rx Char 0
Loss of Sync 0 Link Fail 0
Received EOFa 0 Discarded Frame 0
Bad CRC 0 Protocol Error 0

Channel State M/C Statistics ...
LSM Link Fail State 0
FCP assists freeze failure 7
HBA Changing AL_PA Across LIP 0
Channel Reset Request 0
Channel Offline Request 0
CE_FCP_FREEZE Request 0
CE_FCP_UNFREEZE Request 0
CE_FCP_FREEZE Cancel 0
CE_LDN Event 0
CE_LUP Event 7
CE_FCP_FROZEN Event 55498

CS_FCP_FROZEN, when FCP_UNFREEZE recvd in CS_FCP_FROZEN 8

Error Idle Interrupts 0

ERQ/FCP Assists resumed 0

Loop State T/O, when LSM is not ONLINE 0

Loop State T/O, when LSM is ONLINE 0

Loop State T/O, when LSM is in ARB 0

Credit Errors 55505

LIPs due to OLS 55506

Storm Statistics ...
Elastic Store Error Storm 0
Link Fail storm . 0
LIP(f8, xx) storm . 0
Loss Of Signal Storm 131339015
Out Of Sync Storm 5453312
Link Fault Storm 0

NOS_OLS Storm 0

PCI Error Interrupt Statistics ...
MPE Interrupts 0
CRS Interrupts 0
DER Interrupts 0
PER Interrupts 0

Target State M/C Statistics ...
Target Thread Alloc. Fail 0
Invalid SOF/EOF Received 0
Inbound Unknown Frames with non-zero df_ctl 0
Multi-Frame Sequences Received 0
SFS Received With S_ID/D_ID Mismatch 0
SFS Received exceeding TD_SFTBFSZ 0

I/O Error Statistics ...
FCP Frames Recvd As Responder 0
Bad Unassisted FCP frame 0
Bad S_ID in Unassisted FCP frame 0
No cdb for I/O 0
IRB entries on ERQ 0
SCSI Linked list entries 0
Late ABTS Responses Received 0
Too many mapping entries for a single I/O 0
SF state m/c statistics...
No reply before timeout 0
SF failed due to LDN 0
SF failed due to ASN, LKF or FTO 0
SF thread Allocation failures 0
Unable to map SF payload 0
SFT in ERQ during Chip Reset 0
Late ICM from Non-compliant target 0

Most of the output looks normal to me, but what about "FCP assists freeze failure 7"? Any suggestions would be much appreciated.

Thank you,

Gene
9 REPLIES
Florian Heigl (new acc)
Honored Contributor

Re: fcmsutil comes back fine, and the LED on the HBA is fine, but why not the LED on the switch?

Is it really a loop-type setup?
You might find additional low-level errors, e.g. concerning FL_PORT or F_PORT login, via mstm -> run -> logtool.

You can also check whether the HP-UX box 'sees' anything, as in the following:
root@lala:/tmp>/opt/fcms/bin/fcmsutil /dev/td0 get remote all

Target N_Port_id is = 0x[...]
Target state = DVS_READY
Symbolic Name = EMC SYMMETRIX xxxx
Port Type = N_PORT
Target Port World Wide Name = 0x[...]
Target Node World Wide Name = 0x[...]


yesterday I stood at the edge. Today I'm one step ahead.
Florian Heigl (new acc)
Honored Contributor

Re: fcmsutil comes back fine, and the LED on the HBA is fine, but why not the LED on the switch?

Also, have a look at whether there's 'light' coming out of the adapter itself (a short look does no harm, I guess :)
yesterday I stood at the edge. Today I'm one step ahead.
Gene_33
Occasional Advisor

Re: fcmsutil comes back fine, and the LED on the HBA is fine, but why not the LED on the switch?

Thank you for the suggestions. After some further diagnosis, it seems the fault lies with the switch. I don't know exactly what's wrong with it, but it seems only the uplink is able to detect anything. Time to ask for a new switch, perhaps... (a LONG shot)

Gene
Tim D Fulford
Honored Contributor

Re: fcmsutil comes back fine, and the LED on the HBA is fine, but why not the LED on the switch?

Not really an expert in this area but..
> Channel State M/C Statistics ...
> CE_FCP_FROZEN Event 55498
:
> Storm Statistics ...
> Loss Of Signal Storm 131339015
> Out Of Sync Storm 5453312

These look suspiciously like signals are being lost!!! If you are getting green lights on the host, I'm guessing the port may be the problem. Try using another port?
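
For anyone reading a long "fcmsutil ... stat" dump, a rough way to surface the suspicious counters is to list every non-zero value, largest first. The following is only a sketch in Python: it works on the text pasted earlier in this thread (an excerpt is embedded below), the function name and the regular expression are made up for illustration, and it does not run fcmsutil itself or judge which values are actually harmful.

```python
import re

# Excerpt of the "fcmsutil /dev/td1 stat" output quoted in this thread.
STAT_OUTPUT = """\
Loss of signal 0 Bad Rx Char 0
Loss of Sync 0 Link Fail 0
FCP assists freeze failure 7
CE_LUP Event 7
CE_FCP_FROZEN Event 55498
Credit Errors 55505
LIPs due to OLS 55506
Loss Of Signal Storm 131339015
Out Of Sync Storm 5453312
Link Fault Storm 0
"""

# Assumption: a counter is a text label followed by an integer, and two
# counters may share one line. Labels that themselves contain digits
# are not handled by this simple pattern.
COUNTER = re.compile(r"([A-Za-z_][A-Za-z_/(),. ]*?)\s+(\d+)(?=\s|$)")


def nonzero_counters(text):
    """Return (label, value) pairs with value > 0, largest value first."""
    found = []
    for line in text.splitlines():
        for label, value in COUNTER.findall(line):
            if int(value) > 0:
                found.append((label.strip(), int(value)))
    return sorted(found, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for label, value in nonzero_counters(STAT_OUTPUT):
        print(f"{value:>12}  {label}")
```

On the excerpt above, the two storm counters come out at the top of the list, well ahead of "FCP assists freeze failure 7".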

Regards

Tim
-
Florian Heigl (new acc)
Honored Contributor

Re: fcmsutil comes back fine, and the LED on the HBA is fine, but why not the LED on the switch?

What brand/model switch is it?

It sounds like the other ports were disabled...
yesterday I stood at the edge. Today I'm one step ahead.
Mic V.
Esteemed Contributor

Re: fcmsutil comes back fine, and the LED on the HBA is fine, but why not the LED on the switch?

Regarding the "quick look" into the cable, may I suggest shining it on a piece of paper? I figure our eyes get enough abuse as it is... ;-)
What kind of a name is 'Wolverine'?
Gene_33
Occasional Advisor

Re: fcmsutil comes back fine, and the LED on the HBA is fine, but why not the LED on the switch?

I've now tried the HP-UX machine again on an eight-port switch that has no zoning capability, with only the server attached. The result: the server failed to detect the HP-UX box's WWPN. I've downloaded the latest patch for the HBA, so I think I might need to move the HBA to a different slot, though I thought the s7000 series had already eliminated this problem... =(

Gene
Florian Heigl (new acc)
Honored Contributor

Re: fcmsutil comes back fine, and the LED on the HBA is fine, but why not the LED on the switch?

I overlooked those thousands of loss-of-sync events earlier. Until now I've only seen this a few times, due either to a 1 Gbit/2 Gbit speed mismatch or to simply broken HBAs.

Also, you wrote that the last switch had no zoning capability, so the others did? I'd then think you're running a fabric configuration?
If yes, the HBA didn't log in properly, judging by the Topology = IN_LOOP that fcmsutil reported.
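
To make that check concrete, here is a small Python sketch that reads the "key = value" lines from the fcmsutil output posted at the top of the thread and notes the two fields in question. It is only an illustration: the helper names are invented, the input is the pasted text rather than a live fcmsutil call, and treating a loop topology as unexpected assumes the switch port is supposed to come up in fabric mode.

```python
# Excerpt of the "fcmsutil /dev/td1" output quoted in the original post.
INFO_OUTPUT = """\
Topology = IN_LOOP
Local N_Port_id is = 0x000001
Local Loop_id is = 125
Driver state = ONLINE
Number of Active Login Sessions = 0
"""


def parse_info(text):
    """Turn 'key = value' / 'key is = value' lines into a dict of strings."""
    info = {}
    for line in text.splitlines():
        key, sep, value = line.partition("=")
        if not sep:
            continue  # not a key/value line
        key = key.strip()
        if key.endswith(" is"):
            key = key[:-3]  # drop the trailing " is"
        info[key] = value.strip()
    return info


def notes(info, expect_fabric=True):
    """Collect observations; expect_fabric is an assumption about the SAN."""
    found = []
    if expect_fabric and "LOOP" in info.get("Topology", ""):
        found.append("Topology reports a loop although a fabric is expected")
    if info.get("Number of Active Login Sessions") == "0":
        found.append("No active login sessions")
    return found


if __name__ == "__main__":
    for note in notes(parse_info(INFO_OUTPUT)):
        print(note)
```

With the values from this thread, both notes are printed even though the driver state reads ONLINE.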

Please do check the EMS logs via logtool.
yesterday I stood at the edge. Today I'm one step ahead.
Gene_33
Occasional Advisor

Re: fcmsutil comes back fine, and the LED on the HBA is fine, but why not the LED on the switch?

Yes, someone else pointed that out as well. I don't think 1 Gbit/2 Gbit is an issue, since according to the HP website the 2XL is capable of auto-adjusting itself accordingly. I've just moved the HBA up to slot 2, and hopefully this will solve the problem. If not, I will have to find another HBA...

Thank you all for the helpful input.

Gene