Switches, Hubs, and Modems
cancel
Showing results for 
Search instead for 
Did you mean: 

Fiber channel - random loss of link (4208vl)

Arcadier
Occasional Visitor

Fiber channel - random loss of link (4208vl)

Hi,

I've changed my Switches netgear -> Procurve in April. Now, I have a problem with my fiber channel links (since the changes).

I have five 4208vl Switches that are linked by fiber channel (trunk on 2 ports - no STP, all the Switches are connected on one of them). Every day, we can see from one to five loss of link (between 10 and 14 seconds) on one of these fiber links and there is loss of link on other fiber ports but it's more rare (2 or 3 in a week).
The trunk loss the links (the 2 ports where are linked the fibers channel are down).

I've tried to change modules and mini-GBIC with spares, no result...

Switches and accessories (same on all Switches) :
ProCurve Switch 4208vl-72GS (J9030A)
ProCurve Switch vl 4-Port Mini-GBIC Module (J8776A)
ProCurve Gigabit-SX-LC Mini-GBIC (J4858C)

I have netware and Windows servers and there are 2 netgear Gb Switches left.
I have Procurve Manager plus 2.3 but I do not yet mastered it.

Any idea ?

Thank you.
8 REPLIES
cenk sasmaztin
Honored Contributor

Re: Fiber channel - random loss of link (4208vl)

hi

can you use what type fiber optik cable
and how many meters fiber cabling between switches

cenk
cenk

Olaf Borowski
Respected Contributor

Re: Fiber channel - random loss of link (4208vl)

Arcadier,

Don't call the fiber optic links "fiber channel". This is a term used in the storage world. In the networking world, it's just a GIG Fiber link. Depending on what distance you want to cover, you have different options. SX, LX, LR, etc. The optics also have to match the fiber optic cabling you are using. Example: You should not use SX optic with single mode fiber. Assuming this is all okay, look at the event log of the switch. Usually the event log gives you a pretty good idea on what is going on. Depending on your topology, you might have to enable spanning tree. What software version are you running?
Arcadier
Occasional Visitor

Re: Fiber channel - random loss of link (4208vl)

Sorry, I have not a lot of time to look at the forum.

Olaf, thank you for these details.
I don't need spanninf tree, there's no loop on my network, just multi links between Switches (trunk activated)
I've installed the 4208vl Switches with L_11.09 software version.

Cenk, sorry but I'm not on site right now, I know that fiber links are less than 100 meters long and that's multimode SX link (I don't have the exact reference)

The logs of the Switch just say "loss of link on port B2" (or another port, or on Trk1, 2, 3...).
There's messages on others ports (linked with end users computers) for broadcast, collisons or mismatch duplex (the IT team on site is looking for that ports).

The PCM+ (and another monitoring sofware called OPManager) show a network utilization of 10% max (about 1% average).
cenk sasmaztin
Honored Contributor

Re: Fiber channel - random loss of link (4208vl)

hi Arcadier
please send me all topology layout and all switch sh tech command print

cenk
cenk

Arcadier
Occasional Visitor

Re: Fiber channel - random loss of link (4208vl)

All servers are connected on Gb mods (and forced to Auto-1000)
Workstations are connceted on 10/100 modules and not forced except if necessary (some are forced to 10Mb HDx)
Fiber is forced 1000SDx with Flow Control

Now (for tests) the trunk is deactivated and there is only one link between each Switch

Topology and config are joined.
cenk sasmaztin
Honored Contributor

Re: Fiber channel - random loss of link (4208vl)

hi Arcadier
1-I see two switch 1609rdc and 1609ss very strog log warning message for fiber module slots H and A

you have switch model not J9030A you have switch J8773A 4200vl-8
and you have sx module model number J4858B not J4858C
I think you can use J4858C SX module for fiber connection

2-I see in flash primary L.11.09 secondary L.10.23
I understant you make software update L.10.23 after L.11.09..... very danger
software update must be one by one 10.23 after 11.08 after 11.09

show flash
Image Size(Bytes) Date Version
----- ---------- -------- -------
Primary Image : 4459852 01/23/08 L.11.09
Secondary Image : 4132576 02/23/07 L.10.23
Boot Rom Version: L.10.02
Current Boot : Primary

3-My solution
a-you can change sx module with J4858C
b-you can update L.11.08 and downlaod secondary flash
and boot switch secondary flash running switch for a time
a few minute after boot primary flash and running switch normal
c-if not resolve your problem please change (1609rdc and 1609ss)
swich chassis

good luck......


***********************1609rdc*****************************
W 06/17/08 14:07:09 chassis: Lost Communication with Slot H
I 06/17/08 14:07:09 ports: trunk Trk2 is now inactive
I 06/17/08 14:07:09 ports: port H1 in Trk2 is now off-line
I 06/17/08 14:07:10 chassis: Slot H Downloading
I 06/17/08 14:07:13 chassis: Slot H Download Complete
W 06/17/08 14:07:22 chassis: Slot H Software exception at msgSys_drv.c:529 -- in
'eDrvPoll', task ID = 0x41e9f708
-> internal error

W 06/17/08 15:15:24 chassis: Lost Communication with Slot H
I 06/17/08 15:15:24 ports: trunk Trk2 is now inactive
I 06/17/08 15:15:24 ports: port H1 in Trk2 is now off-line
I 06/17/08 15:15:26 chassis: Slot H Downloading
I 06/17/08 15:15:29 chassis: Slot H Download Complete
W 06/17/08 15:15:38 chassis: Slot H Software exception at msgSys_drv.c:529 -- in
'eDrvPoll', task ID = 0x41e9f708
-> internal error

W 06/17/08 15:49:10 chassis: Lost Communication with Slot H
I 06/17/08 15:49:10 ports: trunk Trk2 is now inactive
I 06/17/08 15:49:10 ports: port H1 in Trk2 is now off-line
I 06/17/08 15:49:11 chassis: Slot H Downloading
I 06/17/08 15:49:14 chassis: Slot H Download Complete
W 06/17/08 15:49:23 chassis: Slot H Software exception at msgSys_drv.c:529 -- in
'eDrvPoll', task ID = 0x41e9f708
-> internal error

W 06/17/08 18:46:56 chassis: Lost Communication with Slot H
I 06/17/08 18:46:56 ports: trunk Trk2 is now inactive
I 06/17/08 18:46:56 ports: port H1 in Trk2 is now off-line
I 06/17/08 18:46:57 chassis: Slot H Downloading
I 06/17/08 18:47:00 chassis: Slot H Download Complete
W 06/17/08 18:47:09 chassis: Slot H Software exception at msgSys_drv.c:529 -- in
'eDrvPoll', task ID = 0x41e9f708
-> internal error

W 06/18/08 04:24:09 chassis: Lost Communication with Slot H
I 06/18/08 04:24:09 ports: trunk Trk2 is now inactive
I 06/18/08 04:24:09 ports: port H1 in Trk2 is now off-line
I 06/18/08 04:24:10 chassis: Slot H Downloading
I 06/18/08 04:24:13 chassis: Slot H Download Complete
W 06/18/08 04:24:22 chassis: Slot H Software exception at msgSys_drv.c:529 -- in
'eDrvPoll', task ID = 0x41e9f708
-> internal error

W 06/18/08 08:47:00 chassis: Lost Communication with Slot H
I 06/18/08 08:47:00 ports: trunk Trk2 is now inactive
I 06/18/08 08:47:00 ports: port H1 in Trk2 is now off-line
I 06/18/08 08:47:01 chassis: Slot H Downloading
I 06/18/08 08:47:04 chassis: Slot H Download Complete
W 06/18/08 08:47:13 chassis: Slot H Software exception at msgSys_drv.c:529 -- in
'eDrvPoll', task ID = 0x41e9f708
-> internal error

W 06/18/08 09:01:06 chassis: Lost Communication with Slot H
I 06/18/08 09:01:06 ports: trunk Trk2 is now inactive
I 06/18/08 09:01:06 ports: port H1 in Trk2 is now off-line
I 06/18/08 09:01:07 chassis: Slot H Downloading
I 06/18/08 09:01:10 ports: port F2 is now off-line
I 06/18/08 09:01:11 chassis: Slot H Download Complete
I 06/18/08 09:01:12 ports: port F2 is now on-line
I 06/18/08 09:01:13 ports: port F2 is now off-line
I 06/18/08 09:01:15 ports: port F2 is now on-line
W 06/18/08 09:01:20 chassis: Slot H Software exception at msgSys_drv.c:529 -- in
'eDrvPoll', task ID = 0x41e9f708
-> internal error

***********************************************************


**********************1609ss*******************************
W 06/17/08 18:46:54 chassis: Lost Communication with Slot A
I 06/17/08 18:46:54 ports: trunk Trk3 is now inactive
I 06/17/08 18:46:54 ports: port A1 in Trk3 is now off-line
I 06/17/08 18:46:56 chassis: Slot A Downloading
I 06/17/08 18:46:59 chassis: Slot A Download Complete
W 06/17/08 18:47:08 chassis: Slot A Software exception at msgSys_drv.c:529 -- in
'eDrvPoll', task ID = 0x41e9f708
-> internal error

I 06/18/08 09:09:05 ports: trunk Trk3 is now inactive
I 06/18/08 09:09:05 ports: port A1 in Trk3 is now off-line
I 06/18/08 09:09:07 chassis: Slot A Downloading
I 06/18/08 09:09:10 chassis: Slot A Download Complete
W 06/18/08 09:09:19 chassis: Slot A Software exception at msgSys_drv.c:529 -- in
'eDrvPoll', task ID = 0x41e9f708
I 06/18/08 09:09:43 chassis: Slot A Ready
I 06/18/08 09:09:53 ports: trunk Trk3 is now active
I 06/18/08 09:09:53 ports: port A1 in Trk3 is now on-line

W 06/18/08 14:56:39 chassis: Lost Communication with Slot A
I 06/18/08 14:56:39 ports: trunk Trk3 is now inactive
I 06/18/08 14:56:39 ports: port A1 in Trk3 is now off-line
I 06/18/08 14:56:40 chassis: Slot A Downloading
I 06/18/08 14:56:43 chassis: Slot A Download Complete
W 06/18/08 14:56:52 chassis: Slot A Software exception at msgSys_drv.c:529 -- in
'eDrvPoll', task ID = 0x41e9f708
-> internal error
I 06/18/08 14:57:16 chassis: Slot A Ready
I 06/18/08 14:57:26 ports: trunk Trk3 is now active
I 06/18/08 14:57:26 ports: port A1 in Trk3 is now on-line
cenk

Arcadier
Occasional Visitor

Re: Fiber channel - random loss of link (4208vl)

Thank you for your answer.

Just a little things :
- I have J4858C SX module... but they are recognized like J4858B on all the Switches.
HP support says that "it's OK, the J4858C SX module has been released after the last version of the Switch, so it recognized them as J4858B modules"
They Says also that there's no problem to update from v10.xx to 11.09 without 11.08
That's what they said in the AIS Formation, so I've upgraded the software from 10.23 to 11.09... I have upgraded the Switches before configuring them.
But I'll see if we can test with the 11.08.
-> HP says to reset Switches to factory default and to disable LACP on all ports except on one where is connected a Netgear Switch
Matt Hobbs
Honored Contributor

Re: Fiber channel - random loss of link (4208vl)

For this issue:

Slot A Software exception at msgSys_drv.c:529 -- in
'eDrvPoll', task ID = 0x41e9f708
-> internal error

This is most likely being caused by sFlow sampling (PCM+ or another sFlow collector). Please disable traffic sampling in whatever sFlow application you're managing this switch with and this error should disappear.

Next, contact HP Support, advise them of this issue and ask that they send you a test build which should hopefully fix this.