Operating System - OpenVMS
1753307 Members
6584 Online
108792 Solutions
New Discussion юеВ

ProCurve 2708 and Gigabit NIC stalling

 
Sebastian Bazley
Regular Advisor

ProCurve 2708 and Gigabit NIC stalling

2-node Cluster (ES45+DS25) running VMS 7.3-2

The cluster is connected using:
Gigabit NIC in each
ProCurve 2708 switch

We've been unable to get this working properly - it only seems to work for a very short while and then stalls.

Has anyone else had any problems configuring this?

We're hoping to try the kit out on a test system in the UK shortly.

In the meantime, any advice/experience would be most welcome...
11 REPLIES 11
Volker Halle
Honored Contributor

Re: ProCurve 2708 and Gigabit NIC stalling

Sebastion,

what do you mean with 'stalls' ?

Does the cluster hang ? Any messages on the console regarding the LAN driver and possible duplex mismatch ? Does auto-negotiation work or are both (LAN NIC and switch-port) set to the same speed and duplex mode ?

Volker.
Sebastian Bazley
Regular Advisor

Re: ProCurve 2708 and Gigabit NIC stalling

Cluster hangs.
No messages.
We have set auto on both consoles.
The switch defaults to auto and has no management interface.

It did work once for 10 minutes, and then it hung.
Volker Halle
Honored Contributor

Re: ProCurve 2708 and Gigabit NIC stalling

Sebastian,

the systems in the cluster constantly communicate via all LAN interfaces by sending SCS HELLO multicast messages every 3 seconds. Each system in the cluster would output a 'Lost connection to node' message, if it does not receive a HELLO message from the other node within about 9 seconds.

Is BROADCAST enabled on the console terminals ? $ SHOW BROADCAST ? If not, you have to enable it, to see those messages !

You could force a crash, if the cluster seems to hang: Press HALT button, then >>> CRASH

In the crash, you could check LAN counters etc. How do you determine, that the cluster 'hangs' ?

Volker.
Sebastian Bazley
Regular Advisor

Re: ProCurve 2708 and Gigabit NIC stalling

Sorry, I should have said that the cluster was working fine before the new NICs and switch were installed.

[I believe they have now reverted to the previous hardware.]

It seems to be a problem that is specific to the particular switch and NICs.

Unfortunately, we can't run any more tests until we get access to the test systems in the UK.

In the meantime, we were hoping that someone might have come across a similar problem with that specific hardware.
Allan Bowman
Respected Contributor

Re: ProCurve 2708 and Gigabit NIC stalling

You might want to disable the switch auto-negotiation and set both ports manually. Be sure to verify your network paths if you have a second NIC on either node - if STP is enabled on the switch, it could eventually shut down your expected path.

Allan in Atlanta
Sebastian Bazley
Regular Advisor

Re: ProCurve 2708 and Gigabit NIC stalling

As far as I know, the switch does not provide the facility to change anything...

HP describe it as:

Unmanaged: provides plug-and-play simplicity
David B Sneddon
Honored Contributor

Re: ProCurve 2708 and Gigabit NIC stalling

Should that not be...

Unmanaged: plug-and-pray stupidity
Colin Butcher
Esteemed Contributor

Re: ProCurve 2708 and Gigabit NIC stalling

I think you've just discovered why I don't like unmanaged switches (especially in production environments) - you can't get any information out of them, or set them up as you want to.

Which NICs are you using? Assuming they're copper then have you got the appropriate specification cables and distances for GigE over copper?

I generally use fibre and managed switches - less problems on the whole. After all, it's not cheaper to use anything else if it wastes your time and creates unreliability issues. The last thing you want in a production environment is something that doesn't work consistently and reliably.

If the systems really matter then you may want to consider dual NICs and dual switches so that you have two cluster interconnect paths. Don't forget to enable / disable the releveant protocols on all the NICs in the systems either.

Hope this help,
Cheers, Colin.
Entia non sunt multiplicanda praeter necessitatem (Occam's razor).
Peter Zeiszler
Trusted Contributor

Re: ProCurve 2708 and Gigabit NIC stalling

Can you force the NICs to 1000 Full and have the switch then autosense?

We have had issues before when having to use NICs and Switches both in autosense. They both keep changing speeds and duplex which results in dropped packets and NO communication.