- Community Home
- >
- Networking
- >
- Switching and Routing
- >
- LAN Routing
- >
- Random static routing issue on HP 3500/6600
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
07-05-2021 12:52 AM
07-05-2021 12:52 AM
Random static routing issue on HP 3500/6600
Hello all,
I'm having a very strange and random (static) routing issue from one internal network (LAN A) to other 2 (local) networks.
"LAN A" has a 6600 HP switch as the router/GW (static routing) for this LAN (VLAN 100). All hosts have a default GW to this router (so they connect to the other LANs via this router).
"LAN B" (a 3500 HP router) has multiple VLANs and connects to "LAN A" via VLAN 1000 (for the link between the 2 routers). All hosts have a default GW pointing to their routers in their VLANs (which are configured in the 3500 HP router).
"LAN C" is in VLAN 200 and connects to "LAN A" through an HP 2810 switch uses the same 6600 as the router for reaching VLAN 100. All hosts in "LAN C" have a static routeto reach "LAN A" that points to the 6600 router.
While monitoring (ping) some hosts from LAN A to the other LANs, I might see some pings that fail (either "unreachable" or very high values) but is so random that it can take days for the errors to show. And I can't replicate it at will... it just happens. (btw, network load is usually the same).
These pings (3 packets) that fail are not the first ones, meaning, usually another host on LAN A already successfully ping some host in the other network and after that one (random) host will fail the ping. So I'm ruling out any "cache" or MAC issue.
Also each host in LAN A pings multiple hosts, but only one of them might fail. So that also kind of rules out issues with the host itself.
The configuration of this router is almost the default one, except for the VLANs and the static routing.
And, again, it might take days until I see this behavior (I run pings every 3 hours).
I reinstalled the 6600 router from scratch, upgraded the firmware to the latests on from HP and from Aruba... I even tried a different HP router... no luck so far.
Does anyone have a suggestion of what I could try? I'm running out of ideas...
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
07-05-2021 05:47 AM
07-05-2021 05:47 AM
Re: Random static routing issue on HP 3500/6600
Hello stlm,
I would start with the same test (ping) but within the same vlan without going between the vlans in order to check if the issue can be seen without being routed.
Hope this helps!
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
07-05-2021 10:44 PM
07-05-2021 10:44 PM
Re: Random static routing issue on HP 3500/6600
Hello Alex,
thanks for the suggestion, but I'm already doing that, the script also pings 2 IPs in the very same VLAN/LAN with no issues so far.
In fact we identified this issue because we added a script that uses ssh to connect from "LAN A" to the other LANs and from time to time we got some ssh errors (unreachable), while we never saw that in the very same VLAN/LAN.
Regards,
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
07-06-2021 07:32 AM
07-06-2021 07:32 AM
Re: Random static routing issue on HP 3500/6600
Hello stlm,
If you do a packet capture on the remote host which is failing do you see the ICMP requests coming and replyes going out?
Can you do such test?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
07-07-2021 02:08 AM
07-07-2021 02:08 AM
Re: Random static routing issue on HP 3500/6600
Hello Alex,
thank you for your reply.
The problem is the randomness of the issue. I did try a sniffer on (on some source/destination hosts), but as the problem didn't appear, I needed to end it (too many logs). The same happened when I mirrored the router's interface.
That's why this issue is so annoying, because I can't replicate it at will...
Regards,
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
07-07-2021 04:54 AM - edited 07-07-2021 04:54 AM
07-07-2021 04:54 AM - edited 07-07-2021 04:54 AM
Re: Random static routing issue on HP 3500/6600
Hello stlm,
You may try to limit the capture with a filter to ICMP including only the ip addresses of interest.
This way you will not have additional triaffic captured.
In wireshark it is called capture filter (not the display filter) - it should be set before capture start.
On the router you may try to use acl to count or to use the following way to mirror:
HP 5830 Switch Series - Configuring Traffic Mirroring (hpe.com)
Also you may search for some drops in the counters of the routers under the interfaces.
Hope this helps!
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
07-08-2021 04:59 AM
07-08-2021 04:59 AM
Re: Random static routing issue on HP 3500/6600
Hi Alex,
thank you for the suggestions.
About the drops, I see 54 "Tx drops", as I'm not sure the implication of those drops and when they occurred, I'm monitoring the statistics for that ports (after clearing the stats, although the clearing is only valid for a single session).
Also thank you for the link, I will give that a try, as well as the filtering, which I was about to test, but wasn't sure if ICMP alone (with the affected IP addresses) would've be enough (wasn't sure if any other kind of protocol is involved, as my first test for each host is to test a tcp port in the remote host, and then a ping..., which sometimes the ping works after the tcp test fails).
cheers,
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
07-12-2021 03:07 AM
07-12-2021 03:07 AM
Re: Random static routing issue on HP 3500/6600
Hello stlm,
DId you find if the ICMP request is reaching the devices?
The ICMP with few drops is not always a problem as in a lot of devices the ping itself is with very low preference so if the other services are running fine this should not be a problem. The TCP and UDP and other protocols which are important for the needed services between the hots are more important as you mentioned. If you see very big per cent of drops in ICMP but usually from hundreds of pings this could also be sign of some issue. I hope you found more about the dropped packets. If you have any questions in regards to the topic please let me know.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
07-12-2021 04:07 AM
07-12-2021 04:07 AM
Re: Random static routing issue on HP 3500/6600
Hi Alex,
was about to add some more info and I see you replied to my previous comment, thanks again.
Last week it failed 1 test (after 7 days without any issues), and also today at 4:00 (is an internal network, and at that time there is no load at all...)
The test that failed was ssh from one host to another (only the first host, the rest were all ok), and right after that the ping was fine.
I had port mirroring and a sniffer running and I see that host A sent 4 ARP requests (asking for the gateway's MAC) and after the 4th request I see the pings.
I see a reply between those 4 ARP requests, but that might have been another host (that also made the same request). So I can't be 100% that it was related to host A... (I can't see the destination for the reply)
but, host A sent 4 ARP requests... and that may account for:
req 1 to 3 = tests to ssh port, as there are 3 attempts (seq # 0,1,2)
req 4 = first ping, which succeded.
So that would mean that the first 3 requests failed, and then the 4th succeded (I see an arp reply a few miliseconds after that last request).
Could that be the problem? that some ARP requests get "lost" (still, I'm sniffing the mirrored router port, so I assume the router actually sees the requests...) and that's why I get this issue? is there any fix on that?
(Although that won't explain why sometimes after a failure, the first pings that succeded have very high times...)
Since today I'm running scheduled dumps, and also added verbose for the ARP protocol, in the hope I can see the destination host of the ARP replies.
Regards,
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
07-15-2021 03:08 AM - edited 07-16-2021 01:29 AM
07-15-2021 03:08 AM - edited 07-16-2021 01:29 AM
Re: Random static routing issue on HP 3500/6600
@stlm wrote: prepaidcardstatus.comHi Alex,
was about to add some more info and I see you replied to my previous comment, thanks again.
Last week it failed 1 test (after 7 days without any issues), and also today at 4:00 (is an internal network, and at that time there is no load at all...)
The test that failed was ssh from one host to another (only the first host, the rest were all ok), and right after that the ping was fine.
I had port mirroring and a sniffer running and I see that host A sent 4 ARP requests (asking for the gateway's MAC) and after the 4th request I see the pings.
I see a reply between those 4 ARP requests, but that might have been another host (that also made the same request). So I can't be 100% that it was related to host A... (I can't see the destination for the reply)
but, host A sent 4 ARP requests... and that may account for:
req 1 to 3 = tests to ssh port, as there are 3 attempts (seq # 0,1,2)
req 4 = first ping, which succeded.
So that would mean that the first 3 requests failed, and then the 4th succeded (I see an arp reply a few miliseconds after that last request).
Could that be the problem? that some ARP requests get "lost" (still, I'm sniffing the mirrored router port, so I assume the router actually sees the requests...) and that's why I get this issue? is there any fix on that?
(Although that won't explain why sometimes after a failure, the first pings that succeded have very high times...)
Since today I'm running scheduled dumps, and also added verbose for the ARP protocol, in the hope I can see the destination host of the ARP replies.
Regards,
The TCP and UDP and other protocols which are important for the needed services between the hots are more important as you mentioned. If you see very big per cent of drops in ICMP but usually from hundreds of pings this could also be sign of some issue. I hope you found more about the dropped packets. If you have any questions in regards to the topic please let me know.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
07-20-2021 12:10 AM
07-20-2021 12:10 AM
Re: Random static routing issue on HP 3500/6600
Hi VainFox,
actually the drops (when they happen, as they are very random) I see are from the tests I run, and only when the routing is still not fully functional between 2 hosts. Once the routing is properly established, there are no drops.
That means the drops only happen in the first pings, then the rest are ok.
Is a very strange issue...
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
07-22-2021 01:09 AM
07-22-2021 01:09 AM
Re: Random static routing issue on HP 3500/6600
Hello,
sorry to come back to this, but this still baffles me.
Yesterday night it happened again, but this time I was able to catch it.
host-a in LAN-1 sends a ping req to host-b in LAN-2.
I see the ICMP req in host-a sniffer, as well as in the router, but host-b doesn't get the ping req (no req entry for that host in the sniffer).
Does this mean that the router is not forwarding those single ICMP req, although it "sees" the packet?
If that's the case, I might see some Tx Drops on the interface to LAN-2 ( where host-b is), right?
But, I only see Tx Drops on the interface that goes to LAN-1 (where host-a is).
The interface in LAN-2 has 0 TX drops...
Still, could it be the router itself, even whe it sees the ICMP req, that is not forwarding it to the right interface?
again, this happened only once in the last 4-5 days... and for only one single source host to a single host...
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
07-23-2021 07:32 AM
07-23-2021 07:32 AM
Re: Random static routing issue on HP 3500/6600
Hello stlm,
On which router the routing is performed? Is it on one or routing for one vlan is on one router but the routing for the other vlan is on another router? If you have two routers maybe both should be checked. Also do you see the arp on the hosts and routers do they have the proper records in arp table?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
07-26-2021 11:49 PM
07-26-2021 11:49 PM
Re: Random static routing issue on HP 3500/6600
Hello Alex,
there's is basically one router (LAN A) that connects to 3 different networks (B, C, D).
LAN B is routed via another router, that also has an IP in LAN A.
LAN C is routed via another router, that has a VLAN (only seen by the 2 routers) for the link between LAN A and LAN C.
LAN D is routed using the same router in LAN A, as this router has an IP from LAN D
So there are different scenarios. And at some point, I saw the same issue on all (one at a time).
ARP looks fine, but as the issue is random, I can't say for sure that at the very moment the issue happened, the arp was there... but the test I run involve multiple hosts, and 99% of the time when the issue occurs, there's only one that fails, the rest are just fine. Also at some point I manually added the ARP entry between the routers... but the issue was still there.
Regards,