08-13-2009 04:41 PM
LAN access fails every after 7 days
We've been having a recurring issue with one of our rp7400s. LAN access fails every week, and we have to reboot the server to restore it.
I have the following NICs installed:
LAN0 - 10/100 Mbps - telnet access
LAN7/8 - APA LAN900 - NFS link
LAN1/2 - FDDI - Backup link
Currently, LAN7 has issues with its link, so whenever we boot the machine, APA complains that one link is down and LAN8 takes over the APA group.
For the next 6 days, network access is fine. Then on the 7th day, we start to see packet loss in ping tests. Shortly after that, access to LAN0 and LAN900 is dead.
I've been toying with the idea that dead gateway detection (DGD) is involved, but I'm not really sure about it.
The thing is, when we lose the connection, LAN8 logs errors in nettl saying its link has also gone down. That leaves LAN900 with no active links, so it goes down as well.
So I was thinking that, since DGD is enabled on our server, it could very well have been triggered by LAN900 going down.
However, LAN0 is inaccessible only from other subnets; users on the same subnet can still access the server.
I need ideas to bounce off. I would really appreciate your input.
TIA!
08-13-2009 06:05 PM
Re: LAN access fails every after 7 days
To see all gateways you could use ip_ire_status
ndd -get /dev/ip ip_ire_status | grep -e IRE_GATEWAY -e flag
This results in a list of all gateways, the flags will indicate a dead gateway.
Check the current value:
ndd -get /dev/ip ip_ire_gw_probe
Disable Dead Gateway Detection:
ndd -set /dev/ip ip_ire_gw_probe 0
nddconf entry example:
TRANSPORT_NAME[3]=ip
NDD_NAME[3]=ip_ire_gw_probe
NDD_VALUE[3]=0
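Each nddconf entry carries an index, so a new entry needs an index that is not already in use in the file. As a rough sketch only (the helper name next_ndd_index is hypothetical, and it assumes each NDD_NAME entry starts at the beginning of a line, as in the example above), the next free index could be worked out like this:

```shell
# Sketch: print the next unused index for a new nddconf entry.
# Reads nddconf-style text on stdin. next_ndd_index is a hypothetical
# helper name, not an HP-UX command.
next_ndd_index() {
    awk '
        # Match lines such as NDD_NAME[3]=ip_ire_gw_probe and remember
        # the highest index seen. Commented-out lines do not match.
        /^NDD_NAME\[[0-9]+\]=/ {
            idx = $0
            sub(/^NDD_NAME\[/, "", idx)   # drop everything up to the index
            sub(/\].*$/, "", idx)         # drop everything after the index
            if (idx + 0 > max) max = idx + 0
            found = 1
        }
        # With no entries at all, start numbering at 0.
        END { print (found ? max + 1 : 0) }
    '
}
```

For example, `next_ndd_index < /etc/rc.config.d/nddconf` would print 4 if index 3 is the highest one already in use.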
Bill Hassell, sysadmin
08-13-2009 06:08 PM
Re: LAN access fails every after 7 days
To see all gateways you could use ip_ire_status
ndd -get /dev/ip ip_ire_status | grep -e IRE_GATEWAY -e flag
This results in a list of all gateways, the flags will indicate a dead gateway.
Check the current value:
ndd -get /dev/ip ip_ire_gw_probe
Disable Dead Gateway Detection:
ndd -set /dev/ip ip_ire_gw_probe 0
Bill Hassell, sysadmin
08-16-2009 03:36 PM
Re: LAN access fails every after 7 days
The command
ndd -get /dev/ip ip_ire_status | grep -e IRE_GATEWAY -e flag
produces the following output:
IRE rfq stq addr mask src gateway mxfrg rtt ref type flag
000000004cb4f388 0000000000000000 0000000000000000 000.000.000.000 00000000 134.144.141.008 134.144.136.050 01500 00000 000 IRE_GATEWAY
No dead gateways here. And yes, DGD is enabled on this machine. We'll be disabling it from the shell and in nddconf.
Is it possible that traffic from LAN8 (LAN900) is 'transferred' to LAN0 when LAN8 dies? Currently, we're seeing intermittent LAN8 link drops, as shown in syslog.log:
APA/LM: FOG:lan900 - lan8 is down
APA/LM: FOG:lan900 is down
APA/LM: FOG:lan900 - lan8 is up (lan8 is active)
APA/LM: FOG:lan900 is up (lan8 is active)
APA/LM: FOG:lan900 - lan8 is down
APA/LM: FOG:lan900 is down
APA/LM: FOG:lan900 - lan8 is up (lan8 is active)
APA/LM: FOG:lan900 is up (lan8 is active)
When this happens, ping statistics to other servers are the first to be affected, then NFS connections start failing, then access to LAN0 dies. Any ideas?
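To get a rough picture of how often the link flaps, the down events could be counted per member port. This is a minimal sketch that assumes the message format shown above; count_apa_down is a hypothetical helper name, and real syslog lines with a timestamp and hostname prefix should work the same way as long as the "APA/LM:" text is unchanged:

```shell
# Sketch: count APA member-port "down" events per port.
# Reads syslog text on stdin. count_apa_down is a hypothetical helper name.
count_apa_down() {
    awk '
        # Member-port events look like:
        #   APA/LM: FOG:lan900 - lan8 is down
        # Aggregate-level lines ("FOG:lan900 is down") have no " - lanN"
        # part and are skipped.
        /APA\/LM:/ && / - lan[0-9]+ is down/ {
            for (i = 1; i <= NF; i++)
                if ($i == "-") { down[$(i + 1)]++; break }
        }
        END { for (port in down) print port, down[port] }
    ' | LC_ALL=C sort
}
```

Running `count_apa_down < /var/adm/syslog/syslog.log` prints one line per port with its count. For the eight messages quoted above, the result would be `lan8 2`.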
08-16-2009 04:11 PM
Re: LAN access fails every after 7 days
> APA/LM: FOG:lan900 is down
> APA/LM: FOG:lan900 - lan8 is up (lan8 is active)
These look very ominous. I would not enable LAN8 until the network problems are resolved.
Bill Hassell, sysadmin
08-16-2009 04:16 PM
Re: LAN access fails every after 7 days
Is Wireshark the only way to go? I'm not sure I can get it installed on this machine, though.
08-16-2009 11:47 PM
Re: LAN access fails every after 7 days
You can use tcpdump (which has much simpler requirements) or even HP-UX's native tools to create a network trace on the system that has the problem, then move the trace file to another host (e.g. your personal workstation) and use Wireshark there to analyze the stored trace. Wireshark can read most common network trace file formats.
Taking a network trace on HP-UX with no extra software installed:
http://www.compute-aid.com/nettl.html
MK
08-17-2009 03:17 AM
Re: LAN access fails every after 7 days
I would drop the APA configuration completely. That's a lot of complicated software that is essentially doing nothing (I assume that LAN7 and LAN8 are the only members).
As for Wireshark, I would install it on a laptop (which is much easier and simpler to set up) rather than on HP-UX. Then you can use tcpdump or even nettl to perform the traces. Wireshark reads the output of virtually every packet trace program there is.
What do the logs on your routers show? I would turn on stats for the problem ports and get the network administrators to trace the problem. In general, a LAN down message means that fundamental network connectivity has been lost. You may have a bad switch or router that is causing the issue. Try a different port on the switch. Is there a forklift truck running over the LAN cables every week?
Bill Hassell, sysadmin
08-20-2009 05:52 PM
Re: LAN access fails every after 7 days
Just an additional question about the server's routing table. Here's what we have:
Routing tables
Destination      Gateway          Flags Refs Interface Pmtu
127.0.0.1        127.0.0.1        UH    0    lo0       4136
134.144.188.38   134.144.188.38   UH    0    lan2      4136
134.144.141.8    134.144.141.8    UH    0    lan0      4136
134.144.188.166  134.144.188.166  UH    0    lan900    4136
134.144.141.20   134.144.188.165  UGH   0    lan900    1500
134.144.141.17   134.144.188.162  UGH   0    lan900    1500
134.144.141.16   134.144.188.161  UGH   0    lan900    1500
134.144.141.19   134.144.188.164  UGH   0    lan900    1500
134.144.141.18   134.144.188.163  UGH   0    lan900    1500
134.144.141.12   134.144.188.34   UGH   0    lan2      4352
134.144.141.11   134.144.188.33   UGH   0    lan2      4352
134.144.188.32   134.144.188.38   U     2    lan2      4352
134.144.188.160  134.144.188.166  U     2    lan900    1500
134.144.128.0    134.144.141.8    U     2    lan0      1500
127.0.0.0        127.0.0.1        U     0    lo0       4136
default          134.144.136.50   UG    0    lan0      1500
I am confused about the routes pointing to the LAN900 interface; there are several of them. Can you explain how this works?
Thanks!
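As a reading aid for the table above: in the Flags column, U means the route is up, G means it goes through a gateway, and H means it is a route to a single host. The UGH entries on lan900 are therefore host routes that send traffic for individual 134.144.141.x addresses to a gateway on the 134.144.188.160 network instead of out through lan0. The following is only a sketch for summarizing such a table per interface; summarize_routes is a hypothetical helper name, and it assumes the six-column layout shown above:

```shell
# Sketch: summarize "netstat -rn" style output per interface.
# Prints: interface, total routes, host routes via a gateway (UGH).
# summarize_routes is a hypothetical helper name.
summarize_routes() {
    awk '
        # Data rows have six fields and a Flags value starting with U;
        # this skips the "Routing tables" and column header lines.
        NF == 6 && $3 ~ /^U/ {
            total[$5]++
            if ($3 ~ /G/ && $3 ~ /H/) via_gw_host[$5]++
        }
        END {
            for (ifc in total)
                print ifc, total[ifc], via_gw_host[ifc] + 0
        }
    ' | LC_ALL=C sort
}
```

With `netstat -rn | summarize_routes`, each output line gives an interface name, its total route count, and how many of those are host routes through a gateway. For the table above, lan900 would show 7 routes, 5 of them UGH.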
08-21-2009 12:16 AM
Re: LAN access fails every after 7 days
Try "netstat -rnv" and post the output here.
MK