Operating System - Linux
1827774 Members
2591 Online
109969 Solutions
New Discussion

Re: NFS server not responding

 
Alex Lavrov.
Honored Contributor

NFS server not responding

Hello,

I have an nfs filesystem mounted on linux RHEL3 from HPUX.

Every 4-5 minutes I get in /var/log/messages this message:
kernel: nfs: server serv1 not responding, timed out

in "nfsstat -c" I see that:
calls retrans
708028 77304

It's not so good. What are the steps that I can take to solve this problem?

The network cards on both servers configured on 1000, Full, Autoneg ON.

No problems in pign either.


Thanx.
I don't give a damn for a man that can only spell a word one way. (M. Twain)
10 REPLIES 10
Alex Lavrov.
Honored Contributor

Re: NFS server not responding

Another thing, maybe it's related, I also get a lot of these messages:

kernel: application bug: dw.sapWT3_D00(3602) has SIGCHLD set to SIG_IGN but calls wait()
I don't give a damn for a man that can only spell a word one way. (M. Twain)
Aris Cruz
New Member

Re: NFS server not responding

Oh joy, nfs problems, they are always fun...

couple of things you can try.

1. change rsize=8192, wsize=8192 in mount options

2. RHEL3, i believe still has a default mount of udp, you can change the mount to tcp and nfsver=3

Alex Lavrov.
Honored Contributor

Re: NFS server not responding

Hello,

I'll try the first option.

About the second, it was TCP, so when the problems started we changed it to UDP it didn't help and it's already version 3.


thanx.
I don't give a damn for a man that can only spell a word one way. (M. Twain)
Dave Falloon
Trusted Contributor

Re: NFS server not responding

watch out for soft mounts. The linux NFS faq ( http://nfs.sourceforge.net/ ) covers this under the heading at the bottom:

Why do I get NFS timeouts when I mount a Linux NFS server from my Solaris NFS client?

This applies to linux clients as well. These messages are shown when a client gives up trying to connect the the nfs server, this only happens with soft mounts, a hard mount has no timeout value. Two settings to mitigate this are "-o proto=tcp,hard" Since you are using Gig E you should also adjust the performance options.

--Dave
Clothes make the man, Naked people have little to no effect on society
Alex Lavrov.
Honored Contributor

Re: NFS server not responding

Hmm, good point. I use soft mounts.

Before I assign points, I'll check it :)
I don't give a damn for a man that can only spell a word one way. (M. Twain)
Dave Falloon
Trusted Contributor

Re: NFS server not responding

Please do as there could be other reasons for timeouts such as process limits on the server, sometimes you can run low on filehandles and start bouncing connections, etc.

--Dave
Clothes make the man, Naked people have little to no effect on society
rick jones
Honored Contributor

Re: NFS server not responding

You might start looking at switch stats, ethtool stats (linux side) and lanadmin stats on the UX side. Also, netstat -s on linux and netstat -p udp and netstat -p tcp (depending on the flavor of the mount) on the UX side. Might toss-in a netstat -p ip if using UDP.

The idea is to see where packets are being lost and leading to the retransmitted NFS requests.

I don't have one for linux yet, but

ftp://ftp.cup.hp.com/dist/networking/briefs/annotated_netstat.txt

may be of some help.
there is no rest for the wicked yet the virtuous have no pillows
Steven E. Protter
Exalted Contributor

Re: NFS server not responding

Is the physical networking solid on this box?


I would suggest turning Autonegotiation to off and see if it gets any better. I would also see if other boxes have NFS problems with this box, perhaps even an HP-9000 or Solaris box.

The client and server nfs software should be up2date or yum current.

SEP
Steven E Protter
Owner of ISN Corporation
http://isnamerica.com
http://hpuxconsulting.com
Sponsor: http://hpux.ws
Twitter: http://twitter.com/hpuxlinux
Founder http://newdatacloud.com
Ivan Ferreira
Honored Contributor

Re: NFS server not responding

We had connection problems with an ethernet adapter on linux, and we disabled the apic in the kernel and the problem was solved.

linux noapic

I also agree that the autonegotiation should be turned off, and configure fixed speed on both, the switch and the server.
Por que hacerlo dificil si es posible hacerlo facil? - Why do it the hard way, when you can do it the easy way?
rick jones
Honored Contributor

Re: NFS server not responding

In the original post, the speed was said to be "1000" which I would take to mean gigabit. In that case, autoneg _must_ be on - gigabit requires it.
there is no rest for the wicked yet the virtuous have no pillows