1830938 Members
1720 Online
110017 Solutions
New Discussion

make_net_recovery slow

 
Bob Carback
Frequent Advisor

make_net_recovery slow

I'm running a make_net_recovery from an rx2620 and archiving onto an rx4640. The process has been running for 4 hours -- says it is making an archive that is 3.7 GB -- " The archive is estimated to reach 3771405 kbytes." so far the archive is 120 MB. During the make_net_recovery there is a lot of ---
NFS server uni-rx4640-03 not responding still trying
NFS server uni-rx4640-03 not responding still trying
NFS server uni-rx4640-03 ok

--- being displayed. A ping between the 2 servers looks good ---

ping uni-rx2620-01
PING uni-rx2620-01.unilab.digex.com: 64 byte packets
64 bytes from 10.209.83.229: icmp_seq=0. time=0. ms
64 bytes from 10.209.83.229: icmp_seq=1. time=0. ms
64 bytes from 10.209.83.229: icmp_seq=2. time=0. ms

----uni-rx2620-01.unilab.digex.com PING Statistics----
3 packets transmitted, 3 packets received, 0% packet loss
round-trip (ms) min/avg/max = 0/0/0
.

Any idea why so slow and why the NFS messages?
8 REPLIES 8
HGN
Honored Contributor

Re: make_net_recovery slow

Hi

You can try to re-export the NFS filesystem one more time
exportfs -a

You can check if this would help.

Rgds

HGN
Bob Carback
Frequent Advisor

Re: make_net_recovery slow

Thanks but that did not make a difference.
Sameer_Nirmal
Honored Contributor

Re: make_net_recovery slow

Hi,

It looks like the connection between NFS server and client is getting dropped and come up after sometime.

First of all , it is required to ensure you have latest NFS RPC patches installed on both servers.
Check the patch status using
# show-patches -a

Do you see any errors in syslog,dmesg on both servers?

Are these server on the same subnet?
Is there any firewall in between them?

When you see this erro, run these command
at server
# nfsstat -s
# mount -v
# netstat -s
run these command at client using
# rpcinfo -u nfs
# rpcinfo -s mountd
# rpcinfo -p

I see you started giving points to the replies for your questions which is good.
Maybe you can assign points for earlier replies later!!
Bob Carback
Frequent Advisor

Re: make_net_recovery slow

I am unable to run a command such as 'show-patches'. These are on the same subnet and there is no firewall. I see no messages in syslogd or dmesg. This is a new install -- HP-UX 11iv2 12/2005.
# nfsstat -s --- shows all zeros (0)

# mount -v
/dev/vg00/lvol3 on / type vxfs ioerror=mwdisable,delaylog,dev=40000003 on Wed Jan 25 10:56:28 2006
/dev/vg00/lvol1 on /stand type vxfs ioerror=mwdisable,log,tranflush,dev=40000001 on Wed Jan 25 10:56:30 2006
/dev/vg00/lvol8 on /var type vxfs ioerror=mwdisable,delaylog,dev=40000008 on Wed Jan 25 10:56:43 2006
/dev/vg00/lvol7 on /usr type vxfs ioerror=mwdisable,delaylog,dev=40000007 on Wed Jan 25 10:56:43 2006
/dev/vg00/lvol4 on /tmp type vxfs ioerror=mwdisable,delaylog,dev=40000004 on Wed Jan 25 10:56:43 2006
/dev/vg00/lvol6 on /opt type vxfs ioerror=mwdisable,delaylog,dev=40000006 on Wed Jan 25 10:56:43 2006
/dev/vg00/lvol5 on /home type vxfs ioerror=mwdisable,delaylog,dev=40000005 on Wed Jan 25 10:56:43 2006
uni-rx4640-03:/var/opt/ignite/clients on /var/opt/ignite/recovery/client_mnt type nfs rsize=32768,wsize=32768,NFSv3,dev=63000001 on Wed Jan 25 11:28:27 2006
uni-rx4640-03:/var/opt/ignite/recovery/archives/uni-rx2620-01 on /var/opt/ignite/recovery/arch_mnt type nfs rsize=32768,wsize=32768,NFSv3,dev=63000002 on Wed Jan 25 11:28:58 2006

# netstat -s
tcp:
8960 packets sent
7380 data packets (6785683 bytes)
1407 data packets (1862840 bytes) retransmitted
1579 ack-only packets (545 delayed)
1 URG only packet
0 window probe packets
0 window update packets
1870 control packets
10506 packets received
6212 acks (for 7457000 bytes)
1339 duplicate acks
0 acks for unsent data
4514 packets (2300370 bytes) received in-sequence
0 completely duplicate packets (0 bytes)
0 packets with some dup, data (0 bytes duped)
0 out of order packets (0 bytes)
0 packets (0 bytes) of data after window
0 window probes
544 window update packets
1 packet received after close
0 segments discarded for bad checksum
0 bad TCP segments dropped due to state change
488 connection requests
460 connection accepts
948 connections established (including accepts)
945 connections closed (including 23 drops)
22 embryonic connections dropped
4483 segments updated rtt (of 4483 attempts)
299 retransmit timeouts
0 connections dropped by rexmit timeout
0 persist timeouts
0 keepalive timeouts
0 keepalive probes sent
0 connections dropped by keepalive
0 connect requests dropped due to full queue
22 connect requests dropped due to no listener
0 suspect connect requests dropped due to aging
0 suspect connect requests dropped due to rate
udp:
0 incomplete headers
0 bad checksums
0 socket overflows
ip:
4706 total packets received
0 bad IP headers
0 fragments received
0 fragments dropped (dup or out of space)
0 fragments dropped after timeout
0 packets forwarded
0 packets not forwardable
icmp:
5 calls to generate an ICMP error message
0 ICMP messages dropped
Output histogram:
echo reply: 5
destination unreachable: 0
source quench: 0
routing redirect: 0
echo: 0
time exceeded: 0
parameter problem: 0
time stamp: 0
time stamp reply: 0
address mask request: 0
address mask reply: 0
0 bad ICMP messages
Input histogram:
echo reply: 10
destination unreachable: 2
source quench: 0
routing redirect: 0
echo: 5
time exceeded: 0
parameter problem: 0
time stamp request: 0
time stamp reply: 0
address mask request: 0
address mask reply: 0
5 responses sent
igmp:
0 messages received
0 messages received with too few bytes
0 messages received with bad checksum
0 membership queries received
0 membership queries received with incorrect fields(s)
0 membership reports received
0 membership reports received with incorrect field(s)
0 membership reports received for groups to which this host belongs
0 membership reports sent



# rpcinfo -u uni-rx4640-03 nfs
program 100003 version 2 ready and waiting
program 100003 version 3 ready and waiting

# rpcinfo -p uni-rx4640-03
program vers proto port service
100000 4 tcp 111 rpcbind
100000 3 tcp 111 rpcbind
100000 2 tcp 111 rpcbind
100000 4 udp 111 rpcbind
100000 3 udp 111 rpcbind
100000 2 udp 111 rpcbind
100024 1 tcp 49152 status
100024 1 udp 49153 status
100021 1 tcp 49153 nlockmgr
100021 1 udp 49156 nlockmgr
100021 3 tcp 49154 nlockmgr
100021 3 udp 49157 nlockmgr
100021 4 tcp 49155 nlockmgr
100021 4 udp 49158 nlockmgr
100020 1 udp 4045 llockmgr
100020 1 tcp 4045 llockmgr
100021 2 tcp 49156 nlockmgr
100068 2 udp 49162 cmsd
100068 3 udp 49162 cmsd
100068 4 udp 49162 cmsd
100068 5 udp 49162 cmsd
100083 1 tcp 49157 ttdbserver
100005 1 udp 49201 mountd
100005 3 udp 49201 mountd
100005 1 tcp 49196 mountd
100005 3 tcp 49196 mountd
100003 2 udp 2049 nfs
100003 3 udp 2049 nfs
100003 2 tcp 2049 nfs
100003 3 tcp 2049 nfs


Sameer_Nirmal
Honored Contributor

Re: make_net_recovery slow

Bob,

There are so many aspects one should check at network and nfs(itself) level.

I would advise to go through Dave Olker's NFS white paper. He is well known NFS Guru.
http://www.docs.hp.com/en/1435/NFSPerformanceTuninginHP-UX11.0and11iSystems.pdf

This would help you to check every aspect and find the cause of the issue.

Since the installed OS version is pretty much latest, I would suspect the cause like
incorrect network card speed settings ( half or full duplex) etc.
Luk Vandenbussche
Honored Contributor

Re: make_net_recovery slow

Hi,

What is your network setting

lanadmin -x 0?
Is it the same on your switch

Modify the ignite settings with

instl_adm -f

fe

instl_adm â d > /tmp/cfg

add the following entry to /tmp/cfg

_hp_lanadmin_args=â -X 100FDâ

instl_adm â f > /tmp/cfg
Bob Carback
Frequent Advisor

Re: make_net_recovery slow

the switch port and lan card on the server are both set to 100 FD.
This also happens when I try to scp from one of the servers to another unrelated server. I've even tried it on a couple different NIC cards on my server. I'll call support
Bob Carback
Frequent Advisor

Re: make_net_recovery slow

switch settings incorrect