Operating System - HP-UX


Oracle DB over NFS on Netapp only 35MB/sec

We experience low performance across NFS on our NetApp filer.
Throughput is limited to ~35MB/sec for table scans and also for a dd on the filesystem.

Our hardware consists of an rp7410 with 2x 1Gbit NICs and a NetApp FAS3050C with 4x 1Gbit (one aggregate).
The network equipment is Cisco 65xx switches.

Because of this I opened a call with HP and with NetApp. The specialists came to the conclusion that the behaviour of the HP-UX NFS client in combination with the NetApp filer causes this poor transfer rate.
We cross-checked this with an HP ProLiant server running Red Hat Linux on the same network and filer equipment; there we got a bandwidth of 100MB/sec.

A network trace taken on the filer shows the behaviour of both clients:

HP-UX Client:
15 0.000789 192.168.158.222 -> 192.168.158.55 NFS V3 WRITE Call, FH:0x4f3aacce Offset:0 Len:32768 UNSTABLE[Unreassembled Packet]
16 0.005762 192.168.158.55 -> 192.168.158.222 NFS [TCP ACKed lost segment] V3 WRITE Reply (Call In 15) Len:32768 FILE_SYNC
17 0.000659 192.168.158.222 -> 192.168.158.55 NFS V3 WRITE Call, FH:0x4f3aacce Offset:32768 Len:32768 UNSTABLE[Unreassembled Packet]
18 0.005389 192.168.158.55 -> 192.168.158.222 NFS [TCP ACKed lost segment] V3 WRITE Reply (Call In 17) Len:32768 FILE_SYNC

Linux Client:
10 0.112781 172.27.224.190 -> 172.27.240.55 NFS V3 WRITE Call, FH:0x36672f4a Offset:0 Len:32768 UNSTABLE[Unreassembled Packet]
43 0.114106 172.27.224.190 -> 172.27.240.55 NFS V3 WRITE Call, FH:0x36672f4a Offset:32768 Len:32768 UNSTABLE[Unreassembled Packet]
77 0.114556 172.27.224.190 -> 172.27.240.55 NFS V3 WRITE Call, FH:0x36672f4a Offset:65536 Len:32768 UNSTABLE[Unreassembled Packet]
112 0.114941 172.27.224.190 -> 172.27.240.55 NFS V3 WRITE Call, FH:0x36672f4a Offset:98304 Len:32768 UNSTABLE[Unreassembled Packet]
179 0.115494 172.27.224.190 -> 172.27.240.55 NFS V3 WRITE Call, FH:0x36672f4a Offset:163840 Len:32768 UNSTABLE[Unreassembled Packet]
214 0.115778 172.27.224.190 -> 172.27.240.55 NFS V3 WRITE Call, FH:0x36672f4a Offset:196608 Len:32768 UNSTABLE[Unreassembled Packet]
236 0.115979 172.27.240.55 -> 172.27.224.190 NFS V3 WRITE Reply (Call In 10) Len:32768 FILE_SYNC
237 0.115987 172.27.240.55 -> 172.27.224.190 NFS V3 WRITE Reply (Call In 43) Len:32768 FILE_SYNC
250 0.116062 172.27.224.190 -> 172.27.240.55 NFS V3 WRITE Call, FH:0x36672f4a Offset:229376 Len:32768 UNSTABLE[Unreassembled Packet]
283 0.116333 172.27.224.190 -> 172.27.240.55 NFS V3 WRITE Call[Unreassembled Packet]
318 0.116626 172.27.224.190 -> 172.27.240.55 NFS V3 WRITE Call, FH:0x36672f4a Offset:294912 Len:32768 UNSTABLE[Unreassembled Packet]
351 0.116897 172.27.224.190 -> 172.27.240.55 NFS V3 WRITE Call, FH:0x36672f4a Offset:327680 Len:32768 UNSTABLE[Unreassembled Packet]
371 0.117063 172.27.240.55 -> 172.27.224.190 NFS V3 WRITE Reply (Call In 77) Len:32768 FILE_SYNC

The explanation from support: while Linux sends multiple NFS write requests without waiting for acknowledgements, HP-UX sends a single NFS call and waits until the filer replies. At 6ms per write, that is the bandwidth limitation!
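As a back-of-envelope check of that explanation (my own illustrative arithmetic, not from the support case), one 32KB write per ~6ms round trip caps a strictly serial client at roughly 5.5MB/sec, so higher rates require keeping several writes in flight:

```python
# Throughput ceiling of an NFS write stream limited by round-trip time.
# Numbers taken from the trace above: 32KB writes, ~6ms per call/reply.
WRITE_SIZE = 32 * 1024   # bytes per WRITE call (wsize=32768)
RTT = 0.006              # seconds per call/reply pair

# One request outstanding at a time:
serial = WRITE_SIZE / RTT / 1e6
print(f"serial client: {serial:.1f} MB/sec")

# Outstanding writes needed to sustain a target rate:
for target in (35e6, 100e6):
    in_flight = target * RTT / WRITE_SIZE
    print(f"{target / 1e6:.0f} MB/sec needs ~{in_flight:.0f} writes in flight")
```

The gap between the ~5.5MB/sec serial ceiling and the observed 35MB/sec suggests the HP-UX client still achieves some pipelining via the biods, just far less than the Linux client's deep request queue.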

I also tried Oracle 11g with the Direct NFS client on HP-UX: this gives us 100MB/sec even on the rp7410.

So the explanation from support seems to be right: the HP-UX OS NFS client performs badly here.

Does anybody have an idea whether this behaviour of the OS NFS client can be changed, by patching, kernel parameters, or anything else?

I could not believe that Linux would outperform HP-UX in this discipline!

I'm looking forward to your answers!

Best regards,
Christian
rick jones
Honored Contributor

Re: Oracle DB over NFS on Netapp only 35MB/sec

It has been a very long time since I played with NFS, but back then (ca. HP-UX 9.X and 10.X) I was seeing multiple outstanding writes on a file.

I wonder whether biods are enabled, and whether the writes by the app are O_SYNC or something?

Down in the noise level, and not related (IIRC) to the one-at-a-time business: Linux (2.6 kernels anyway) will probably use TSO on the GbE interface. HP-UX by default will not, since HP-UX defaults CKO (ChecKsum Offload) to off for the 1Gbit NICs. It does default CKO to on for the 10Gbit NICs.
there is no rest for the wicked yet the virtuous have no pillows
Trevor Roddam_1
Valued Contributor

Re: Oracle DB over NFS on Netapp only 35MB/sec

Christian.

Not sure if this is going to help; however, HP-UX 11i v3 supports NFS v4 and earlier versions of HP-UX didn't, which may be part of the explanation. I'm just not up to date with NFS; it has been a while.

I'm assuming you have HP-UX 11i v1?

If possible, let us know what OS and/or NFS version you are running.

A quick test with 11i v3 may prove fruitful. I too have seen very poor performance using NetApp with HP-UX, but that was a while ago.
Baldric, I have a plan so cunning you could pin a tail on it and call it a weasel.

Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi all,

thank you for your quick response.
Here are some more details about our environment.
The rp7410 runs 11.11 with NFSv3 and the mount options NetApp and Oracle recommend.
Besides the rp7410 we also have newer machines in use, e.g. an rx3600 running 11.23 with multiple Gb interfaces.
For testing purposes I installed 11i v3 on another rx3600 and mounted the Oracle database with NFSv4, but the throughput was also not good.
Only with the Oracle Direct NFS client, whether on the rp7410 with 11i v1 or on the rx3600 with 11i v2/11i v3, do I get very good performance. But 11g is very new and is not yet supported by our applications.
As far as I understand the experts from HP and NetApp support, the whole story is about the behaviour of the HP-UX NFS client: no outstanding writes/reads.
By the way, I configured 16 biods on each machine and set these kernel parameters:

JAGad15675
nfs_new_lock_code = 1

JAGad72416
async_read_avoidance_enabled = 1

do local paging for binaries over NFS
page_text_to_local = 1

Are there any other kernel parameters which change the behaviour of the NFS client so that it keeps multiple writes and reads outstanding? I think that's what the Linux NFS client does, and also what the Oracle Direct NFS client does.

Do you need further info about our environment? Any suggestions?

Best regards,
Christian
Dave Olker
Neighborhood Moderator
Solution

Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi Christian,

A couple of things: we have a lot more kernel tunables, and therefore a much more flexible NFS client, on 11.23 and 11.31, so if you can run 11.23 on your client I'd suggest at least moving to that.

The fact that the HP client is sending write requests with UNSTABLE but the NetApp server is replying with FILE_SYNC is surprising to me. Why is the filer using FILE_SYNC semantics?

In any case, I'd be happy to work with you to improve the client's NFS performance. My first suggestion would be to test the throughput with a tool like iozone rather than via an application.

I've never had a problem driving my HP-UX clients to 100 MB/sec with Gigabit Ethernet. If you're able to run 11.23 on your NFS client, here are the tunable settings I recommend you use:


_______________________________________


o nfs_async_read_avoidance_enabled
This tells the NFS client to issue READ calls even if all the biods are busy servicing WRITE calls

Default Setting: 0 (Disabled)
Recommended Setting: 1 (Enabled)


o nfs_fine_grain_fs_lock
By default (0), the NFS client code uses a global system-wide semaphore to control access to many routines and data structures. This use of a global semaphore leads to a lack of parallel activity through many of the main NFS client code paths. When set to 2, the client avoids all use of this global filesystem semaphore and uses finer grained locks to protect critical code paths and data structures. The result is a much higher performing NFS client.

Default Setting: 0 (Use FS Semaphore in all code paths)
Recommended Setting: 2 (Avoid FS Semaphore in all code paths)


o nfs_new_lock_code
By default (0) when an NFS client places a lock on a file we turn off the biods and buffer cache for this file, effectively making all access to the file synchronous. When enabled (1) the client will enable the biods and buffer cache on locked files if the entire file is locked.

Default Setting: 0 (Disabled)
Recommended Setting: 1 (Use biods and buffer cache on locked files)


o nfs_new_rnode_lock_code
This instructs the NFS client to allow processes waiting to lock an rnode (NFS version of an inode) on the NFS client to be interrupted by ^C. By default these processes sleep in the kernel at a non-interruptible state.

Default Setting: 0 (threads are not interruptible while waiting to lock an rnode)
Recommended Setting: 1 (threads are interruptible while waiting to lock an rnode)


o nfs_wakeup_one
There are a couple of nasty thundering herd conditions in the NFS client code. Setting this tunable to 2 avoids both of them, dramatically reducing CPU contention on the system and increasing throughput.

Default Setting: 0 (both thundering herd conditions exist)
Recommended Setting: 2 (bypass both thundering herd conditions)


o nfs3_new_acache
By default (0) the NFS client uses a linear search when walking the list of credential structures associated with a given file or directory (i.e. all the users who want to look at a given NFS file or directory). When enabled (1), the NFS client uses a hashed algorithm which can greatly increase performance and reduce CPU overhead when many users attempt to access the same shared file or directory.

Default Setting: 0 (linear credential search)
Recommended Setting: 1 (hashed credential search)

_______________________________________


If you set the above tunables to the recommended values I'd be curious if the performance improves.
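A sketch of applying these on an 11.23 client with kctune (the tunable names and values are the ones listed above; some only take full effect after a reboot, especially if NFS filesystems are mounted when kctune runs):

```shell
# Apply the recommended NFS client tunables on HP-UX 11.23.
kctune nfs_async_read_avoidance_enabled=1
kctune nfs_fine_grain_fs_lock=2
kctune nfs_new_lock_code=1
kctune nfs_new_rnode_lock_code=1
kctune nfs_wakeup_one=2
kctune nfs3_new_acache=1

# Display the resulting values:
kctune nfs_wakeup_one nfs_fine_grain_fs_lock
```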

Regards,

Dave


I work at HPE
HPE Support Center offers support for your HPE services and products when and how you need it. Get started with HPE Support Center today.
[Any personal opinions expressed are mine, and not official statements on behalf of Hewlett Packard Enterprise]
Accept or Kudo

Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi Dave,

I'm very happy to hear from you. I have already read your book about HP-UX NFS and followed all of its recommendations. The book is based on 11.11, so the new features in 11.23 and 11.31 are not covered.

I opened a call about this performance problem 1.5 years ago. The discussion between the NetApp and HP experts (Mr. Holger Zessel, call# 1209123232-121) concluded that the combination of HP-UX client and NetApp NFS server does not work very well. The NetApp expert explained that HP-UX sends unstable writes but behaves as if it sends stable writes. Therefore the filer waits 5ms (timeout) after each request before sending the response to the "unstable write". Only after the response arrives on the HP-UX box is the next request sent! This limits the bandwidth on the GbE link to 15-25MB/sec. Oracle's Direct NFS client seems to work differently: with the same Oracle DB on the same HP-UX server (11.23), the data files on the same filer and the same mount points, I get about 100MB/sec, while the OS NFS client gives me only 20-25MB/sec.

My first task this morning was to try out your kernel parameter recommendations on a test server (rx3600, 11.23). But the throughput with the dd command, and also with Oracle, did not improve. I'm not familiar with iozone: which command line parameters should I use, and what output are you interested in?

What further details would be helpful?

I'm really looking forward to your answer!

Best regards,
Christian
Dave Olker
Neighborhood Moderator

Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi Christian,

> My first task today morning was to try out
> your kernel parameter recommendations on a
> test server (rx3600 11.23). But the
> throughput with dd command and also with
> Oracle does not grow.

Did you reboot the system after making these changes? Many of them only take effect after a reboot, especially if you have NFS filesystems mounted when the kctune command is used.

I tried reproducing your problem on my systems. I have an older NetApp filer and I used iozone to write a 1GB file to the filer from the following clients:

HP-UX 11.23
SuSE 9.2
Solaris 10

All of these clients are similar in hardware configuration. I used the same mount options on all 3 clients and here are the throughput results I saw:

HP-UX 11.23: 34901 KB/sec
SuSE 9.2: 32031 KB/sec
Solaris 10: 34337 KB/sec

So none of my systems push the GigE interface on the filer to wire speed, possibly because the filer is pretty old, as is the version of Data ONTAP running on it. But my point is that the 11.23 client gets throughput comparable to the Solaris 10 and SuSE clients.

Also, I looked at a Wireshark trace of the data transfers and my trace shows the NetApp filer replying to the write calls with the UNSTABLE flag rather than FILE_SYNC. I thought that was interesting.

If you want to play with iozone, you can download the source from iozone.org. It's pretty easy to compile and the syntax I was using to test with is:

# iozone -c -e -s 1g -r 32k -i 0 -+n -f /filer-1/testfile

Hope this helps,

Dave



Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi Dave,

Yesterday I did not reboot the system after the kernel parameter changes, but only unmounted and remounted the file systems.
Today I rebooted the system as you recommended. The dd write seems to give slightly better throughput than yesterday: ~30MB/sec versus ~25MB/sec.

Today I took a nettl trace of an Oracle select: the first trace with the Direct NFS client active and the second with the HP-UX NFS client.
The main difference I found is that Oracle sends two read calls in one packet. The answers from the filer also seem to come back faster.
I will attach the trace files: one with Oracle Direct NFS, another with Oracle and the HP-UX NFS client, and the last captured during iozone -c -e -s 1m -r 32k -i 0 -+n -f /mnt/testfile

I hope this information helps.

Best regards,
Christian
Dave Olker
Neighborhood Moderator

Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi Christian,

Since you have the test environment all set up, I'd like you to try testing with UDP instead of TCP just to see what kind of difference you see. If you could mount the filesystem with "-o proto=udp" and try the test again I'd be curious if you see different performance or behavior.

Regards,

Dave


Dave Olker
Neighborhood Moderator

Re: Oracle DB over NFS on Netapp only 35MB/sec

One other data point, I just installed MemFS on my 11.23 NFS server and configured a 2GB memory file system. I exported this file system to my 11.23 client and here's what I see when I use iozone:

Run began: Tue Dec 18 10:17:09 2007

Include fsync in write timing
Include close in write timing
File size set to 1048576 KB
Record Size 32 KB
No retest option selected
Command line used: iozone -e -c -s 1g -r 32k -i 0 -+n -t 1 -F /hp-2/iozone
Output is in Kbytes/sec
Time Resolution = 0.000001 seconds.
Processor cache size set to 1024 Kbytes.
Processor cache line size set to 32 bytes.
File stride size set to 17 * record size.
Throughput test with 1 process
Each process writes a 1048576 Kbyte file in 32 Kbyte records

Children see throughput for 1 initial writers = 111903.17 KB/sec
Parent sees throughput for 1 initial writers = 111882.39 KB/sec
Min throughput per process = 111903.17 KB/sec
Max throughput per process = 111903.17 KB/sec
Avg throughput per process = 111903.17 KB/sec
Min xfer = 1048576.00 KB


On my test system, an rx2600, I can push my GigE NIC to wire speed with NFS/TCP and iozone writing a 1GB file, provided the NFS server's filesystem is fast enough. Using a memory-based filesystem on the server helped me remove that bottleneck.

Just thought I'd pass that along.

Dave


Dave Olker
Neighborhood Moderator

Re: Oracle DB over NFS on Netapp only 35MB/sec

One other data point about how my NFS client and server are configured for TCP: I have the following entries in my /etc/rc.config.d/nddconf file:

TRANSPORT_NAME[0]=tcp
NDD_NAME[0]=tcp_recv_hiwater_def
NDD_VALUE[0]=1048576

TRANSPORT_NAME[1]=tcp
NDD_NAME[1]=tcp_xmit_hiwater_def
NDD_VALUE[1]=1048576

This sets the TCP send and receive window sizes to 1MB, which lets TCP advertise a larger window between the client and server. I don't know whether you're using the default window sizes or not, but I thought this was worth mentioning as well.

If you make changes like this you'd need to issue the command "ndd -c" to re-read the configuration file and activate the changes.
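Why the window size matters can be estimated from the TCP bandwidth-delay product: a sender can have at most one window of unacknowledged data in flight per round trip. This is my own illustration; the RTT values are assumed LAN numbers, not measurements from this setup:

```python
# Max single-connection TCP throughput = window_size / round_trip_time.
def ceiling_mb_per_sec(window_bytes: int, rtt_sec: float) -> float:
    return window_bytes / rtt_sec / 1e6

for window in (32 * 1024, 1024 * 1024):       # old default vs. 1MB
    for rtt_ms in (0.5, 1.0):                 # assumed LAN round trips
        mb = ceiling_mb_per_sec(window, rtt_ms / 1000)
        print(f"window {window // 1024:4d}KB, RTT {rtt_ms}ms: "
              f"{mb:7.1f} MB/sec ceiling")
```

With the old 32KB default and a 1ms round trip the ceiling is about 33MB/sec, interestingly close to the throughput Christian reported, while a 1MB window sits comfortably above GigE wire speed.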

Regards,

Dave



Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi Dave,

I set the ndd settings for the send and receive window sizes to 1MB as you recommended. No other changes were made in the nddconf file. Before this change the defaults were in effect, I think 32K.

Here are my results from the iozone write tests with tcp and udp mount points:

Include close in write timing
Include fsync in write timing
File size set to 1048576 KB
Record Size 32 KB
No retest option selected
Command line used: /iozone -c -e -s 1g -r 32k -i 0 -+n -f /mnt/test_iozone
Output is in Kbytes/sec
Time Resolution = 0.000003 seconds.
Processor cache size set to 1024 Kbytes.
Processor cache line size set to 32 bytes.
File stride size set to 17 * record size.

(Oracle-recommended mount options, except udp)
mount -o rw,bg,hard,nointr,suid,timeo=600,rsize=32768,wsize=32768,proto=,vers=3 flr-rbg04:/vol/ora11g/dbf /mnt

proto=udp: 69587 KB/sec
proto=tcp: 49298 KB/sec

I cross-checked iozone with the default window size of 32K:
proto=udp: 61113 KB/sec
proto=tcp: 44875 KB/sec

It looks like we are on a good path to improving performance and getting closer to the Oracle Direct NFS client.
But, by the way, UDP is not supported by Oracle 10g.

Best regards,
Christian
Dave Olker
Neighborhood Moderator

Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi Christian,

One of the enhancements we made to our NFS client specifically for Oracle was the "forcedirectio" mount option. It doesn't look like you're using this option currently. I don't know if it will help iozone performance, but it can help Oracle throughput in certain environments because it bypasses the buffer cache and lets Oracle do the caching on its own behalf.

Please try mounting the filesystem with the "forcedirectio" option and let me know how it affects both iozone and Oracle throughput.
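For reference, a sketch of such a mount (the options and the server path are taken from Christian's earlier example in this thread; adjust to your environment):

```shell
# Remount the Oracle datafile filesystem with forcedirectio, so the
# NFS client bypasses the buffer cache and Oracle does its own caching.
umount /mnt
mount -F nfs \
    -o rw,bg,hard,nointr,suid,timeo=600,rsize=32768,wsize=32768,proto=tcp,vers=3,forcedirectio \
    flr-rbg04:/vol/ora11g/dbf /mnt
```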

Regards,

Dave



Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi Dave,

I tried the forcedirectio option with iozone and Oracle. It seems that in our environment forcedirectio degrades throughput.
I also append the sysstat output from the filer during the tests. In that log, especially during the first test, I noticed that the throughput is very unstable: it varies between 0 and 72MB/sec!?!
By the way, yesterday while googling for Dave Olker I found another article, "Designing a High Performance NFS Server". There I read that TCP Segmentation Offload and Checksum Offload are disabled by default; I enabled both as you recommend in your guide. Actually I was wondering why these features are disabled by default.
Attached you will find the logs I recorded during the tests.

Best regards,
Christian
Dave Olker
Neighborhood Moderator

Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi Christian,

Yeah, that Dave Olker guy writes a lot of technical papers. He usually posts them on docs.hp.com once he finishes them so check there periodically for future ones.

Do you happen to have any 11i v3 systems in your test ring? We replaced the entire NFS client and server code in 11i v3 so I'd be willing to bet it would behave differently than the 11i v2 client. However, since my 11i v2 client is able to push my HP-UX server to full speed this seems like an issue specific to HP->NetApp.

I've heard in the past NetApp has recommended HP customers tune down the number of biods to 1 or 2 and that would give them better performance with filers. I always assumed that was because of the thundering herd and filesystem semaphore issues we had in our code that we recently resolved with the nfs_wakeup_one=2 and nfs_fine_grain_fs_lock=2 tunables. But there may be other quirks between our client and the filers that don't show up with other servers.

In any case, now that you've found the "Designing a High Performance NFS Server" paper, I'd suggest using all the tips/tricks outlined there (CKO, TSO, TCP windows, etc.). I'd also suggest trying the 11i v2 client with 1 or 2 biod daemons to see if that affects throughput at all. Finally, I'd really like to see how an 11i v3 client behaves in your environment.

I have 11i v2 and v3 systems in my test ring but my NetApp filer is too slow to see any performance differences in my tests.

Regards,

Dave




Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi Dave,

Actually we don't have an 11i v3 server, so I can't test with the new client.
I tried the tests with 2 biods and with 1, but no better performance. I think it is as you suggest: the communication between HP-UX and NetApp does not perform well.
I hope I will have time to install 11i v3 on the test server to try this combination.
Today is my last day in the office; I will be back on January 7th. I hope we can continue the discussion then.
I wish you a merry Christmas and a happy new year. Hope to hear from you next year!

Best regards,
Christian
Dave Olker
Neighborhood Moderator

Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi Christian,

Ok, thanks for confirming the "small number of biods" thing for me. I've heard that recommendation so many times over the years and it just never made any sense to me, especially after we removed the bottlenecks in our client code.

If you're able to get an 11i v3 client in there to test with, that would be great. I only wish my filer were fast enough to really show a difference between my 11i v2 client and my Solaris, Linux, and 11i v3 clients. Maybe someday I'll get one, but since you have one already set up I'd love to get confirmation that 11i v3 clients perform as expected in your environment, so we know to concentrate on the 11i v2 client.

I hope you have a very happy and safe holiday season,

Dave



Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi Dave,

back from holiday, and emptying my inbox over the last two days. I had a good time; I hope you did too!

I installed my test server (rx3600) with 11i v3 (09/07) without any patches. This is my first time with 11i v3.
By the way, I can't see any biods. Do you recommend any kernel parameters for this environment?
I changed the default values for nddconf and the network interface (checksum offload, ...) as on the 11i v2 installation.

I repeated the test with iozone.

iozone -c -e -s 1g -r 32k -i 0 -+n -f /mnt/testfile

Iozone: Performance Test of File I/O
Version $Revision: 3.283 $
Compiled for 64 bit mode.
Build: hpuxs-11.0w

Contributors:William Norcott, Don Capps, Isom Crawford, Kirby Collins
Al Slater, Scott Rhine, Mike Wisner, Ken Goss
Steve Landherr, Brad Smith, Mark Kelly, Dr. Alain CYR,
Randy Dunlap, Mark Montague, Dan Million,
Jean-Marc Zucconi, Jeff Blomberg, Benny Halevy,
Erik Habbinga, Kris Strecker, Walter Wong.

Run began: Wed Jan 9 13:57:28 2008

Include close in write timing
Include fsync in write timing
File size set to 1048576 KB
Record Size 32 KB
No retest option selected
Command line used: /iozone -c -e -s 1g -r 32k -i 0 -+n -f /mnt/testfile
Output is in Kbytes/sec
Time Resolution = 0.000002 seconds.
Processor cache size set to 1024 Kbytes.
Processor cache line size set to 32 bytes.
File stride size set to 17 * record size.
              KB  reclen   write
         1048576      32   37498

The output of sysstat 1 on the filer during the iozone test (it looks like write performance varied between ~5 and ~66MB/sec):

CPU NFS CIFS HTTP Net kB/s Disk kB/s Tape kB/s Cache
in out read write read write age
38% 9 0 0 2 2 35700 0 0 29295 3
22% 82 0 0 2857 67 91548 0 0 88801 2
37% 1758 0 0 60742 1414 35652 9796 0 28770 2
57% 1294 0 0 43838 1030 59884 74704 0 57868 2
51% 1951 0 0 66875 1564 64525 9201 0 59478 2
57% 170 0 0 5794 136 65106 114475 0 62985 2
57% 1888 0 0 64692 1514 89545 14000 0 85942 2
53% 647 0 0 22181 519 64939 78332 0 64427 2
50% 1976 0 0 67705 1583 63628 17223 0 58596 2
56% 718 0 0 24702 578 67984 76218 0 65081 2
59% 1837 0 0 62877 1471 87709 27107 0 85465 2
46% 343 0 0 11819 277 73807 58543 0 70166 2
52% 1955 0 0 66961 1567 50421 34118 0 47175 2
50% 690 0 0 23653 553 58146 66050 0 56174 2
55% 1913 0 0 65517 1534 64556 34976 0 60228 2
40% 464 0 0 16416 380 37040 75440 0 34144 2
28% 861 0 0 28972 680 39828 17664 0 36700 2
60% 1540 0 0 52791 1236 48636 4990 0 48142 2
50% 1809 0 0 62490 1460 56860 9016 0 50725 2
46% 0 0 0 0 0 37828 132824 0 34341 2
43% 1966 0 0 66855 1566 37730 21483 0 34765 2
46% 1110 0 0 38551 908 35738 57417 8 33093 2
46% 1344 0 0 45537 1068 37624 63232 0 33817 2
44% 953 0 0 33172 772 45084 54076 0 40829 2
48% 1842 0 0 62603 1467 37388 45160 0 33948 2
45% 719 0 0 24695 578 45557 69462 0 42693 2
52% 1680 0 0 58084 1355 67968 24402 0 65740 2
53% 1090 0 0 36839 865 40776 65580 0 37814 2
77% 96 0 0 3148 76 46094 9339 0 39741 2
25% 0 0 0 0 0 83600 0 0 82248 2
12% 0 0 0 0 1 49176 0 0 48693 2


Best regards,
Christian

PS: I can test 11i v3 only until the end of next week; then the machine will be reinstalled with 11i v2 and go to production.
Jörg Brandenburger
Occasional Contributor

Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi Christian

we are running Oracle 10.2 (SAP) on HP-UX 11.23 Integrity servers, with the DB located on NetApp via NFS.

Running the same test on a 3020 NearStore (the aggregate is built with 38 SATA disks of 500GB each):

iozone -c -e -s 1g -r 32k -i 0 -+n -f /mnt/testfile

KB reclen write
1048576 32 77178

I think this is not bad.

mount options are same as yours.
nfs rw,bg,hard,nointr,rsize=32768,wsize=32768,proto=tcp,vers=3,suid,timeo=600 0 0

NetApp vol options

nosnap=on, nosnapdir=on, minra=on, no_atime_update=on, nvfail=on,
ignore_inconsistent=off, snapmirrored=off, create_ucode=off,
convert_ucode=off, maxdirsize=20971, schedsnapname=ordinal,
fs_size_fixed=off, guarantee=file, svo_enable=off, svo_checksum=off,
svo_allow_rman=off, svo_reject_errors=off, no_i2p=off,
fractional_reserve=100, extent=off, try_first=volume_grow

Nothing special tuned.

Best regards
Joerg
Dave Olker
Neighborhood Moderator

Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi Joerg,

Thanks for sanity-checking this. My one NetApp filer is capable of only about 40MB/sec regardless of which NFS client I use, and I get the same throughput from all clients.

I wonder if you'd get even better throughput from your 11.23 client if you tried the various tuning suggestions I listed earlier in this thread...

Also, can you tell me exactly what command you used to display the detailed options for the NetApp filesystem?

Thanks,

Dave



Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi Jörg,

thank you for the information. It seems the problem lies neither with NetApp nor with the HP-UX NFS client alone.
My latest tests with 11i v3 didn't show any difference from 11i v2. You have minra=on activated on this volume; if I do that, performance degrades to less than 10MB/sec.
By the way, Dave: the command on the filer is vol options. The filer I currently use for testing is a FAS3020 with an aggregate of 42 SATA disks of 250GB each.
You are a lucky man if you get that throughput without any tuning. Do you have any suggestions where the bottleneck could be?
Should I post the exact configuration of the filer and the network? More and more, my suspicion is that the problem lies in our network configuration.

Best regards,
Christian
Jörg Brandenburger
Occasional Contributor

Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi Christian, hi Dave,

let's talk a little about my setup, my problems, and where we come from.

We have many SAP installations based on Oracle, most on HP-UX (Integrity only) and Linux (Red Hat on HP ProLiants).

Until last year all installations were based on two EVA5000 storage systems (168 pure fibre disks each), all FCP-connected.
Now we are moving to NetApp (NFS only for now): the primary systems on a FAS980c stretched MetroCluster, plus one 3020 NearStore, SATA only.

Test and quality systems are located on the NearStore only: a different tier class, where top performance is not needed.

I think we have two so-called "profiles" for accessing the data: one mode of operation is more sequential (backup, restore, copy); the other is how Oracle accesses the data. When I run tests I don't know how to reproduce the Oracle access pattern (sorry, not an Oracle guy).

About twice a year we copy the productive system over to the quality systems, and now I think the newly copied systems run with bad performance. Crazy, because the restore runs faster: 4.5 hours for 1.6TB is fine.

So I snooped around and found your article while searching for mount options. I read http://www.netapp.com/library/tr/3557.pdf and found "forcedirectio", but it seems that will not get us out of the performance problems.

Network and performance for sequential I/O must be OK, because the restore runs fine.

So I am not a lucky man :-(

Best regards
Joerg Brandenburger


Dave Olker
Neighborhood Moderator

Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi guys,

I spent the past week helping with a performance problem between HP-UX clients and a NetApp filer. I thought I'd pass along a couple of tunables I found on 11i v3 that helped sequential read performance tremendously.

# kctune -S
Tunable Value Expression Changes
nfs3_bsize 1048576 1048576 Immed
nfs3_max_threads 16 16 Immed
nfs3_nra 16 16 Immed

When my customer increased these values on their 11i v3 client, they found that performance with the filer increased dramatically. I thought I'd pass this along in case the same tunables might help your performance with filers.
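A sketch of setting these online on an 11i v3 client (same names and values as the kctune -S output above):

```shell
# Raise the NFSv3 client block size, async thread count, and read-ahead.
kctune nfs3_bsize=1048576      # 1MB client I/O size
kctune nfs3_max_threads=16     # async worker threads
kctune nfs3_nra=16             # read-ahead requests
```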

Regards,

Dave



Re: Oracle DB over NFS on Netapp only 35MB/sec

Hi Dave,

thank you very much for providing that information. Last week we set up a new rx3600 cluster under 11.31.
I changed the kernel parameters you provided. In the next few weeks we will migrate the first Oracle DB onto this cluster.
I'm looking forward to seeing how performance improves there.

Best regards,
Christian
Steven E. Protter
Exalted Contributor

Re: Oracle DB over NFS on Netapp only 35MB/sec

Shalom Chris,

I would expect good results from Dave's recommendations.

I would not, however, expect blinding speed, especially with a write-intensive database.

Though Oracle supports NFS now, they still recommend that redo, index, and data be on RAID 1 disk. You may need to look at the underlying disk configuration of your NetApp to squeeze better performance from this setup.

SEP
Steven E Protter
Owner of ISN Corporation
http://isnamerica.com
http://hpuxconsulting.com
Sponsor: http://hpux.ws
Twitter: http://twitter.com/hpuxlinux
Founder http://newdatacloud.com