- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - OpenVMS
- >
- Re: Very bad performance on disk with a lot of fil...
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
07-31-2006 11:27 PM
07-31-2006 11:27 PM
Re: Very bad performance on disk with a lot of files and directories
this high CPU tick rate seems to indicate, that maybe even the CPU has become the bottleneck ! There is also a quite substantial Window Turn rate (about 12/sec).
If we assume nothing else was going on on the ICR disk, a file system read rate of 180 and an IO rate of 176 on that disk seem to match pretty well.
Volker.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
07-31-2006 11:37 PM
07-31-2006 11:37 PM
Re: Very bad performance on disk with a lot of files and directories
And where are the about 2000 IO's to the other disks ? Solved by xfc ?
Wim
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
07-31-2006 11:39 PM
07-31-2006 11:39 PM
Re: Very bad performance on disk with a lot of files and directories
MONI FCP only shows IOs issued by the File System (XQP). All the other IOs on all the other disks will be application IOs not involving the XQP.
Volker.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-01-2006 03:30 AM
08-01-2006 03:30 AM
Re: Very bad performance on disk with a lot of files and directories
I have tried to put all the information you have asked for in the attached textfile.
Both disks are on the same system. They are howeven on 2 different HSG's
We initially initialized this disk in a way that it could accomodate a whole lot of files.
init/nohigh /block=
The degree of changes on this disk is not very great, about 2000-3000 files are added monthly.
I don't think LD would help, when using LD, I have to make a full backup of the whole every day, this also takes a lot of time (size=300GB). We also thought about making a physical backup of this disk, but that has the same drawbacks.
I however am willing to try both options.
When I look at the File-ID's, they seem quite sequential.
It is true that DFU is much faster in reading the directories, that is because DFU accesses the directories in a intelligenter way. This doesn't my solve my backup problem, which was the reason I started this thread.
I am still a bith lost in solving this problem. Is there any solution to this problem. Could ODS-5 be an option.
I also have the option to migrate this disk to an EVA5000 storage system. It is nog a problem to do this, but I would like to be sure that things will get better doing that.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-01-2006 03:54 AM
08-01-2006 03:54 AM
Re: Very bad performance on disk with a lot of files and directories
Backup 300Gb in one (or some small numer) of files (and LD containers ARE just 1 file) should be done at about the max speed of your driev (tape or disk - the slower is determining).
That should beat the MANY,MANY, directory and header Lookups by a considerable factor!
hth
Proost.
Have one on me.
jpe
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-01-2006 05:51 AM
08-01-2006 05:51 AM
Re: Very bad performance on disk with a lot of files and directories
please consider to look at the STDT data for QF Seen and Seq Tmo (SDA commands given above) to rule out any HSG80 overload problems.
If your 2 big disks are so similar, then the achieveable IO rates should also be similar, which apparently is not the case. If you run a DIR/SIZ=ALL/GRA disk:[*...] on your 'well performing' disk, what kind of disk IO rate or File System Read rate and CPU Tick Rate do you get ? On the ICR disk, you seem to even saturate the CPU (FCP CPU Tick Rate = 86%).
Can you look at old accounting data/batch log files for your backup of the ICR disk. As the elapsed time has increased, did Direct IOs and CPU time also increase at the same rate ?
Another possible test would be to use a disk performance test utility (like DISKBLOCK or DISKPERF) to check the achievable single block synchronous read IO rate on those 2 disks in your given configuration.
The AUTOGEN report shown seems to indicate an unusal situation regarding feedback data for ACP_HDRCACHE (Too many ACP hits and attempts have occurred).
Do not bother to try to convert the disk to ODS-5 or copy to the EVA, before you understand the underlying problem.
Volker.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-01-2006 06:42 AM
08-01-2006 06:42 AM
Re: Very bad performance on disk with a lot of files and directories
This worries me. There must be something wrong. Do you know how many files there were when it took 4 hours ?
Wim
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-01-2006 08:19 AM
08-01-2006 08:19 AM
Re: Very bad performance on disk with a lot of files and directories
From the let's touch all bases department: Please check the HSG-80 and make sure the RAID-5 unit has all members present. Has there been any change in the physical disks?
Andy
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-01-2006 09:03 AM
08-01-2006 09:03 AM
Re: Very bad performance on disk with a lot of files and directories
Here's my â ¬ 0,02:
I assume you spoke to Menco, who signaled a similar performance decrease about a year ago. I don't recall if it was the same disk, but i'm sure it was the same system.
He was messing with DFU and compressing directories. IIRC solved the problem.
Some other questions pop to mind. I assume you've already concidered them, but the worst question is the one not asked:
Are you sure you are using your HBA's and HSG controllers as efficiently as you can, spreading the load ?
Are there any other systems backing up from the same HSG / controller at the same time ? Maybe even using the same controller ?
Regards
Martin
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-01-2006 11:19 PM
08-01-2006 11:19 PM
Re: Very bad performance on disk with a lot of files and directories
Only a pitty that all caching is disabled. Only header would have been better (diesctory files are re-used).
FWIW
Wim
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-10-2006 09:22 PM
08-10-2006 09:22 PM
Re: Very bad performance on disk with a lot of files and directories
I have tried/checked several of the hints given in the various replies:
- mounting with nocache
- checked the load on the HBA's / HSG80's --> no problem
- ..
This didn't solve the problem.
In the beginning of this week we had to reboot this server and we then took the opportunity to AUTOGEN the system.
We did the AUTOGEN not based fully upon the Feedback, but started with INITIAL and added then several parameters to get the situation we desired.
After this AUTOGEN and reboot.
We started the backup on this disk en and it only ran for 2h:45m!!!!
So I can only conclude that the problem was in the tuning of the system, I assume it's the ACP_ parameters that dit the trick.
I have added the changed parameters in which you can that these parameters are increased.
Also the paged pool has increased quite a bit, this could also be of influence.
So for now our problems are solved and I would like to thank everybody that helped me in solving this problem.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-10-2006 10:20 PM
08-10-2006 10:20 PM
Re: Very bad performance on disk with a lot of files and directories
thanks for updating this thread with your findings. Looking at the system parameter changes, nothing jumps out. Maybe even a clean reboot (without changing anything) would have 'solved' the problem.
Consider to closely watch the backup performance on that disk and try to archive accounting information (like DIRIOs, elapsed CPU time and elapsed time), so you could start comparing them, should the problem show up again in the future.
Volker.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-11-2006 12:21 AM
08-11-2006 12:21 AM
Re: Very bad performance on disk with a lot of files and directories
Allthough I'd feel bad about rebooting fixing it (sort of against the VMS spirit), that is more likely in my opinion.
Did you use BACK/IGNOR=INTERLOCK?
Our friend Mark Hopkins described a situation he analyzed to me which had an interesting performance cliff.
If /ignore=interlock is used for backup then backup will re-visit the file header after copying the data. This revisit is normally a hit in the ACP caches. However, if you manage to open just enough files then the later files in a backup batch might push out the headers for a first files causing all headers to be re-read. This can cause 1/2 more IO, and they can be 'bad' IOs with large seeks.
Please realize that for smallish sized files, like you have, there will be just 1 data IO and 1 header IO per file and that would become 2 headers IO for just one data.
REDUCING the quotas for the account used in the backup woudl help in such case.
Is your backup account 'old school' (thousands of ASTs/DIRios) or new school (Hundreds) ?
(small: 400M blocks space, 100M free, 4M files suggest a max of 300/4 = 75 block per file).
hth,
Hein.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-11-2006 01:19 AM
08-11-2006 01:19 AM
Re: Very bad performance on disk with a lot of files and directories
another thought crossed my mind:
When was that node last rebooted before ? You described, that the time used to backup this disk kept increasing over time. If there has been a reboot inbetween and the backup time still kept increasing, forget about this idea, otherwise:
If nonpaged pool becomes severely fragmented, this could cause a massive CPU load when allocating/deallocating packets. Note that the MONI FCP CPU Tick Rate was quite high on your system.
If this problem shows up again, use SHOW MEM/POOL/FULL and look at the difference between:
Number of Free Blocks - Free Blocks on Lookasides
The difference will be the number of nonpaged pool packets in variable length pool. If this number becomes significant (many thousand), this will cause a dramatic effect on the CPU time.
Volker.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-11-2006 02:17 AM
08-11-2006 02:17 AM
Re: Very bad performance on disk with a lot of files and directories
NPAG_GENTLE
NPAG_AGGRESSIVE
turns off an algorithim which has been known to lead to excessive nonpageed pool fragmentation (extreme cases of which result in SPINWAIT bugchecks).
These are dynamic parameters so it can be done without a reboot. In VMS V8.2 the default value of these was chanegd to be 100.
Purely Personal Opinion
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-11-2006 03:46 AM
08-11-2006 03:46 AM
Re: Very bad performance on disk with a lot of files and directories
In the last 2 years I found several of these cases simply by keeping a list of top IO images (VPA).
Wim
- « Previous
-
- 1
- 2
- Next »