1834745 Members
2965 Online
110070 Solutions
New Discussion

A healthy system

 
Shahul
Esteemed Contributor

A healthy system


Hi,

I have a bit of performance concerns now. Could some one explain me when we can say a system is healthy/not healthy. Yes, if the processor usage beyond 90% always, I can say system is loaded, but what about Memory consuption, like page out, page in, paged out, paged in...etc. What should be the value of all these in a healthy system.

Replys are highly appreciated.

TIA
Shahul
9 REPLIES 9
Pete Randall
Outstanding Contributor

Re: A healthy system

Shahul,

My goal as far as memory is concerned is to never swap. I look at swapinfo -tam and want to see device swap utilization of 0%. If it's more than that, you've got performance problems and the only way to really address these problems is to add more memory.


Pete

Pete
Joseph Loo
Honored Contributor

Re: A healthy system

hi,

there is really no hard and fast rule for this. but i guess you know how to check your memory via vmstat command if you do not own glance.

# vmstat 5 20
which i will look out for free and po column. you definitely do not want a low free and more than 0 po. do note that page is about 4kb of the physical memory.

regards.
what you do not see does not mean you should not believe
Bharat Katkar
Honored Contributor

Re: A healthy system

I would start analysizing the system in followig order whatever the symptons are in following order:
1. Buffers
2. Memory
3. Disk
4. CPU

See the attached doc for more details.
You need to know a lot to actually know how little you know
Bharat Katkar
Honored Contributor

Re: A healthy system

This doc talks about Memory Management
You need to know a lot to actually know how little you know
Tim Howell
Frequent Advisor

Re: A healthy system

Thanks to Bahrat for these documents!!! They are most helpfull!
if only we knew...
Ted Buis
Honored Contributor

Re: A healthy system

You may also find the book by Robert F. Sauers useful, titled "HP-UX Tuning and Performance", which you can find at Amazon.com. Make sure you get the 2nd edition that covers 11i.
Mom 6
Geoff Wild
Honored Contributor

Re: A healthy system

Depends a lot on what the system is doing...

In general though Page Outs are bad...

On top of all that, it really is your user's voice - if they think everything is fine - then no need to fix.

If they say it was better 6 months ago - then you need to look into it....

On a heavy used system - say running a 1.2 TB Oracle DB and SAP, having all the memory almost consumed, some stuff in swap, may be normal...

For example, on one of my servers:

# memdetail
Memory Stat total used avail %used
physical 10080.0 10027.9 52.1 99%
active virtual 13453.6 3867.9 9585.7 29%
active real 8047.8 2343.3 5704.5 29%
memory swap 7697.1 1417.2 6279.9 18%
device swap 26528.0 12719.8 13808.2 48%

is perfectly normal - response times from SAP are below SAP reccomendations - users are happy....

My cpu load is neglegible:

System: pc0003 Wed May 5 11:43:38 2004
Load averages: 0.94, 1.50, 1.51
589 processes: 550 sleeping, 39 running
Cpu states:
CPU LOAD USER NICE SYS IDLE BLOCK SWAIT INTR SSYS
0 0.77 3.2% 11.1% 8.3% 77.5% 0.0% 0.0% 0.0% 0.0%
1 0.83 2.8% 10.7% 9.9% 76.7% 0.0% 0.0% 0.0% 0.0%
2 0.76 2.8% 12.3% 10.3% 74.7% 0.0% 0.0% 0.0% 0.0%
3 1.18 2.4% 13.0% 10.1% 74.5% 0.0% 0.0% 0.0% 0.0%
5 1.17 1.6% 15.0% 10.1% 73.3% 0.0% 0.0% 0.0% 0.0%
--- ---- ----- ----- ----- ----- ----- ----- ----- -----
avg 0.94 2.6% 12.5% 9.7% 75.3% 0.0% 0.0% 0.0% 0.0%

I have no PO's...


Again, a lot of what you are asking really depends on what the system is doing...

Rgds...Geoff
Proverbs 3:5,6 Trust in the Lord with all your heart and lean not on your own understanding; in all your ways acknowledge him, and he will make all your paths straight.
Hemanth Gurunath Basrur
Honored Contributor

Re: A healthy system

Hi Shahul,

System Healthcheck product may help you in determining whether your system is healthy or not. You can download the product from http://www.software.hp.com/portal/swdepot/displayProductInfo.do?productNumber=SHCBASE01

HTH.

Regards,
Hemanth
Brian M Rawlings
Honored Contributor

Re: A healthy system

Interesting discussion. You want your system to be busy, providing optimal use of an expensive resource. But, at some point, it is too busy, and other expensive resources (staff, customer loyalty, competitive edge) is jeapordized.

If your SAP system is like others I've seen (any computing environment, for that matter), useage is a very peaky thing. I like to see some headroom during the average times, to allow for the "1PM syndrom", where everybody comes back from lunch at once, and all indicators go to 100% for an hour.

If you are never paging, all day, ever, it is a good indicator that you have sufficient headroom on memory size. Paging provides an interesting performance curve. Performance stays pretty much constant/acceptable until you start paging. At that point, it can suddenly drop dramatically for everyone. If you avoid paging, or just see a tiny bit at the worst peaks, memory is not your issue.

Having busy CPUs, processor usage around 90% as you mention, tells me that if you watch it very long, you will see peaks hitting 100% from time to time (and drops to 60% from time to time). Again, the key is how long you are at 100%, and how often. If your CPU utilization rarely maxes out, and only for short times, and only during the worst times (8AM, 1PM, or unspecified other for your particular work environment), then that is probably on the healthy side, and you are getting your money's worth out of that system.

I hate to see it that close, though, because the slightest growth, and all of a sudden you are at 100% lots of the time, and very frequently. Like the paging, this generally means a sudden dramatic impact to users, which directly correlates to phone calls and emails and boss visits that we'd all prefer to avoid.

Then there is disk I/O. The good news for you is that it is very hard to have all three elements of the system under equal stress, CPU, memory, and I/O. If you have high CPU usage, you are probably not I/O bound. Or, you won't know that you are until you add a CPU, and see your CPU usage drop to 55% average, but see no apparent performance increase.

One bottleneck generally hides others, because they can't be seen until the most prominent one is removed. That's why I generally recommend to add memory when adding a CPU. "Why, we don't have memory issues?" Well, not that you can see, anyway. Time and again, I've helped customers with CPU upgrades, only to help them with a (generally unbudgeted) memory upgrade a few weeks or months later, since the CPU addition didn't do much for them except reveal the "other" bottleneck.

Sorry this is so general, but the question has general applicability. If I were you, looking at 90% CPU utilization (presuming this is average, not peak), I'd be uncomfortable with that, and would be pinging management for more (or faster) CPUs in the upcoming budget cycle.

Lacking particulars on the memory situation (although you look to be OK right now), I'd look at adding memory along with the processor(s), to keep from hitting the snag that experience tells me you may be ready to hit.

When it comes to memory, there are some interesting gotchas, depending on what type of server you have, how the memory is distributed across the memory buses, and how well it is interleaved. Lots of memory boards are better than few, and DIMM counts should be in powers of two (like 4 or 8 or 16, never 3 or 7). The HP 9000 config guides call for adding memory in large chunks for best performance, which irks a lot of people. It seems very self-serving on HP's part, to sell lots of memory.

That turns out not to be the case, there really are valid, technical reasons for the guidelines. Unfortunately, the magic of memory interleaving only comes into play for applications that do lots of sequential memory access. Of the scores of processes running in your system, some do this much of the time, and some are too random for interleaving to be a factor, and some do it well some of the time, but change their usage pattern under heavy load or other circumstances. So, how much will a well designed, properly interleaved memory scheme help with performance? Nobody can tell, without exhaustive testing of YOUR app, in YOUR environment, with various memory configurations being stressed by YOUR users.

It's enough to make a system architect weep. But real geeks don't cry...

If you post back with your memory configuration, and your server class & some other details (CPU counts, system board counts in RP7410/8410, I/O card counts and types), I'll try to post some more concrete suggestions. Plus, others may point out things I haven't touched on yet (LVM striping for RAID array data, alternating PV Links, etc).

Best Regards,

--bmr
We must indeed all hang together, or, most assuredly, we shall all hang separately. (Benjamin Franklin)