- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - HP-UX
- >
- Monitoring systems
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-30-2001 01:38 AM
08-30-2001 01:38 AM
I am writing a script to monitor system resources and mail me warnings if certain parameters are exceeded.
My list, limit and collection method so far is :-
= 0% idle on CPU (sar -u)
> 80% user on CPU (sar -u)
> 15 % wio (sar -u)
> 50% busy (sar -d)
avwait > avserv (sar -d)
>= 90% rcache (sar -b)
>= 70% wcache (sar -b)
Does anything else come to mind?
Any suggestion gratefully received.
If you would like the completed script please Email me:-
paula@avro.co.uk
;^)
Paula
Solved! Go to Solution.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-30-2001 02:07 AM
08-30-2001 02:07 AM
Re: Monitoring systems
You should setup your script to be similar to the alarm thresholds in measureware. HP have spent some time working out good thresholds for alarms.
Heres the alarmdef file from that as an example (the one we use on all our servers);
[attached]
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-30-2001 02:08 AM
08-30-2001 02:08 AM
Re: Monitoring systems
we also check "sar -v" and monitor nfile and nproc usage.
regards,
Thierry.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-30-2001 02:14 AM
08-30-2001 02:14 AM
Re: Monitoring systems
I am also interested in writing scripts to do my own sort of performance monitoring, or collect system metrics.
That's why I would be interested in your scripting efforts too since I'm not very experienced with HP-UX systems.
Unfortunately I haven't found time yet to get familiar with the adviser's syntax because I think if you already have MWA running you could benefit from its logging and alarm triggering mechanisms.
Have a look at the extract manpage to find out how to retrieve data from the MWA logfiles.
At the moment I attempt to get the CPAN module Perf::ARM installed, but still experience errors during the make test.
If you are into Perl you may find this module useful.
You may reach the module's author under
Regards
Ralph
ralph.grothe@lit.verwalt-berlin.de
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-30-2001 02:35 AM
08-30-2001 02:35 AM
Re: Monitoring systems
MeasureWare (OV performance agent) has a very sophisticated alarming capability that
allows you to configure not just thresholds but also alarms based on
multiple metrics, symptoms and duration. This makes it very effective in
only producing alarms when there really are alarms, or to proactively alarm
before a real problem exists
Later,
Bill
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-30-2001 07:31 AM
08-30-2001 07:31 AM
SolutionIf you want to develop something yourself (and that's fun, informative, and a skill-builder) then I'd add high-water marks for critical system tables ('sav -v' as Thierry suggested, too) and certainly for filesystem utilization ('bdf').
You'll want to record thresholds and alert as the threshold you define is exceeded. Give consideration to how you will send a second or third alert if the "water" continues to rise and/or if the level stays constant but the situation persists.
With my regards!
...JRF...
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-30-2001 07:52 AM
08-30-2001 07:52 AM
Re: Monitoring systems
You hit the nail on the head "fun, informative, and a skill-builder".
Plus I can build it to suit me and my systems.
;-)
Paula
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-30-2001 09:13 AM
08-30-2001 09:13 AM
Re: Monitoring systems
CPU threshhods should not be single event alarms. I hope you are using teh everage over an extended monitoring period. (1-5 minutes)
Run queue thressholds should be averaged over an even longer period. (2-10 minutes)
vmstat is your friend: look fro free page list getting too small and paging rate getting too high.
top directed to a file will let you grep out memory usage and high CPU processes.
If you have any problems with memory leaks, use ps with the XPG4 environment to capture memory sizes greater than the expected values.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-30-2001 11:29 AM
08-30-2001 11:29 AM
Re: Monitoring systems
to add something more to your script (and yes, I am
interested in it, of course ;-) I would include even more
"sar": "sar -a" and check for excessive directory I/O
(but it seems like the "dirb/s" have been dropped in 11i)
and even some "find", to search for huge directories
(even in 11i that's still a resource hog), c/b devices
outside of "/dev", sticky bits outside your fixed list
of "official" programs.
Then, how about checking "nfsstat" if you are using NFS?
And even "grep" on the "/etc/mnttab" to find NFS-mounts
using silly blocksizes, and such?
Hopefully this thread does go on for while!
My first ?0.02 on this,
Wodisch
PS: please mail to wodisch@wodisch.de, thanks!