"Load" increasing and machine locking up
07-09-2004 03:57 AM
It will allow commands to be run but no processes to be killed. Eventually it locks up.
sar shows one disk with a high average service time of 300 ms, which is way above normal. Does this behaviour ring any bells with anyone?
Many thanks, Garry
07-09-2004 05:22 AM
Re: "Load" increasing and machine locking up
The first thing I would suggest is to check the patch levels of your machine. Looks like you are way behind.
If you have Glance installed, start a terminal session of GlancePlus after a reboot and watch where it starts to choke. At least it will give you an indication of the culprit for the failure. Watch the meter bars at the top for CPU, disk, memory and swap utilization, and see what reaches 100% or close to it just before the machine totally locks up.
A starting point at least.
UNIX because I majored in cryptology...
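If the meter readings are noted down or logged while watching, the "what reaches 100% first" check can be sketched in a few lines of Python. This is only an illustration: the sample format (one dict of percentages per observation), the resource names, and the 95% cut-off are assumptions, not anything Glance itself produces.

```python
def first_saturated(samples, threshold=95.0):
    """Return (index, [resource names]) for the earliest sample in which
    any resource is at or above `threshold` percent, or None if none is.

    `samples` is a list of dicts mapping a resource name to a utilization
    percentage; this layout is hypothetical and filled in by hand here.
    """
    for index, sample in enumerate(samples):
        hot = sorted(name for name, pct in sample.items() if pct >= threshold)
        if hot:
            return index, hot
    return None


if __name__ == "__main__":
    # Made-up readings taken in the minutes before a lockup.
    readings = [
        {"cpu": 20.0, "disk": 40.0, "mem": 60.0, "swap": 10.0},
        {"cpu": 15.0, "disk": 85.0, "mem": 61.0, "swap": 10.0},
        {"cpu": 5.0, "disk": 100.0, "mem": 62.0, "swap": 10.0},
    ]
    print(first_saturated(readings))
```

Whichever resource shows up first is only a starting point for further digging, as the post says.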
07-09-2004 05:47 AM
Re: "Load" increasing and machine locking up
Yes, your system has become classically I/O bound: high load average, high disk utilisation. The disk with the 300 ms service time is probably broken. Usually 5-10 ms is OK for a JBOD disk and 1-4 ms for a disk with cache and controller; either way, 300 ms is 30 times or more too large.
Regards
Tim
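As a rough illustration of the check described above, the following Python sketch scans captured `sar -d` text and reports disks whose average service time exceeds a limit. It assumes the HP-UX column layout `device %busy avque r+w/s blks/s avwait avserv`, with the service time in milliseconds in the last column. The sample text and the 10 ms default (the upper end of the JBOD range quoted above) are assumptions to adjust for a real system.

```python
def slow_disks(sar_output, threshold_ms=10.0):
    """Map device name -> worst avserv (ms) seen above `threshold_ms`.

    Assumes each data row ends with: device, %busy, avque, r+w/s,
    blks/s, avwait, avserv. Rows may or may not start with a timestamp.
    """
    worst = {}
    for line in sar_output.splitlines():
        fields = line.split()
        if len(fields) < 7:
            continue  # blank line or banner
        device, *metrics = fields[-7:]
        try:
            values = [float(m) for m in metrics]
        except ValueError:
            continue  # column header or other non-data line
        avserv = values[-1]
        if avserv > threshold_ms:
            worst[device] = max(avserv, worst.get(device, 0.0))
    return worst


# Hypothetical capture, made up for this example.
SAMPLE = """\
HP-UX host1 B.11.11 U 9000/800    07/09/04

03:50:00   device   %busy   avque   r+w/s  blks/s  avwait  avserv
03:50:05   c0t6d0    2.40    0.50       3      24    4.10    8.20
           c1t2d0   98.00   12.50       4      32  150.30  300.00
Average    c0t6d0    2.40    0.50       3      24    4.10    8.20
Average    c1t2d0   98.00   12.50       4      32  150.30  300.00
"""

if __name__ == "__main__":
    for device, avserv in slow_disks(SAMPLE).items():
        print(f"{device}: avserv {avserv:.1f} ms, "
              f"{avserv / 10.0:.0f}x the 10 ms guideline")
```

A disk that stays on this list across several samples is the one to look at first.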
07-12-2004 08:45 PM
Re: "Load" increasing and machine locking up
running out of resources. Nothing is unusual just before the lockups. Tim, when you say the disk is probably broken, do you mean it could be broken hardware-wise?
Thanks,
Garry
07-12-2004 08:52 PM
Re: "Load" increasing and machine locking up
07-12-2004 09:08 PM
Re: "Load" increasing and machine locking up
Thanks,
Garry
07-12-2004 09:17 PM
TOC: Transfer of control
see
http://www.interex.org/pubcontent/enterprise/mar99/09qa/09qa.html
If all else fails, try stopping all processes that can be stopped and umounting all file systems that can be umounted and then TOC the machine. Doing a transfer of control (TOC) will only save the contents of memory to disk if your machine has been properly configured to do so (see savecore(1M)). An analysis of the dump can be performed to determine the cause of the process hangs.
http://www.docs.hp.com/cgi-bin/fsearch/framedisplay?top=/hpux/onlinedocs/5990-8170/5990-8170_top.html&con=/hpux/onlinedocs/5990-8170/00/00/68-con.html&toc=/hpux/onlinedocs/5990-8170/00/00/68-toc.html&searchterms=toc&queryid=20040713-031651
Steve Steel
07-12-2004 10:25 PM
Re: "Load" increasing and machine locking up
Brilliant. I have learned much from them! Hopefully I can finally track down our problem when it next occurs!
Garry
07-14-2004 09:48 PM
Re: "Load" increasing and machine locking up
Found the machine was locking up because it was waiting on I/O on a disk that was not working properly. All processes went idle while the one waiting just sat there patiently! I identified the disk using sar.
The disks are all hot-swappable, so we pulled the offending disk out and put it back, and the machine recovered and carried on! I've now applied a firmware patch for the disks, taking them from HP01 to HP04. Looks OK at the moment, but if it occurs again we'll get a new disk.
07-22-2004 08:25 AM
Re: "Load" increasing and machine locking up
I was on holiday, so did not reply to your question, but it seems the question was fully resolved due to circumstance.
A disk doing 300 ms service times is VERY likely to be broken, either because of poor or old firmware or, most likely, an intermittent hardware failure, as you found out.
Tim