05-04-2004 01:09 AM
Lost a SCSI disk and system hangs
The monitor processes we have feed information to another process on each node, which distributes processes among the 4 nodes. When one workstation goes down, the processes assigned to it are relocated to another box. Most of the time this works fine. In the case of the drive failure, though, we run into major problems.
The process that monitors node availability and the process that allocates processes are both custom programs written by our project. We inherited a lot of legacy code from a former project.
05-04-2004 01:24 AM
Re: Lost a SCSI disk and system hangs
The ping command works at the network level and requires very little from the other parts of the OS, specifically no disk IO.
Any other process needs some sort of IO to run, be it reading the program from disk, opening device files, etc. When the boot disk went bad, any IO that was or is pointed at anything on that disk will hang indefinitely. You most likely cannot even log in, because logging in has to read things like /etc/profile, /home/????/.profile, etc.
With your process designed as it is, you are really out of luck in this case. The only real way around this is to mirror the disks on the box. If you don't already have MirrorDisk/UX you would have to buy it, but unfortunately HP-UX 10.20 is WAY out of support, so I doubt you could even buy it anymore.
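To illustrate the point about ping versus disk IO, here is a minimal Python sketch of a liveness check that actually touches the disk and gives up after a timeout instead of hanging with it. This is not the poster's monitor; the names disk_probe and run_with_timeout are hypothetical, and it assumes the monitor itself is already loaded in memory and can still start a thread.

```python
import os
import tempfile
import threading


def disk_probe(directory):
    """Write a small file, force it to disk, and read it back."""
    fd, path = tempfile.mkstemp(dir=directory)
    try:
        with os.fdopen(fd, "wb") as f:
            f.write(b"alive")
            f.flush()
            os.fsync(f.fileno())  # forces the write out to the device
        with open(path, "rb") as f:
            # The read-back may come from cache; fsync above is the real disk IO.
            return f.read() == b"alive"
    finally:
        os.remove(path)


def run_with_timeout(probe, timeout):
    """Run probe() in a daemon thread; a hang or an exception counts as failure."""
    result = []

    def worker():
        try:
            result.append(bool(probe()))
        except Exception:
            result.append(False)

    t = threading.Thread(target=worker, daemon=True)
    t.start()
    t.join(timeout)
    # An empty list means the probe is still blocked, e.g. on a dead disk.
    return bool(result) and result[0]
```

A call such as run_with_timeout(lambda: disk_probe("/var/tmp"), 5.0) would report False when the write never completes (the path is only a placeholder). A thread stuck in disk IO cannot be cancelled, only abandoned, so this detects the hang but does not cure it.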
05-04-2004 01:27 AM
Re: Lost a SCSI disk and system hangs
Bill Hassell, sysadmin
05-05-2004 04:46 AM
Re: Lost a SCSI disk and system hangs
05-05-2004 03:42 PM
Re: Lost a SCSI disk and system hangs
Maybe what you need is to improve your monitoring checks. Without having to do too much, if ftp is running on the boxes, you could script ftp and replace your ping test with that... maybe ftp a file to each host to show system status?
Or, if sendmail is running and listening on all the nodes... have them send a message that contains info about the node's status. Should be easy enough to implement. Instead of another mailbox to manage, add an alias and redirect the output to a script that would parse the other node's email w/ status info.... Assuming your entire issue was caused by the box being down but still responding to ping, the above ideas may help.
-denver
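The ftp idea above could be sketched roughly as follows in Python. This is only an illustration under assumptions: the function name ftp_check, the file name healthcheck.tmp, and the account used are all hypothetical, and the account would need write permission in its login directory. Unlike ping, the upload forces the remote host to start its ftp server and write to disk, so a node with a dead disk should fail or time out.

```python
import ftplib
import io


def ftp_check(host, user, password, port=21, filename="healthcheck.tmp", timeout=5.0):
    """Return True if a small file can be uploaded to host within the timeout."""
    try:
        # The timeout applies to the control connection and to the data transfer.
        with ftplib.FTP(timeout=timeout) as ftp:
            ftp.connect(host, port)
            ftp.login(user, password)
            ftp.storbinary("STOR " + filename, io.BytesIO(b"status check\n"))
        return True
    except ftplib.all_errors:
        # Refused connection, timeout, login failure, or a failed transfer.
        return False
```

The monitor would call ftp_check once per node in place of the ping test and treat False as "node unavailable". The timeout value is a judgment call: too short gives false alarms on a busy box, too long delays relocation.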