- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - HP-UX
- >
- how to detect hardware errors
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-18-2004 12:46 AM
08-18-2004 12:46 AM
how to detect hardware errors
we recently had a disk failure in a D370.
After a reboot it appeared that a mirror set was corrupt. Conclusion was that both disks in a mirror set were defective.
We are wondering when the errors occurred. Did they start because of the reboot or were they dormant?
This brings me to this question :
Is there a way to detect hardware errors in a D-class server? There is no guardian processor in it I think.
I assume we can only check outputs of ioscan, vgdisplay and a dd to /dev/null.
But maybe you have other tips?
Thanks in advance,
Jeroen.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-18-2004 12:49 AM
08-18-2004 12:49 AM
Re: how to detect hardware errors
As a note, Predictive has been replaced now by ISEE.
Were there any errors in dmesg or the syslog.log?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-18-2004 01:18 AM
08-18-2004 01:18 AM
Re: how to detect hardware errors
you can use cstm (if installed) or look at /var/tombstones/ts99 (most recent)
This should give you hints about any HW Failure
Regards
Franky
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-18-2004 01:26 AM
08-18-2004 01:26 AM
Re: how to detect hardware errors
If the disk is having touble finding data (re reads?) you can see it in access times creeping up. You'd need a baseline (save a sar output before problems develop) to compare.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-18-2004 01:34 AM
08-18-2004 01:34 AM
Re: how to detect hardware errors
This type of failures would be logged into the syslog.log. If you want to get notified on such errors then you may have to write a script to capture key words like, SCSI,lbolt, error, warning etc.
EMS notification will also appear in syslog.log. You can also monitor the /var/opt/resmon/log/event.log through your script.
If you do not want to use scripts, then it is a good practice to check the syslog.log file everyday for such a problem.
In your case, if one of the mirror disk was bad and if you tried to boot with a single disk, then you have to give "-lq" at the ISL
ISL>hpux -lq
This will make the system activate vg00 without checking the quorum. You need to do this if the below command gives output as,
ISL>lsautofl
hpux
But if your autoboot string was
ISL>lsautofl
hpux -lq
then it should have booted on its own, provided the primary and alternate boot paths were set correctly.
Hope this helps.
Cheers,
Mohan.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-19-2004 09:30 PM
08-19-2004 09:30 PM
Re: how to detect hardware errors
Our customer knows enough now.
The quorum check etc was done and didn't work before. The hardware call was already solved, we just wanted to know some ways of detecting the hardware errors.
I guess the syslog.log and checking for certain keywords is the best way to go.
Actually a new server with a GSP is the best way...
Thanks all!