- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - HP-UX
- >
- Re: EMS verses stm
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-30-2002 12:39 AM
01-30-2002 12:39 AM
EMS verses stm
Checking the disk using stm and all appears OK, no errors logged at all and verification works fine.
What is most likely to be correct??????
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-30-2002 02:27 AM
01-30-2002 02:27 AM
Re: EMS verses stm
You won't want to perform a "write" test on your production disk just to verify whether it is faulty for writes. In STM, you are likely to have performed an exercise test only.
To check it out, schedule downtime, perform a full backup of the data on the disk, initiate a write test using tools like STM or dd. That will help verify whether your disk is faulty. If it isn't, then it could be your disk controller. Check whether other disks on the same bus are also facing the same errors.
Hope this helps. Regards.
Steven Sim Kok Leong
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-30-2002 02:38 AM
01-30-2002 02:38 AM
Re: EMS verses stm
The main reason for asking this question is that the disk concerned is the root disk, one of two in /dev/vg00. The company didn't want to buy mirrordisk to save on costs so we have no fall back plans. This message is only appearing on one disk, so far 2 messages in 3 days
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-30-2002 02:45 AM
01-30-2002 02:45 AM
Re: EMS verses stm
Sorry if I gave the impression that I was laughing. I seriously wasn't.
Usually if there is a bad sector on the disk, you will not be able to read from it as well. As such, I personally think that the exercise test you performed in STM would be rather thorough.
Have you ran an exercise test on the SCSI controller as well?
Can you post up the EMS error message for us forumers to take a look? The EMS error might give more insight.
Hope this helps. Regards.
Steven Sim Kok Leong
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-30-2002 02:46 AM
01-30-2002 02:46 AM
Re: EMS verses stm
I'd make sure that I had a current Ignite recovery tape of all of vg00 -- just for insurance sake, should your disk go bad ;-)
In my opinion, no server should be without a mirrored boot volume.
# /opt/ignite/bin/make_tape_recovery -x inc_entire=vg00 -I -v -a /dev/rmt/0mn
Regards!
...JRF...
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-30-2002 02:57 AM
01-30-2002 02:57 AM
Re: EMS verses stm
We already have an upto date make_recovery tape taken recently on hand just incase it is needed.
Interesting thing I noticed was that the first message occured Sunday pm when nothing was runnning on the system.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-30-2002 03:16 AM
01-30-2002 03:16 AM
Re: EMS verses stm
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-30-2002 03:09 PM
01-30-2002 03:09 PM
Re: EMS verses stm
I would trust the EMS output and give it the benefit of the doubt.
Since EMS has been reporting write errors and not read errors, an exercise test of your disk via STM may not reflect 100% accurately because that is a read test. However, if it was bad sector(s), then a disk exercise should fail during read as well.
Did you perform just a verification test or an exercise test on your harddisk? If you did perform an exercise test and it passed, then the harddisk failure might be due to other harddisk-related hardware reasons (such as the write head) than bad sectors. Note in the EMS error that it stated:
Reallocating the data to a spare area on the medium was
attempted, but failed.
Since this is the only disk on the same bus that has been receiving the errors, it should be safe to deduce that your harddisk SCSI controller is functioning fine.
Since this is your root disk (and not a data disk), you should perform an updated make_tape_recovery and replace the disk, as the others have already mentioned. Otherwise, you risk some of your system data being corrupted.
Hope this helps. Regards.
Steven Sim Kok Leong
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-31-2002 12:24 AM
01-31-2002 12:24 AM
Re: EMS verses stm
Many thannks for the information, I actually performed a verification to start with but after your previous post I then performed an excersise. Both have worked fine and 100% completed with no new messages reported by EMS. As this machine was purchased 2nd hand, I have found out that my collegue who loaded the OS had actuallly performed a read/write test on the disk pror to the OS load beginning of December which again was fine.
I am interested to see another thread has been started regarding EMS reporting critical failures on a fully working system. Is EMS over sensitive
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-31-2002 12:39 AM
01-31-2002 12:39 AM
Re: EMS verses stm
One quick pice of advice:
If the system is production then replace
the disk, and then make arrangements to have
a mirrored disk installed. I guess you could
answer your own question... how much would
really be lost cost wise if a production
system was down when it is being used.
We had a production system down today
because of a faulty HBA card and it has
cost us in the tens of thousands. Having
the right redundancies helps, but you can
never really predict what time of what
day you have a hardware failure.
-Michael
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-31-2002 01:25 AM
01-31-2002 01:25 AM
Re: EMS verses stm
We both know that a simple purcase like mirrordisk saves thousands in lost production time, but when the IT department falls under the control of the accounting trolls (Dilbert cartoon)trying to get a box of DDS2 tapes is a nightmare