Operating System - HP-UX
1820592 Members
2153 Online
109626 Solutions
New Discussion юеВ

Event Monitor notification : Serious : what does that mean ???

 
touny
Occasional Advisor

Event Monitor notification : Serious : what does that mean ???

Hi I received this mail (root) on my hp-ux 11.00.
>------------ Event Monitoring Service Event Notification ------------<

Notification Time: Thu Mar 22 08:41:28 2007

hostname sent Event Monitor notification information:

/storage/events/disks/default/0_0_1_1.0.0 is >= 3.
Its current value is SERIOUS(4).



Event data from monitor:

Event Time..........: Thu Mar 22 08:41:28 2007
Severity............: SERIOUS
Monitor.............: disk_em
Event #.............: 4
System..............: hostname

Summary:
Disk at hardware path 0/0/1/1.0.0 : Excessive number of new media-defects.


Description of Error:

There is an excessive number of new media-defects on this drive. The drive
must be replaced before it runs out of spare data blocks. All errors have
been recovered and no data have been lost.

Probable Cause / Recommended Action:

This drive may fail in the near future and should be replaced.
etc...
...................................

What does that mean in ? Should I have to replace a disk ? Or I don't know I need helps ! please

Thanks in advance

Cheers
Al
7 REPLIES 7
James R. Ferguson
Acclaimed Contributor

Re: Event Monitor notification : Serious : what does that mean ???

Hi:

The message means what it says. An excellent guideline to disk replacement is this one:

http://docs.hp.com/en/5991-1236/When_Good_Disks_Go_Bad.pdf

Regards!

...JRF...
DCE
Honored Contributor

Re: Event Monitor notification : Serious : what does that mean ???



The system is warning you that the disk is starting to fail.

You now have the opportunity(luxury) of replacing the disk on your schedule, with no production interruption. I suggest you replace the disk as soon as it is convenient.

touny
Occasional Advisor

Re: Event Monitor notification : Serious : what does that mean ???

O my gosh I really have to change the disk !
It is a production server and they never never never did a backup OS !
I am feeling bad ...
Thanks for the doc.
I will have a look as I never change any disk on hp system (but did it on pseries AIX)

If you have other procs / checklist it is welcome

Thanks for the reply

Cheers
Al
touny
Occasional Advisor

Re: Event Monitor notification : Serious : what does that mean ???

Following the documentation you give me (thanks)
Checking the disk I have no error status on the disks ...
NAWUX01: / # ioscan -funCdisk
Class I H/W Path Driver S/W State H/W Type Description
=====================================================================
disk 0 0/0/1/1.0.0 sdisk CLAIMED DEVICE SEAGATE ST39204LC
/dev/dsk/c1t0d0 /dev/rdsk/c1t0d0
disk 1 0/0/1/1.2.0 sdisk CLAIMED DEVICE SEAGATE ST39204LC
/dev/dsk/c1t2d0 /dev/rdsk/c1t2d0
disk 2 0/0/2/0.0.0 sdisk CLAIMED DEVICE SEAGATE ST39204LC
/dev/dsk/c2t0d0 /dev/rdsk/c2t0d0
disk 3 0/0/2/0.2.0 sdisk CLAIMED DEVICE SEAGATE ST39204LC
/dev/dsk/c2t2d0 /dev/rdsk/c2t2d0
disk 4 0/0/2/1.2.0 sdisk CLAIMED DEVICE HP DVD-ROM 305
/dev/dsk/c3t2d0 /dev/rdsk/c3t2d0

and check the disk with diskinfo command:
# diskinfo /dev/rdsk/c1t0d0
SCSI describe of /dev/rdsk/c1t0d0:
vendor: SEAGATE
product id: ST39204LC
type: direct access
size: 8891556 Kbytes
bytes per sector: 512
# diskinfo /dev/rdsk/c1t2d0
SCSI describe of /dev/rdsk/c1t2d0:
vendor: SEAGATE
product id: ST39204LC
type: direct access
size: 8891556 Kbytes
bytes per sector: 512
# diskinfo /dev/rdsk/c2t0d0
SCSI describe of /dev/rdsk/c2t0d0:
vendor: SEAGATE
product id: ST39204LC
type: direct access
size: 8891556 Kbytes
bytes per sector: 512
# diskinfo /dev/rdsk/c2t2d0
SCSI describe of /dev/rdsk/c2t2d0:
vendor: SEAGATE
product id: ST39204LC
type: direct access
size: 8891556 Kbytes
bytes per sector: 512

# dd if=/dev/rdsk/c1t0d0 of=/dev/null bs=1024k count=64
64+0 records in
64+0 records out

So I don't see any errors on the disk mentionned on the event notification.

There is something else that I should check ?

Thanks in advance
Cheers
Al
MHudec
Frequent Advisor

Re: Event Monitor notification : Serious : what does that mean ???

Touny,

simple dd with first 64M might not reveal the issue (issue might be beyond those 64M).

Check also disk via support tools manager (command line utility is called cstm).

Go according to advice in EMS. Replace the drive. From it's hw path I assume it's vg00 drive, so probably you have all logical volumes mirrored there right?

If so, then just reduce the mirror and replace the drive with the same model.
Steven E. Protter
Exalted Contributor

Re: Event Monitor notification : Serious : what does that mean ???

Shalom,

Back up that disk, with ignite if its a root dis. It has serious issues and is going to die at some point.

Contact hardware after you have a good backup and arrange replacement.

SEP
Steven E Protter
Owner of ISN Corporation
http://isnamerica.com
http://hpuxconsulting.com
Sponsor: http://hpux.ws
Twitter: http://twitter.com/hpuxlinux
Founder http://newdatacloud.com
Andrew Merritt_2
Honored Contributor

Re: Event Monitor notification : Serious : what does that mean ???

Hi Al,
I'd agree, the best course of action would be to replace the disk.

One thing to check, what version of the OnlineDiags do you have installed? It shows as the version when you run STM, and in the EMS event it is "STM Version".

I notice that the disks are 8Gb, which means that the events may be valid.

(Event 4 has rarely been seen on disks >9Gb, but that is an error as those disks use SMART, and the disk_em monitor does not check the defect data on the drive. If you get Event 4 from disk_em for a >9Gb disk, you can ignore it.)

Andrew