HP-UX
1748211 회원
4742 온라인
108759 솔루션
새 메시지

disk error 관련 문의 드립니다.

 
최경민_1
비정기 조언자

disk error 관련 문의 드립니다.

안녕하세요.
event 메세지에 아래와 같은 error 메세지가 나와서 어떤 조치를 취해야할지 문의 드립니다.
model : rx6600
OS ver : 11.23
internal disk 이며 data 영역으로 mirror 구성 되어있습니다. vgdisplay, lvdisplay, pvdisplay를 확인해도 stale 된 부분은 없습니다. 전면 LED 도 정상이구요.
==================================================
event.log
==================================================

>------------ Event Monitoring Service Event Notification ------------<

Notification Time: Fri Apr 30 17:03:25 2010

hanla01 sent Event Monitor notification information:

/storage/events/disks/default/0_4_1_0.0.0.4.0
is >= 1.
Its current value is CRITICAL(5).



Event data from monitor:

Event Time..........: Fri Apr 30 17:03:25 2010
Severity............: CRITICAL
Monitor.............: disk_em
Event #.............: 100237
System..............: hanla01

Summary:
Disk at hardware path 0/4/1/0.0.0.4.0 : Media error


Description of Error:

The device was unsuccessful in reading or writing data for the current I/O
request due to an error on the medium. The maximum number of retries were
attempted and the data could not be read.

Probable Cause / Recommended Action:

If the event is reported against a device other than a disk drive:

- Reformatting the medium may fix the problem.
- Alternatively, the medium in the device is flawed.
- If the medium is removable, replace the medium with a fresh one.
- Alternatively, if the medium is not removable, the device has
experienced a hardware failure. Repair or replace the device, as
necessary.

If the event is reported against a disk drive on a system on which none
or only some of the disks are in a redundant environment (i.e., mirrored):

- Review applications for errors at the time the event was reported to
determine which data could not be read.
- Attempt to re-read the data.
- Re-write the data to the disk to allow the disk to reallocate to a
spare area on the disk.
- If a re-read of the data and/or a rewrite of the data are not
successful, the disk should be replaced and data restored from backup.

If the event is reported against a disk drive on a system on which all
disks are in a redundant environment (i.e., mirrored):

- When the OS is patched to current LVM and SCSI patches, reallocation
will take place automatically for these disks, and no action needs be
taken to check or replace these drives.
- To avoid unnecessary paging and notification, the severity of this
event can be changed to MINOR_WARNING by enabling the alternate
configuration for this event in
/var/stm/config/tools/monitor/default_disk_em.clcfg (and
/var/stm/config/tools/monitor/rst_disk_em.clcfg, if it exists):

- Find the following lines:
EQ:100237:CRITICAL:...

and insert a "#" in column 1.

- Remove the "#" from column 1 of the line which starts:
EQ:100237:MINOR_WARNING:...


Additional Event Data:
System IP Address...: 10.223.20.10
Event Id............: 0x4bda8ecd00000000
Monitor Version.....: B.01.01
Event Class.........: I/O
Client Configuration File...........:
/var/stm/config/tools/monitor/default_disk_em.clcfg
Client Configuration File Version...: A.01.00
Qualification criteria met.
Number of events..: 1
Associated OS error log entry id(s):
0x4bda8ecc00000000
Additional System Data:
System Model Number.............: ia64 hp server rx6600
OS Version......................: B.11.23
STM Version.....................: C.60.00
EMS Version.....................: A.04.20
Latest information on this event:
http://docs.hp.com/hpux/content/hardware/ems/scsi.htm#100237

v-v-v-v-v-v-v-v-v-v-v-v-v D E T A I L S v-v-v-v-v-v-v-v-v-v-v-v-v



Component Data:
Physical Device Path...: 0/4/1/0.0.0.4.0
Device Class...........: Disk
Inquiry Vendor ID......: HP
Inquiry Product ID.....: DG146ABAB4
Firmware Version.......: HPD7
Serial Number..........: 3NM2XFY900009805T5J4

Product/Device Identification Information:

Logger ID.........: sdisk
Product Identifier: SCSI Disk
Product Qualifier.: HPDG146ABAB4
SCSI Target ID....: 0x04
SCSI LUN..........: 0x00

I/O Log Event Data:

Driver Status Code..................: 0x0000007C
Length of Logged Hardware Status....: 22 bytes.
Offset to Logged Manager Information: 24 bytes.
Length of Logged Manager Information: 34 bytes.

Hardware Status:

Raw H/W Status:
0x0000: 00 00 00 02 F0 00 03 0F 84 61 A8 0A 00 00 00 00
0x0010: 11 00 81 80 00 97

SCSI Status...: CHECK CONDITION (0x02)
Indicates that a contingent allegiance condition has occurred. Any
error, exception, or abnormal condition that causes sense data to be
set will produce the CHECK CONDITION status.

SCSI Sense Data:

Undecoded Sense Data:
0x0000: F0 00 03 0F 84 61 A8 0A 00 00 00 00 11 00 81 80
0x0010: 00 97

SCSI Sense Data Fields:
Error Code : 0x70
Segment Number : 0x00
Bit Fields:
Filemark : 0
End-of-Medium : 0
Incorrect Length Indicator : 0
Sense Key : 0x03
Information Field Valid : TRUE
Information Field : 0x0F8461A8
Additional Sense Length : 10
Command Specific : 0x00000000
Additional Sense Code : 0x11
Additional Sense Qualifier : 0x00
Field Replaceable Unit : 0x81
Sense Key Specific Data Valid : TRUE
Sense Key Specific Data : 0x80 0x00 0x97

Sense Key 0x03, MEDIUM ERROR, indicates that the command terminated
with a nonrecovered error condition that was probably caused by a
flaw in the medium or an error in the recorded data. This sense key
may also be returned if the device is unable to distinguish between a
flaw in the medium and a specific hardware failure (sense key 0x04).
For the RECOVERED ERROR, HARDWARE ERROR, or MEDIUM ERROR Sense Key,
the Sense Key Specific data indicates that 151 retries were
attempted.

The combination of Additional Sense Code and Sense Qualifier (0x1100)
indicates: Unrecovered read error.

SCSI Command Data Block:

Command Data Block Contents:
0x0000: 28 00 0F 84 61 80 00 00 80 00

Command Data Block Fields (10-byte fmt):
Command Operation Code...(0x28)..: READ
Logical Unit Number..............: 0
DPO Bit..........................: 0
FUA Bit..........................: 0
Relative Address Bit.............: 0
Logical Block Address............: 260333952 (0x0F846180)
Transfer Length..................: 128 (0x0080)

Manager-Specific Data Fields:
Request ID.............: 0x074175C5
Data Residue...........: 0x0000AE00
CDB status.............: 0x00000002
Sense Status...........: 0x00000000
Bus ID.................: 0x07
Target ID..............: 0x04
LUN ID.................: 0x00
Sense Data Length......: 0x12
Q Tag..................: 0xDF
Retry Count............: 7


>---------- End Event Monitoring Service Event Notification ----------<

==================================================
ioscan -funC disk
==================================================
Class I H/W Path Driver S/W State H/W Type Description
====================================================================================
disk 0 0/0/2/1.0.16.0.0 sdisk CLAIMED DEVICE TEAC DV-28E-N
/dev/dsk/c0t0d0 /dev/rdsk/c0t0d0
disk 8 0/2/1/0.1.5.0.1.1.6 sdisk CLAIMED DEVICE EMC SYMMETRIX
/dev/dsk/c17t1d6 /dev/rdsk/c17t1d6
disk 9 0/2/1/0.1.5.0.1.1.7 sdisk CLAIMED DEVICE EMC SYMMETRIX
/dev/dsk/c17t1d7 /dev/rdsk/c17t1d7
disk 10 0/2/1/0.1.5.0.1.2.0 sdisk CLAIMED DEVICE EMC SYMMETRIX
/dev/dsk/c17t2d0 /dev/rdsk/c17t2d0
disk 30 0/3/1/0.2.5.0.1.1.6 sdisk CLAIMED DEVICE EMC SYMMETRIX
/dev/dsk/c21t1d6 /dev/rdsk/c21t1d6
disk 31 0/3/1/0.2.5.0.1.1.7 sdisk CLAIMED DEVICE EMC SYMMETRIX
/dev/dsk/c21t1d7 /dev/rdsk/c21t1d7
disk 32 0/3/1/0.2.5.0.1.2.0 sdisk CLAIMED DEVICE EMC SYMMETRIX
/dev/dsk/c21t2d0 /dev/rdsk/c21t2d0
disk 1 0/4/1/0.0.0.0.0 sdisk CLAIMED DEVICE HP DG072A8B54
/dev/dsk/c7t0d0 /dev/rdsk/c7t0d0
disk 2 0/4/1/0.0.0.1.0 sdisk CLAIMED DEVICE HP DG072A8B54
/dev/dsk/c7t1d0 /dev/rdsk/c7t1d0
disk 3 0/4/1/0.0.0.2.0 sdisk CLAIMED DEVICE HP IR Volume
/dev/dsk/c7t2d0 /dev/dsk/c7t2d0s2 /dev/rdsk/c7t2d0 /dev/rdsk/c7t2d0s2
/dev/dsk/c7t2d0s1 /dev/dsk/c7t2d0s3 /dev/rdsk/c7t2d0s1 /dev/rdsk/c7t2d0s3
disk 4 0/4/1/0.0.0.3.0 sdisk CLAIMED DEVICE HP IR Volume
/dev/dsk/c7t3d0 /dev/rdsk/c7t3d0
disk 5 0/4/1/0.0.0.4.0 sdisk CLAIMED DEVICE HP DG146ABAB4
/dev/dsk/c7t4d0 /dev/rdsk/c7t4d0
disk 6 0/4/1/0.0.0.5.0 sdisk CLAIMED DEVICE HP DG146ABAB4
/dev/dsk/c7t5d0 /dev/rdsk/c7t5d0
==================================================
lvmtab
==================================================
/dev/vg03
/dev/dsk/c7t4d0
/dev/dsk/c7t5d0
==================================================
1 응답 1
양계전
초등학생

disk error 관련 문의 드립니다.

vg03이 software mirror 로 되어있나 보군요

문제있는 디스크를 mirror에서 제거 하시구요

lvreduce -m 0 /dev/vg03/lvol? /dev/dsk/c7t4d0

하시고 디스크 replace 하시고 다시 mirror를 거세요