- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - HP-UX
- >
- Probable disk failure.. need to confirm...
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Forums
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-05-2009 09:29 PM
тАО11-05-2009 09:29 PM
Probable disk failure.. need to confirm...
I have a disk which seems to be failing on HPUX 11.11
Following are the messages i got from EMS.
Disk at hardware path 0/1/1/0.0.0 : I/O request failed.
Disk at hardware path 0/1/1/0.1.0 : Software configuration error
Disk at hardware path 0/1/1/0.0.0 : A SMART event has occurred.
Disk at hardware path 0/1/1/0.0.0 : Software configuration error
Tests i did
# ioscan -fnH 0/1/1/0.0.0
Class I H/W Path Driver S/W State H/W Type Description
=====================================================================
disk 1 0/1/1/0.0.0 sdisk CLAIMED DEVICE HP 73.4GST373454LC
/dev/dsk/c2t0d0 /dev/rdsk/c2t0d0
# diskinfo /dev/rdsk/c2t0d0
SCSI describe of /dev/rdsk/c2t0d0:
vendor: HP 73.4G
product id: ST373454LC
type: direct access
size: 71687369 Kbytes
bytes per sector: 512
# pvdisplay /dev/dsk/c2t0d0
--- Physical volumes ---
PV Name /dev/dsk/c2t0d0
VG Name /dev/vg00
PV Status available
Allocatable yes
VGDA 2
Cur LV 9
PE Size (Mbytes) 16
Total PE 4374
Free PE 0
Allocated PE 4374
Stale PE 0
IO Timeout (Seconds) default
Autoswitch On
MSTM
Hardware path: 0/1/1/0.0.0
Product Id: ST373454LC Vendor: HP 73.4G
Device Type: SCSI Disk Firmware Rev: HPC3
Device Qualifier: HP73.4GST373454LC Logical Unit: 0
Serial Number: 3KP1Z4KM00007621M7F3
Capacity (M Byte): 70007.20
Block Size: 512
Max Block Address: 143374737
Error Logs
Total Retries: 0 Buffer Overruns: N/A
Read Reverse Errors: N/A Buffer Underruns: N/A
Write Errors: 0 Non-Medium Errors: 12
Verify Errors: 0
SYSLOG
Nov 5 18:32:42 eca1ap21 vmunix: SCSI: Request Timeout; Abort Tag -- lbolt: 612603113, dev: 1f020000, io_id: 2aed574
Nov 5 18:32:52 eca1ap21 vmunix: SCSI: Request Timeout; Abort Tag -- lbolt: 612603313, dev: 1f020000, io_id: 2aed61c
Nov 5 18:32:52 eca1ap21 vmunix: SCSI: Request Timeout; Abort Tag -- lbolt: 612603413, dev: 1f020000, io_id: 2aed61e
Nov 5 18:32:52 eca1ap21 vmunix: SCSI: Request Timeout; Abort Tag -- lbolt: 612603413, dev: 1f020000, io_id: 2aed5d3
Nov 5 18:32:52 eca1ap21 vmunix: SCSI: isrEscape Controller at 0/1/1/0.
Nov 5 18:32:52 eca1ap21 vmunix: SCSI: First party detected bus hang (HTH) -- lbolt: 612603734, dev: 1f020000
Nov 5 18:32:52 eca1ap21 vmunix: SCSI: Resetting SCSI -- lbolt: 612603834, bus: 2 path: 0/1/1/0
Nov 5 18:32:52 eca1ap21 vmunix: SCSI: Reset detected -- lbolt: 612603834, bus: 2 path: 0/1/1/0
Nov 5 18:32:52 eca1ap21 vmunix: SCSI: Read error -- dev: b 31 0x020000, errno: 126, resid: 1024,
# cd dsk
# ll | grep 020000
brw-r----- 1 bin sys 31 0x020000 Feb 6 2007 c2t0d0
# dd if=/dev/rdsk/c2t0d0 of=/dev/null bs=1024k count=64
64+0 records in
64+0 records out
All tests show that the SCSI device is OK except syslog.
Should i plan disk replacement.??
Thanks for your suggestions
Sunny
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-05-2009 09:51 PM
тАО11-05-2009 09:51 PM
Re: Probable disk failure.. need to confirm...
Yes..syslog says lbolt error possibly hardware.
We can try running dd on the full disk and see if we get any error.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-05-2009 09:55 PM
тАО11-05-2009 09:55 PM
Re: Probable disk failure.. need to confirm...
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-05-2009 10:22 PM
тАО11-05-2009 10:22 PM
Re: Probable disk failure.. need to confirm...
70007+1 records in
70007+1 records out
Some more logs i found
Nov 5 18:32:52 eca1ap21 vmunix: LVM: VG 64 0x000000: PVLink 31 0x020000 Failed! The PV is not accessible.
Nov 5 18:32:52 eca1ap21 vmunix:
Nov 5 18:32:57 eca1ap21 above message repeats 2 times
Nov 5 18:32:57 eca1ap21 vmunix: LVM: VG 64 0x000000: PVLink 31 0x020000 Recovered.
Any suggestions?
Sunny
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-05-2009 11:05 PM
тАО11-05-2009 11:05 PM
Re: Probable disk failure.. need to confirm...
What is the value for IO timeout?
#pvdisplay /dev/dsk/cxtydz | grep -i io
default normally refers to 30 secs.
It can be changed online.
pvchange -t 120 /dev/dsk/cxtydz
This sets the time for IO timeout attempts from LVM in which if a response is not recieved from the PV or the Path of the PV , that will be marked as failed and if that is a path failure and Alternate Paths are configured then the Alternate path will be used for IO.
This is avoid syslog error for "pvlink failed"
Any stale extents on the disk?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-05-2009 11:10 PM
тАО11-05-2009 11:10 PM
Re: Probable disk failure.. need to confirm...
There are no Stale PEs on the disk.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-06-2009 01:16 AM
тАО11-06-2009 01:16 AM
Re: Probable disk failure.. need to confirm...
Your Above tests shows that Disk is fine.
So you can just chage PV time out & can be done online
pvchange -t 180 /dev/dsk/c2t0d0
Regards
Sanjeev
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-06-2009 03:14 AM
тАО11-06-2009 03:14 AM
Re: Probable disk failure.. need to confirm...
Can we conclude the disk is OK.
Thanks
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-06-2009 03:33 AM
тАО11-06-2009 03:33 AM
Re: Probable disk failure.. need to confirm...
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-06-2009 08:30 AM
тАО11-06-2009 08:30 AM
Re: Probable disk failure.. need to confirm...
if you can ioscan , diskinfo and dd a drive normally than there is no hardware problem
You can also check or "refer" PDF / Document
"Good_Disk_Gone_Bad"