Operating System - HP-UX
1834814 Members
2408 Online
110070 Solutions
New Discussion

Syslog Messages - Bad Disk ?

 
John_44
Advisor

Syslog Messages - Bad Disk ?

Hello All,

Yesterday these messages appeared in Syslog:

Mar 16 12:25:20 vmunix: SCSI: Third party detected bus hang -- lbolt: 1190522002, bus: 0
Mar 16 12:25:20 vmunix: lbp->state: 20
Mar 16 12:25:20 vmunix: lbp->offset: 2f8
Mar 16 12:25:20 vmunix: lbp->uPhysScript: 280000
Mar 16 12:25:20 vmunix: From most recent interrupt:
Mar 16 12:25:20 vmunix: ISTAT: 0a, SIST0: 90, SIST1: 00, DSTAT: 80, DSPS: 00600a00
Mar 16 12:25:20 vmunix: NCR chip register access history (most recent last): 409997614 accesses
Mar 16 12:25:20 vmunix: 0, ISTAT<-20
Mar 16 12:25:20 last message repeated 13 times
Mar 16 12:25:20 vmunix: 0, ISTAT: 20
Mar 16 12:25:20 vmunix: 0, ISTAT: 20
Mar 16 12:25:20 vmunix: lsp: 0
Mar 16 12:25:20 vmunix: lbp->owner: 6804c80
Mar 16 12:25:20 vmunix: bp->b_dev: 1f008000
Mar 16 12:25:20 vmunix: scb->io_id: 62e26c
Mar 16 12:25:20 vmunix: scb->cdb: 2a 00 00 53 31 c0 00 00 10 00
Mar 16 12:25:20 vmunix: lbolt_at_timeout: 1190521902, lbolt_at_start: 1190520402
Mar 16 12:25:20 vmunix: lsp->state: 85
Mar 16 12:25:20 vmunix:
Mar 16 12:25:20 vmunix: SCSI: Resetting SCSI -- lbolt: 1190522102, bus: 0
Mar 16 12:25:20 vmunix:
Mar 16 12:25:20 vmunix: SCSI: Reset detected -- lbolt: 1190522102, bus: 0
Mar 16 12:25:20 vmunix: LVM: vg[1]: pvnum=0 (dev_t=0x1f008000) is POWERFAILED
Mar 16 12:25:20 vmunix: LVM: PV 0 has been returned to vg[1].

At first I thought that I had a disk that went bad but ioscan shows the disk as claimed, diskinfo returns information about the disk and the disk, which is part of a mirror set, shows that the mirror is synced. Any ideas? Any help is appreciated and I will assign points as always.

John
5 REPLIES 5
Steven E. Protter
Exalted Contributor

Re: Syslog Messages - Bad Disk ?

A SCSI device, most likely a disk is bad.

It could be caused by a bad drive cage, if you have one. It could be a bad cable, or bad scsi termination. It could be an unused disk gone bad, messing with its brothers and sisters on the scsi chain.

You might wish to use one of the utilities below try and identify the actual cause.

stm
mstm
xstm(X-windows only).

You should back up data and get HP support in to replace the desk ASAP.

SEP
Steven E Protter
Owner of ISN Corporation
http://isnamerica.com
http://hpuxconsulting.com
Sponsor: http://hpux.ws
Twitter: http://twitter.com/hpuxlinux
Founder http://newdatacloud.com
Marco Santerre
Honored Contributor

Re: Syslog Messages - Bad Disk ?

Is does look like a disk has problem. Have you tried doing a vgdisplay to see if you had any stale extents on any of your logical volumes?

I would, though place a call with HP regarding that disk (if it is an internal disk or if it's a disk that HP takes care of).
Cooperation is doing with a smile what you have to do anyhow.
Eugeny Brychkov
Honored Contributor

Re: Syslog Messages - Bad Disk ?

John,
disk is c0t8d0. Please check its surface running
dd if=/dev/rdsk/c0t8d0 of=/dev/null bs=4096k
and if it will return 'I/O' error then disk is bad. If not, then SCSI bus/interface is under suspicion. Check bus termination, bus length.
Please note that SCSI Id8 has LOWEST priority. So all other devices on the bus trying to gain the bus access will win over this disk
Eugeny
S.K. Chan
Honored Contributor

Re: Syslog Messages - Bad Disk ?

Hang on .. it may not be hardware related. When you see "pv0 has been returned .." with "pvnum=0..is POWERFAILED" it could be due to low timeout value of the disk. By default it's 30sec. What you can do is try changing that to say 60sec first and see if you still get the error after a few days. To change ..
# pvchange -t 60 /dev/dsk/c0t8d0
This is a possibility only, you would still want to check the disk physically (it's connection or termination) and use appropriate diagnostics tool for this. Running "exerciser" (in STM)on the disk is recommended but do it when the system is less busy.
KCS_1
Respected Contributor

Re: Syslog Messages - Bad Disk ?

hello~

I have seem this message before.

Above posted message are good stuff for solving your problem.

In addition,

Check your h/w(disk : /dev/dsk/c0t8d0) :

# ioscan -funCdisk

look at all disk which are returned correct status or not

# diskinfo -v /dev/dsk/c0t8d0

Returned correct informations from the disk or not

# dd if=/dev/dsk/c0t8d0 of=/dev/null bs=512k

testing of read/write

are they say normal status??
if, all of things are good,
in my opinion, that's ok!

but, one of things returned not ok, swap in/out and then gathering the disk information,again such as above,again.

you have a problem,may it's bad disk!! i think
Call your CE of HP.

have a good day!

Easy going at all.