
filesystem hanging

 
jedd
Advisor

filesystem hanging

Hello all,
I need help diagnosing this problem. I'm attaching a txt file with what I got from the syslog. About once a week the filesystem hangs for some reason, and when I look at the syslog this is what I see. It seems I have a user who is increasing the number of NFS server daemons. Will that cause the filesystem to hang? I don't know how to diagnose all the SCSI errors. I have (4) 17GB drives on a single channel, with Oracle running on them. vg00 holds all my root filesystems. vgDB holds all my Oracle stuff, including the RDBMS and all the datafiles. None of the Oracle data is on vg00. I striped (3) 17GB drives for all the Oracle data, so I don't know if that's causing it to fail. I installed the latest release, HP-UX 11.11 09/2002. I would appreciate it if somebody could help me decipher the errors in the txt file.
Sridhar Bhaskarla
Honored Contributor

Re: filesystem hanging

Hi

You have a problem with your c0t5d0 disk.

Do an ioscan -fnC disk > disk.out

Look at disk.out and see if this disk is claimed. It could be a hardware issue.
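
A rough sketch of how that check could be automated. The helper name unclaimed_disks is made up for illustration, and it assumes the usual ioscan -f layout, where each device line starts with its class ("disk") and carries a software state such as CLAIMED or NO_HW as its own field:

```shell
# unclaimed_disks is a hypothetical helper, not an HP-UX command.
# It reads `ioscan -fnC disk` output on stdin and prints every disk line
# whose software state is not CLAIMED.
# Assumption: device lines begin with the class name "disk" and the state
# is a separate whitespace-delimited field on the same line.
unclaimed_disks() {
    awk '$1 == "disk" {
        claimed = 0
        for (i = 2; i <= NF; i++) if ($i == "CLAIMED") claimed = 1
        if (!claimed) print
    }'
}
```

It could be run as `unclaimed_disks < disk.out`. No output would mean every listed disk is claimed, which, as noted later in the thread, does not rule out bad blocks.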

-Sri
You may be disappointed if you fail, but you are doomed if you don't try
James R. Ferguson
Acclaimed Contributor

Re: filesystem hanging

Hi:

Assuming that the disk is claimed by 'ioscan' you could look for errors thusly:

# dd if=/dev/rdsk/c0t5d0 of=/dev/null bs=256k

Regards!

...JRF...
Rita C Workman
Honored Contributor

Re: filesystem hanging

I agree that it looks like you have a disk problem. But I think running ioscan may still show the disk as claimed.
It looks like there are some bad blocks on the disk that can no longer be read.

Determine which volume group and logical volumes this disk belongs to. Then you will know best how to proceed and what you may need to address.
Fixing vg00 is one thing, but if you have bad blocks (that HP did not re-allocate) and they involve your Oracle database, that is another can of worms.

Just a thought,
Rita
Eugeny Brychkov
Honored Contributor

Re: filesystem hanging

I agree with Rita and James. It's highly possible that it's a hardware problem. dd will show whether the disk surface is readable. If it is not, and this disk is part of root with a contiguous allocation policy, that can be the cause of the hangs. The best choice is to replace it. Make an Ignite tape and replace the disk (call HP if you have a warranty/contract).
Eugeny
Helen French
Honored Contributor

Re: filesystem hanging

I would suggest checking your disks and other hardware with the STM tools. You can run 'dd' on the disks as described above. Also, install any patches if you haven't updated your system lately.
Life is a promise, fulfill it!
Steven E. Protter
Exalted Contributor

Re: filesystem hanging

I chime in.

Bad disk.

Time to shut the database down and take a cold backup while HP support comes out with a new disk. Every lbolt error I ever got led to a disk replacement.

It is conceivable that STM will point you to a SCSI problem, and if the problem repeats itself, look into the cabling, the SCSI card or the drive cage (if applicable).

Not an optimal Oracle setup either. You'd be better off with MirrorDisk/UX and RAID 10, but it is more costly in terms of disk space.

At least for data, having a full mirror copy is much more important for performance and reliability than striping.


SEP
jedd
Advisor

Re: filesystem hanging

Hello all, thanks for the replies. I am not too good at reading dd output. Can you tell me what format the output will be in?
Helen French
Honored Contributor

Re: filesystem hanging

If you run the dd command as specified by James and there is a problem with your disk, it will report an I/O error and exit. If no error is found, it will run to completion and report the total number of blocks in and blocks out.
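
A minimal sketch of that pass/fail check, relying on dd's exit status. The function name check_disk_read is hypothetical, and the 256k block size is simply carried over from the earlier command:

```shell
# check_disk_read is a hypothetical helper name, not an HP-UX tool.
# It reads the given device (or any file) to the end and reports whether
# dd finished cleanly or stopped on an error.
check_disk_read() {
    dev=$1
    # dd writes its "records in/out" summary to stderr, so capture both streams.
    if out=$(dd if="$dev" of=/dev/null bs=256k 2>&1); then
        echo "OK: $dev read to the end"
    else
        echo "FAILED: $dev could not be read completely"
    fi
    # Show dd's own messages, including any I/O error text.
    echo "$out"
}
```

For example, `check_disk_read /dev/rdsk/c0t5d0` would read the whole raw device, which can take a long time on a large disk.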
Life is a promise, fulfill it!
Michael Steele_2
Honored Contributor

Re: filesystem hanging

Jedd:

dd if=/dev/rdsk/c0t6d0 of=/dev/null count=1000000

At this point the cursor will hang for several seconds. Let it. You want to read beyond the disk cache, and count=1000000 is there to do this.

Then press Control-C to break out and read the following:

6930+0 records in
6930+0 records out

If the disk is bad the cursor will not return until the disk is swapped out, so you'll have to open another window or another session.

You'll occasionally see the I/O error:

dd read error: I/O error
6930+0 records in
6930+0 records out
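
To make the record counts easier to read, here is a small sketch that turns the summary into bytes. The helper name records_to_bytes is made up for illustration. It assumes the block size is 512 bytes unless you pass another one, since 512 is dd's default when no bs= is given:

```shell
# records_to_bytes is a hypothetical helper, not part of dd.
# It reads dd's summary on stdin and prints the number of bytes covered by
# the full input records (the number before the "+").
# Usage: records_to_bytes [block_size_in_bytes]   (default 512)
records_to_bytes() {
    bs=${1:-512}
    while read -r line; do
        case "$line" in
            *"records in")
                full=${line%%+*}            # text before the first "+"
                echo $(( $full * $bs ))
                ;;
        esac
    done
}
```

Feeding it the output above, `printf '6930+0 records in\n6930+0 records out\n' | records_to_bytes` prints 3548160, so about 3.5 MB had been read when dd stopped. Any "dd read error" line in the input is ignored.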
Support Fatherhood - Stop Family Law