Operating System - Linux
1832804 Members
3166 Online
110045 Solutions
New Discussion

Re: ml350 internal 72i scsi tape problem

 
Andrea Rossi
Frequent Advisor

ml350 internal 72i scsi tape problem

Hi all

I got a problem with an ML350G3 (2 processors, 2,5 GB ram, Redhat ES 2.1 kernel 2.4.9-e27, PSP 7).

performing a full backup of about 30GB I got this message (after about 1,5 hours)

kernel: scsi : aborting command due to timeout : pid 0, scsi1, channel 0, id 5, lun 0 Write (6) 00 00 28 00 00

three times, then system hang, so that I can't even reboot it.

internal scsi is seen as Adaptec AIC-7899 (driver aic7xxxx)
tape is a C7438A

Thanks a lot
5 REPLIES 5
Mark Grant
Honored Contributor

Re: ml350 internal 72i scsi tape problem

personally, I'd suspect the device or the cable. However, you might try the latest red hat kernel which may have an updated driver for the tape drive.
Never preceed any demonstration with anything more predictive than "watch this"
Steven E. Protter
Exalted Contributor

Re: ml350 internal 72i scsi tape problem

causes in rough order of priority

1) Problem with drive.
2) problem with cable
3) problem with scsi termination
4) problem with scsi card.

You need to check all of these things with utilities or eyeballs.

SEP
Steven E Protter
Owner of ISN Corporation
http://isnamerica.com
http://hpuxconsulting.com
Sponsor: http://hpux.ws
Twitter: http://twitter.com/hpuxlinux
Founder http://newdatacloud.com
Dineshkumar Surpur
Frequent Advisor

Re: ml350 internal 72i scsi tape problem

The abort command here is due to the fact
that there was no reply from the device on
the status of write command and driver
has timed out on that write resulting in kernel panic. You can do the following

1. Try backup for a small GB and see if it
is successful
2. Update the tape device firmware and driver
update.

Most probably i feel its a device problem
rather then a termination or cable issue.
Andrea Rossi
Frequent Advisor

Re: ml350 internal 72i scsi tape problem

they found out the jumpers on the tape were wrong.
Anyway after tape substitution the system hanged on Tuesday night, before the backup.
Tonight backup was successfull, dispite the fact I found "kernel: scsi : aborting command due to timeout" in /var/log/messages.

Andrea Rossi
Frequent Advisor

Re: ml350 internal 72i scsi tape problem

the system hangs once a day!
what's happening?
the hardware test was successfull.

P.S. I found
mtrr: your CPUs had inconsistent fixed MTRR settings
mtrr: probably your BIOS does not setup all CPUs

in /var/log/messages.