HP StorageWorks Ultrium 1760 hangs machine

Wanping
Occasional Visitor

The drive is an HP StorageWorks Ultrium 1760 (LTO) with 1.6TB tapes. The tape
operates normally for a period; then, while the tape isn't in operation,
the machine hangs and /var/log/messages contains:

Aug 30 14:55:56 aws1 kernel: irq 169: nobody cared! (screaming interrupt?)
Aug 30 14:55:56 aws1 kernel: irq 169: Please try booting with acpi=off and report a bug
Aug 30 14:55:56 aws1 kernel: [] __report_bad_irq+0x3a/0x77
Aug 30 14:55:56 aws1 kernel: [] note_interrupt+0xea/0x115
Aug 30 14:55:56 aws1 kernel: [] do_IRQ+0x143/0x1ae
Aug 30 14:55:56 aws1 kernel: [] common_interrupt+0x18/0x20
Aug 30 14:55:56 aws1 kernel: [] mwait_idle+0x33/0x42
Aug 30 14:55:56 aws1 kernel: [] cpu_idle+0x26/0x3b
Aug 30 14:55:56 aws1 kernel: handlers:
Aug 30 14:55:56 aws1 kernel: [] (usb_hcd_irq+0x0/0x4b)
Aug 30 14:55:56 aws1 kernel: [] (ahd_linux_isr+0x0/0x1cf[aic79xx])
Aug 30 14:55:56 aws1 kernel: [] (ata_interrupt+0x0/0x1eb[libata])
Aug 30 14:55:56 aws1 kernel: Disabling IRQ #169

but the machine carries on. Then sometime later:

Aug 30 21:53:25 aws1 kernel: scsi1:0:3:0: Attempting to abort cmd d1627500: 0x0 0x0 0x0 0x0 0x0 0x0
Aug 30 21:53:25 aws1 kernel: scsi1:0:3:0: Command already completed
Aug 30 21:53:35 aws1 kernel: scsi1:0:3:0: Attempting to abort cmd d1627500: 0x0 0x0 0x0 0x0 0x0 0x0
Aug 30 21:53:35 aws1 kernel: scsi1:0:3:0: Command already completed
Aug 30 21:53:35 aws1 kernel: Recovery code sleeping
Aug 30 21:53:40 aws1 kernel: Recovery code awake
Aug 30 21:53:40 aws1 kernel: Timer Expired
Aug 30 21:53:40 aws1 kernel: scsi1: Device reset returning 0x2003
Aug 30 21:53:40 aws1 kernel: Recovery SCB completes

and that's the last entry before the reboot.
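
For readers following the first trace above: the "handlers:" list shows which interrupt handlers share the IRQ that the kernel disabled. Below is a minimal Python sketch that pulls that list out of a log excerpt. The function name shared_handlers and the regular expressions are illustrative assumptions written against the message format quoted in this post, not part of any kernel or HP tool. Whether the disabled IRQ is connected to the later SCSI aborts is not established here; the sketch only makes the sharing visible.

```python
import re

# Patterns written against the log lines quoted in this thread (assumed format).
IRQ_RE = re.compile(r"irq (\d+): nobody cared")
DISABLED_RE = re.compile(r"Disabling IRQ #(\d+)")
# Matches e.g. "(ahd_linux_isr+0x0/0x1cf[aic79xx])"; the module part is optional.
HANDLER_RE = re.compile(r"\((\w+)\+0x[0-9a-f]+/0x[0-9a-f]+(?:\s*\[(\w+)\])?\)")


def shared_handlers(lines):
    """Return {irq: [(handler, module_or_None), ...]} per 'nobody cared' report."""
    reports = {}
    current = None        # IRQ number of the report being read, if any
    in_handlers = False   # True once the "handlers:" line has been seen
    for line in lines:
        m = IRQ_RE.search(line)
        if m:
            current = int(m.group(1))
            reports.setdefault(current, [])
            in_handlers = False
            continue
        if current is None:
            continue
        if "handlers:" in line:
            in_handlers = True
            continue
        if DISABLED_RE.search(line):
            # End of this report.
            current = None
            in_handlers = False
            continue
        if in_handlers:
            h = HANDLER_RE.search(line)
            if h:
                reports[current].append((h.group(1), h.group(2)))
    return reports


SAMPLE = """\
Aug 30 14:55:56 aws1 kernel: irq 169: nobody cared! (screaming interrupt?)
Aug 30 14:55:56 aws1 kernel: irq 169: Please try booting with acpi=off and report a bug
Aug 30 14:55:56 aws1 kernel: [] do_IRQ+0x143/0x1ae
Aug 30 14:55:56 aws1 kernel: handlers:
Aug 30 14:55:56 aws1 kernel: [] (usb_hcd_irq+0x0/0x4b)
Aug 30 14:55:56 aws1 kernel: [] (ahd_linux_isr+0x0/0x1cf[aic79xx])
Aug 30 14:55:56 aws1 kernel: [] (ata_interrupt+0x0/0x1eb[libata])
Aug 30 14:55:56 aws1 kernel: Disabling IRQ #169
""".splitlines()

for irq, handlers in shared_handlers(SAMPLE).items():
    for name, module in handlers:
        print(irq, name, module or "(built in or unknown)")
```

In this excerpt the list includes the aic79xx handler, so the SCSI adapter driver is one of the three handlers on the disabled line.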

I tried turning ACPI off and reducing the speed of the SCSI bus to 160,
all to no avail.

Kernel details:

[root@aws1 ~]# uname -a
Linux delhi-aws1 2.6.9-67.ELsmp #1 SMP Wed Nov 7 13:58:04 EST 2007 i686 i686 i386 GNU/Linux

-----------------------------
I hope someone can help me solve this problem.
2 REPLIES
Curtis Ballard
Honored Contributor

Re: HP StorageWorks Ultrium 1760 hangs machine

I don't see anything there that I can diagnose, but if you can pull the logs from the drive shortly after a failure, they may show something additional, such as unusual SCSI bus activity.

It looks like the OS is attempting to abort a TEST UNIT READY command (the command bytes in the log are all zero, and opcode 0x00 is TEST UNIT READY), which is a pretty trivial "are you alive and ready" ping. That is not typically something where an error would occur.
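
For anyone who wants to check that reading of the log, here is a minimal Python sketch that takes an "Attempting to abort cmd" line and names the command from its first byte. The function name decode_abort_line, the regular expression, and the small opcode table are illustrative assumptions, not part of any HP or kernel tool. Reading scsi1:0:3:0 as host:channel:target:lun is also an assumption about the driver's message format.

```python
import re

# A small subset of SCSI opcodes (first CDB byte) relevant to tape drives.
# Extend as needed; anything not listed is reported as unknown.
OPCODES = {
    0x00: "TEST UNIT READY",
    0x01: "REWIND",
    0x03: "REQUEST SENSE",
    0x08: "READ(6)",
    0x0A: "WRITE(6)",
    0x12: "INQUIRY",
}

# Pattern written against the abort lines quoted in this thread (assumed format).
ABORT_RE = re.compile(
    r"scsi(?P<host>\d+):(?P<chan>\w+):(?P<target>\d+):(?P<lun>\d+): "
    r"Attempting to abort cmd \S+: (?P<cdb>(?:0x[0-9a-fA-F]+\s*)+)"
)


def decode_abort_line(line):
    """Return a dict describing the aborted command, or None if no match."""
    m = ABORT_RE.search(line)
    if m is None:
        return None
    cdb = [int(b, 16) for b in m.group("cdb").split()]
    return {
        "host": int(m.group("host")),
        "target": int(m.group("target")),
        "lun": int(m.group("lun")),
        "cdb": cdb,
        "command": OPCODES.get(cdb[0], "unknown opcode 0x%02x" % cdb[0]),
    }


LINE = ("Aug 30 21:53:25 aws1 kernel: scsi1:0:3:0: Attempting to abort "
        "cmd d1627500: 0x0 0x0 0x0 0x0 0x0 0x0")
print(decode_abort_line(LINE))
```

The table is deliberately short; the point is only that a first byte of 0x0 maps to TEST UNIT READY.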

HP Library and Tape Tools is the utility for pulling the drive logs, using its "Support Ticket" function. It works on several Linux versions, but I don't recognize yours, so I'm not certain whether it would work for you. It is available at http://www.hp.com/support/tapetools
Wanping
Occasional Visitor

Re: HP StorageWorks Ultrium 1760 hangs machine

Thanks, Curtis. The Linux version we are using is Red Hat 4.6. I noticed that a new firmware version was published early this year. I was wondering whether updating the firmware would help.