Digital Unix 5.0 issues SCSI errors i am not sure what they are

whiteSox
Frequent Advisor

Digital Unix 5.0 issues SCSI errors i am not sure what they are

cam_logger: SCSI event packet
cam_logger: bus 0 target 0 lun 0
ss_perform_timeout
timeout on disconnected request
Active CCB at time of error
cam_logger: SCSI event packet
cam_logger: bus 0 target 0 lun 0
ss_perform_timeout
timeout on disconnected request
Active CCB at time of error
cam_logger: SCSI event packet
cam_logger: bus 0 target 0 lun 0
ss_perform_timeout
timeout on disconnected request
Active CCB at time of error
cam_logger: SCSI event packet
cam_logger: bus 0 target 0 lun 0
ss_perform_timeout
timeout on disconnected request
Active CCB at time of error
cam_logger: SCSI event packet
cam_logger: bus 0 target 0 lun 0
ss_perform_timeout
timeout on disconnected request
Active CCB at time of error
cam_logger: SCSI event packet
cam_logger: bus 0 target 0 lun 0
ss_perform_timeout
timeout on disconnected request
Active CCB at time of error
cam_logger: SCSI event packet
cam_logger: bus 0 target 0 lun 0
ss_perform_timeout
timeout on disconnected request
Active CCB at time of error
cam_logger: SCSI event packet
cam_logger: bus 0 target 0 lun 0
ss_perform_timeout
Reached max abort count, scheduled bus reset
Active CCB at time of error
cam_logger: SCSI event packet
cam_logger: bus 0 target 2 lun 0
ss_perform_timeout
timeout on disconnected request
Active CCB at time of error
cam_logger: SCSI event packet
cam_logger: bus 0
psiop_hardintrptidec5)
Bus reset detected
cam_logger: SCSI event packet
cam_logger: bus 0 target 2 lun 0
ss_perform_timeout
timeout on disconnected request
Active CCB at time of error

Does anyone know what these SCSI errors are? I am getting them on a Digital UNIX 5.0 (HP Tru64) system running on an Alpha 2000.

Thanks for your help.
16 REPLIES
cnb
Honored Contributor

Re: Digital Unix 5.0 issues SCSI errors i am not sure what they are

Looks like one of the devices on SCSI bus 0 is causing device timeouts across the SCSI chain. It could be the controller, a faulty device, a loose cable, or termination. Check the connections.

Take a look at the error log or post it here.

Do you have DECevent installed? If not, uerf may help to get an idea of what is happening.

Have you changed any device configuration?

Post the output of:

# scu show edt

hth

whiteSox
Frequent Advisor

Re: Digital Unix 5.0 issues SCSI errors i am not sure what they are

I am unable to log in ...
Steven Schweda
Honored Contributor

Re: Digital Unix 5.0 issues SCSI errors i am not sure what they are

> I am unable to log in ...

I am unable to guess why. Fingers broken?
Keyboard filled with super-glue?

Perhaps you could describe what you did, what
you did it with, and what happened when you
did it. Or else consult a psychic?
whiteSox
Frequent Advisor

Re: Digital Unix 5.0 issues SCSI errors i am not sure what they are

scu show edt

CAM Equipment Device Table (EDT) Information:

Bus/Target/Lun  Device Type  ANSI    Vendor ID  Product ID        Revision  N/W
--------------  -----------  ------  ---------  ----------------  --------  ---
 0    0    0    Direct       SCSI-2  DEC        RZ29B    (C) DEC  0016      W
 0    1    0    Direct       SCSI-2  DEC        RZ29B    (C) DEC  0016      N
 0    2    0    Direct       SCSI-2  DEC        RZ1DF-CB (C) DEC  0371      W
 0    5    0    Sequential   SCSI-2  DEC        TLZ09     (C)DEC  0167      N
 0    6    0    CD-ROM       SCSI-2  DEC        RRD45    (C) DEC  0436      N
 1    0    0    Direct       SCSI-2  DEC        RZ1DF-AB (C) DEC  0372      N
 1    2    0    Direct       SCSI-2  DEC        RZ1DF-AB (C) DEC  0372      N
 1    3    0    Direct       SCSI-2  DEC        RZ1DF-AB (C) DEC  0372      N
whiteSox
Frequent Advisor

Re: Digital Unix 5.0 issues SCSI errors i am not sure what they are

Please take a look at the attachment.
cnb
Honored Contributor

Re: Digital Unix 5.0 issues SCSI errors i am not sure what they are


WhiteSox:

Well, mixing *these* Wide and Narrow devices on the same bus is never a good idea.

On bus 0, disks 0 and 2 are Wide 16-bit, whereas all the other devices are Narrow 8-bit.

Suggest you get the hardware straightened out first.

BTW: Having a tape drive on the same SCSI bus as the system disk is generally not a good idea (if your system disk is also on Bus 0). But I know nothing.
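
For anyone who wants to check a longer device table the same way, here is a minimal Python sketch that flags any bus carrying both Narrow and Wide devices. It assumes the row layout of the `scu show edt` output posted above (bus, target and LUN first, the N/W flag last); the names `widths_by_bus` and `mixed_buses` are made up for this example, and the sample rows are simply copied from that post.

```python
import re
from collections import defaultdict

# Sample rows copied from the `scu show edt` output posted earlier in the thread.
EDT_ROWS = """\
0 0 0 Direct SCSI-2 DEC RZ29B (C) DEC 0016 W
0 1 0 Direct SCSI-2 DEC RZ29B (C) DEC 0016 N
0 2 0 Direct SCSI-2 DEC RZ1DF-CB (C) DEC 0371 W
0 5 0 Sequential SCSI-2 DEC TLZ09 (C)DEC 0167 N
0 6 0 CD-ROM SCSI-2 DEC RRD45 (C) DEC 0436 N
1 0 0 Direct SCSI-2 DEC RZ1DF-AB (C) DEC 0372 N
1 2 0 Direct SCSI-2 DEC RZ1DF-AB (C) DEC 0372 N
1 3 0 Direct SCSI-2 DEC RZ1DF-AB (C) DEC 0372 N
"""

# Assumed row shape: bus, target, lun come first; the N/W flag is the last field.
ROW_RE = re.compile(r"^\s*(\d+)\s+(\d+)\s+(\d+)\s+(\S+)\s+.*\s([NW])\s*$")


def widths_by_bus(text):
    """Map bus number -> {width flag -> [target IDs]} (hypothetical helper)."""
    buses = defaultdict(lambda: defaultdict(list))
    for line in text.splitlines():
        m = ROW_RE.match(line)
        if m:  # header and separator lines do not match and are skipped
            bus, target, _lun, _dtype, width = m.groups()
            buses[int(bus)][width].append(int(target))
    return {bus: dict(widths) for bus, widths in buses.items()}


def mixed_buses(text):
    """Return only the buses that carry both Narrow and Wide devices."""
    return {bus: w for bus, w in widths_by_bus(text).items() if len(w) > 1}


if __name__ == "__main__":
    for bus, w in sorted(mixed_buses(EDT_ROWS).items()):
        print(f"bus {bus}: wide targets {w.get('W', [])}, "
              f"narrow targets {w.get('N', [])}")
```

On the table in this thread, only bus 0 should be reported; bus 1 is all Narrow.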

Steven:

;-)

Always enjoy your posts!


hth

DCBrown
Frequent Advisor

Re: Digital Unix 5.0 issues SCSI errors i am not sure what they are

timeout on disconnected request

This means the SCSI command was delivered to 0/0/0 OK, then the device disconnected from the bus. When the disk is ready it should arbitrate for the bus and reselect the host to start the data transfer .... and this never happens. Host times out, falls down, goes boom. Rinse, repeat.

I also see the same thing on 0/2/0. A random guess is that the host can arbitrate for the bus, but the target device can't, or fails for some reason. I'd start by disconnecting all devices except 0/0/0 (which is probably not enough to boot successfully, but...) and see how far it gets. If there is still a problem, check termination and cables. Since there are two drives complaining about the same thing, it *could* be one of these drives that has broken and mucked up the bus... but that is less likely than one of the other devices. Any devices added or changed on the bus lately? Wrong SCSI IDs? Too much termination? If just having 0/0/0 on the bus "works", then start adding devices back in until it breaks.
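
To see at a glance which devices are complaining, and how often, the console messages can be tallied per bus/target/lun. The following is only a rough Python sketch: it assumes each packet looks like the ones pasted in the first post (an address line, a routine name, then a message line), the function name `tally_events` is invented for this example, and the sample text is a shortened excerpt of that post.

```python
import re
from collections import Counter

# Shortened excerpt of the console messages from the first post.
SAMPLE = """\
cam_logger: SCSI event packet
cam_logger: bus 0 target 0 lun 0
ss_perform_timeout
timeout on disconnected request
Active CCB at time of error
cam_logger: SCSI event packet
cam_logger: bus 0 target 0 lun 0
ss_perform_timeout
timeout on disconnected request
Active CCB at time of error
cam_logger: SCSI event packet
cam_logger: bus 0 target 0 lun 0
ss_perform_timeout
Reached max abort count, scheduled bus reset
Active CCB at time of error
cam_logger: SCSI event packet
cam_logger: bus 0 target 2 lun 0
ss_perform_timeout
timeout on disconnected request
Active CCB at time of error
cam_logger: SCSI event packet
cam_logger: bus 0
psiop_hardintrptidec5)
Bus reset detected
"""

# Address line; target and lun are absent on bus-level events.
ADDR_RE = re.compile(r"^cam_logger: bus (\d+)(?: target (\d+) lun (\d+))?$")
BOUNDARIES = ("cam_logger: SCSI event packet", "Active CCB")


def tally_events(text):
    """Count (address, last message line) pairs, one per event packet."""
    counts = Counter()
    addr = message = None

    def flush():
        if addr and message:
            counts[(addr, message)] += 1

    for raw in text.splitlines():
        line = raw.strip()
        m = ADDR_RE.match(line)
        if m:
            bus, target, lun = m.groups()
            addr = f"{bus}/{target}/{lun}" if target is not None else f"bus {bus}"
            message = None
        elif line.startswith(BOUNDARIES):
            flush()  # a packet ends here (or a new one begins)
            addr = message = None
        elif addr and line:
            message = line  # keep the last detail line as the message
    flush()  # the final packet may have no trailing boundary line
    return counts


if __name__ == "__main__":
    for (addr, msg), n in tally_events(SAMPLE).most_common():
        print(f"{n:3d}  {addr:8s} {msg}")
```

Feeding it the full console output rather than this excerpt should show whether the timeouts are confined to 0/0/0 and 0/2/0 or spread across the bus.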
DCBrown
Frequent Advisor

Re: Digital Unix 5.0 issues SCSI errors i am not sure what they are

And yeah, mixing wide and narrow devices should force all communication on the bus to *ALL* devices to be narrow, which really hurts wide disk performance.
DCBrown
Frequent Advisor

Re: Digital Unix 5.0 issues SCSI errors i am not sure what they are

Re: tape on same bus.

Be aware that some tape I/O has upwards of a 90-minute timeout value, and during looong tape transfers some tape drives won't give up the bus. Whether it splashes over and hurts other devices trying to share the same bus depends on the tape drive, the backup software, and the transfer size.

I'm assuming there is no fair arbitration hub or anything like that.
whiteSox
Frequent Advisor

Re: Digital Unix 5.0 issues SCSI errors i am not sure what they are

Which log files will tell me what is going on? The server came back up and I am trying to understand what happened. Thanks for your help.
cnb
Honored Contributor

Re: Digital Unix 5.0 issues SCSI errors i am not sure what they are

Check the error log.

DECevent is used for error log translation on AS2000 systems. If it isn't installed, then uerf may give you an idea of what is going on.

# man dia

-or-

# man uerf

Since SCSI IDs 0 and 2 are the ones logging events, and they are the only WIDE devices on the bus, logic would indicate investigating those devices first.

"psiop_hardintrptidec5..." > Controller Hard interrupt

Depending upon the controller, controller firmware and disk drive firmware, these wide devices may not play well on a narrow SCSI chain. In addition, as stated earlier, the timeout could also be caused by anything on SCSI bus 0.

If you're not comfortable troubleshooting/investigating this, place a call to your service provider.

hth
whiteSox
Frequent Advisor

Re: Digital Unix 5.0 issues SCSI errors i am not sure what they are

Does anyone know what this error event means?

----- EVENT INFORMATION -----

EVENT CLASS                 ERROR EVENT
OS EVENT TYPE          199. CAM SCSI
SEQUENCE NUMBER         23.
OPERATING SYSTEM            DEC OSF/1
OCCURRED/LOGGED ON          Thu Jul 2 05:03:51 2009
OCCURRED ON SYSTEM          xyzzzz
SYSTEM ID                   x00050009   CPU TYPE: DEC 2100
SYSTYPE                     x00000000
PROCESSOR COUNT          2.
PROCESSOR WHO LOGGED        x00000001

----- UNIT INFORMATION -----

CLASS                       x0022       DEC SIM
SUBSYSTEM                   x0000       DISK
BUS #                       x0000
                            x0000       LUN x0
                                        TARGET x0
whiteSox
Frequent Advisor

Re: Digital Unix 5.0 issues SCSI errors i am not sure what they are

cnb,

The server has been running for about 4 to 5 years with the same configuration. Do you know what that error event means? Is there a problem with the motherboard or the disk drives?
cnb
Honored Contributor

Re: Digital Unix 5.0 issues SCSI errors i am not sure what they are

Something in the disk subsystem. Disk 0 is still reporting *some* event, but there is not enough information here to say more.

Read the guides and install DECevent to get a more thorough description of what is going on.

http://h18000.www1.hp.com/support/svctools/decevent/index.html

FWIW: There's a lot more to this system than a "Motherboard" and disks. System Module, CPU modules, Memory modules, Bridge, I/O, etc...
http://h18000.www1.hp.com/alphaserver/archive/2000/2000_tech.html
whiteSox
Frequent Advisor

Re: Digital Unix 5.0 issues SCSI errors i am not sure what they are

Does anyone know if this OS is supported by HP?
Rob Leadbeater
Honored Contributor

Re: Digital Unix 5.0 issues SCSI errors i am not sure what they are

Hi,

5.0 hasn't been supported for a long time:

http://h30097.www3.hp.com/pdf/FINAL_Tru64UNIX_Policy_Page_24March2009.pdf

You might want to post the output of "sizer -v" so we can see exactly what version you're running...

Cheers,

Rob

P.S. Please remember to assign points to replies...