Operating System - Tru64 Unix
cancel
Showing results for 
Search instead for 
Did you mean: 

Re: Alpha Server 8200 System Crash

 
SOLVED
Go to solution
Highlighted
Regular Advisor

Alpha Server 8200 System Crash

Hi all,

We have a frequent crash at Alpha Server 8200. Tru64 V5.1A is running at this machine and when system crash the following message appear :
"Signing disks CPU09 unexpected machine check through vector 060000000 processor machine
CPU 8 halted
halt code = 7
machine check while in PAL mode
PC=18100"

Can anybody help ?????
9 REPLIES 9
Highlighted
Honored Contributor

Re: Alpha Server 8200 System Crash

Hi,

did the machine create a crash dump? If so could you post crash_data as attachment?
This looks like a case for a HP call.

greetings,

Michael
Highlighted
Regular Advisor

Re: Alpha Server 8200 System Crash

Thank you.

Find attached the crash dump files.
Please do your best.
Highlighted
Honored Contributor

Re: Alpha Server 8200 System Crash

Hi,

this is the cause line
panic (cpu 8): pciaerror

However I do not have any idea, what's behind it.

This type of error seems to have occurred already a long time ago as you can see here.
http://ftp.support.compaq.com.au/pub/patches/Digital_UNIX/v3.2c/duv32cas00004-19980414.html

Do you have decevent or webes on your machine? With dia -R you may see more information on this error. I hope you have a maintenance contract with HP.

greetings,

Michael
Highlighted
Honored Contributor

Re: Alpha Server 8200 System Crash


There is a hardware problem in this AS8200.

The symptoms point to a problem on a PCI bus, probably the one where the Gigabit interface card is sitting. Another candidate is the KFTIA I/O module.

So you will have to call the nearest service provider and ask them to fix the machine.

Johan.

_JB_
Highlighted
Regular Advisor

Re: Alpha Server 8200 System Crash

Dear all,
I checked the system and I got the attached log file by using DECevent (dia command).

Please advise. . .
Highlighted
Honored Contributor

Re: Alpha Server 8200 System Crash

Hi,

what patch kit do you run? Did you check compatibility of the hardware to 5.1?
Can anyone give a comment on this message in binary errorlog?
ERR 0 x00004009 ERROR SUMMARY
CSR OVERRUN ERROR
PCI NONEXISTENT ADDRESS ERROR

thanks,

Michael
Highlighted
Regular Advisor

Re: Alpha Server 8200 System Crash

Hi,

Yesterday I got another system crash . I checked the decevent and found something maybe help to diagnose the crash problem. I think there is a power regulator problem as you can see at my attached file. Can you agree with me. Please advise . . . .

Regards,

Emad Omar
Highlighted
Valued Contributor

Re: Alpha Server 8200 System Crash

"A CSR OVERRUN ERROR says that a CSR command packet on the downhose contained too many longwords. That suggests a hardware problem in the TIOP or DWLPB, or possibly a bad connection on the hose."

You probably want to log a hardware call to get this looked at. Could be a bad (hardware) module somewhere.

Disclaimer: I'm a software guy. :)

Greg
Highlighted
Frequent Advisor
Solution

Re: Alpha Server 8200 System Crash

hello emad,

can you please send me your binary.errlog file for this Alpha Server8200 to analyze it for giving you a proper answer.

my email is: aelsayed@ncs.com.kw

Best Regards,
Amr
Try To Be Smart