Operating System - Tru64 Unix
1828371 Members
2991 Online
109976 Solutions
New Discussion

Alpha Server 8200 System Crash

 
SOLVED
Go to solution
Emad Omar
Regular Advisor

Alpha Server 8200 System Crash

Hi all,

We have a frequent crash at Alpha Server 8200. Tru64 V5.1A is running at this machine and when system crash the following message appear :
"Signing disks CPU09 unexpected machine check through vector 060000000 processor machine
CPU 8 halted
halt code = 7
machine check while in PAL mode
PC=18100"

Can anybody help ?????
9 REPLIES 9
Michael Schulte zur Sur
Honored Contributor

Re: Alpha Server 8200 System Crash

Hi,

did the machine create a crash dump? If so could you post crash_data as attachment?
This looks like a case for a HP call.

greetings,

Michael
Emad Omar
Regular Advisor

Re: Alpha Server 8200 System Crash

Thank you.

Find attached the crash dump files.
Please do your best.
Michael Schulte zur Sur
Honored Contributor

Re: Alpha Server 8200 System Crash

Hi,

this is the cause line
panic (cpu 8): pciaerror

However I do not have any idea, what's behind it.

This type of error seems to have occurred already a long time ago as you can see here.
http://ftp.support.compaq.com.au/pub/patches/Digital_UNIX/v3.2c/duv32cas00004-19980414.html

Do you have decevent or webes on your machine? With dia -R you may see more information on this error. I hope you have a maintenance contract with HP.

greetings,

Michael
Johan Brusche
Honored Contributor

Re: Alpha Server 8200 System Crash


There is a hardware problem in this AS8200.

The symptoms point to a problem on a PCI bus, probably the one where the Gigabit interface card is sitting. Another candidate is the KFTIA I/O module.

So you will have to call the nearest service provider and ask them to fix the machine.

Johan.

_JB_
Emad Omar
Regular Advisor

Re: Alpha Server 8200 System Crash

Dear all,
I checked the system and I got the attached log file by using DECevent (dia command).

Please advise. . .
Michael Schulte zur Sur
Honored Contributor

Re: Alpha Server 8200 System Crash

Hi,

what patch kit do you run? Did you check compatibility of the hardware to 5.1?
Can anyone give a comment on this message in binary errorlog?
ERR 0 x00004009 ERROR SUMMARY
CSR OVERRUN ERROR
PCI NONEXISTENT ADDRESS ERROR

thanks,

Michael
Emad Omar
Regular Advisor

Re: Alpha Server 8200 System Crash

Hi,

Yesterday I got another system crash . I checked the decevent and found something maybe help to diagnose the crash problem. I think there is a power regulator problem as you can see at my attached file. Can you agree with me. Please advise . . . .

Regards,

Emad Omar
Greg Yates
Valued Contributor

Re: Alpha Server 8200 System Crash

"A CSR OVERRUN ERROR says that a CSR command packet on the downhose contained too many longwords. That suggests a hardware problem in the TIOP or DWLPB, or possibly a bad connection on the hose."

You probably want to log a hardware call to get this looked at. Could be a bad (hardware) module somewhere.

Disclaimer: I'm a software guy. :)

Greg
amrelsayed
Frequent Advisor
Solution

Re: Alpha Server 8200 System Crash

hello emad,

can you please send me your binary.errlog file for this Alpha Server8200 to analyze it for giving you a proper answer.

my email is: aelsayed@ncs.com.kw

Best Regards,
Amr
Try To Be Smart