Operating System - Tru64 Unix
1752610 Members
4008 Online
108788 Solutions
New Discussion юеВ

Re: Alpha Server 8200 System Crash

 
SOLVED
Go to solution
Emad Omar
Regular Advisor

Alpha Server 8200 System Crash

Hi all,

We have a frequent crash at Alpha Server 8200. Tru64 V5.1A is running at this machine and when system crash the following message appear :
"Signing disks CPU09 unexpected machine check through vector 060000000 processor machine
CPU 8 halted
halt code = 7
machine check while in PAL mode
PC=18100"

Can anybody help ?????
9 REPLIES 9
Michael Schulte zur Sur
Honored Contributor

Re: Alpha Server 8200 System Crash

Hi,

did the machine create a crash dump? If so could you post crash_data as attachment?
This looks like a case for a HP call.

greetings,

Michael
Emad Omar
Regular Advisor

Re: Alpha Server 8200 System Crash

Thank you.

Find attached the crash dump files.
Please do your best.
Michael Schulte zur Sur
Honored Contributor

Re: Alpha Server 8200 System Crash

Hi,

this is the cause line
panic (cpu 8): pciaerror

However I do not have any idea, what's behind it.

This type of error seems to have occurred already a long time ago as you can see here.
http://ftp.support.compaq.com.au/pub/patches/Digital_UNIX/v3.2c/duv32cas00004-19980414.html

Do you have decevent or webes on your machine? With dia -R you may see more information on this error. I hope you have a maintenance contract with HP.

greetings,

Michael
Johan Brusche
Honored Contributor

Re: Alpha Server 8200 System Crash


There is a hardware problem in this AS8200.

The symptoms point to a problem on a PCI bus, probably the one where the Gigabit interface card is sitting. Another candidate is the KFTIA I/O module.

So you will have to call the nearest service provider and ask them to fix the machine.

Johan.

_JB_
Emad Omar
Regular Advisor

Re: Alpha Server 8200 System Crash

Dear all,
I checked the system and I got the attached log file by using DECevent (dia command).

Please advise. . .
Michael Schulte zur Sur
Honored Contributor

Re: Alpha Server 8200 System Crash

Hi,

what patch kit do you run? Did you check compatibility of the hardware to 5.1?
Can anyone give a comment on this message in binary errorlog?
ERR 0 x00004009 ERROR SUMMARY
CSR OVERRUN ERROR
PCI NONEXISTENT ADDRESS ERROR

thanks,

Michael
Emad Omar
Regular Advisor

Re: Alpha Server 8200 System Crash

Hi,

Yesterday I got another system crash . I checked the decevent and found something maybe help to diagnose the crash problem. I think there is a power regulator problem as you can see at my attached file. Can you agree with me. Please advise . . . .

Regards,

Emad Omar
Greg Yates
Valued Contributor

Re: Alpha Server 8200 System Crash

"A CSR OVERRUN ERROR says that a CSR command packet on the downhose contained too many longwords. That suggests a hardware problem in the TIOP or DWLPB, or possibly a bad connection on the hose."

You probably want to log a hardware call to get this looked at. Could be a bad (hardware) module somewhere.

Disclaimer: I'm a software guy. :)

Greg
amrelsayed
Frequent Advisor
Solution

Re: Alpha Server 8200 System Crash

hello emad,

can you please send me your binary.errlog file for this Alpha Server8200 to analyze it for giving you a proper answer.

my email is: aelsayed@ncs.com.kw

Best Regards,
Amr
Try To Be Smart