cancel
Showing results for 
Search instead for 
Did you mean: 

DL585 G2 MCE

SOLVED
Go to solution
rccmum
Super Advisor

DL585 G2 MCE

Hi,

Can anyone help me interpreting this MCE please?


HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 4 0 data cache TSC a4d4c13bd54b26
Data cache ECC error (syndrome 2a)
bit46 = corrected ecc error
bit57 = processor context corrupt
bit61 = error uncorrected
memory/cache error 'data write mem transaction, data transaction, level 1'
STATUS b615400000000145 MCGSTATUS 4

Thanks in advance.
6 REPLIES
Sameer_Nirmal
Honored Contributor
Solution

Re: DL585 G2 MCE

It looks like a problem with data cache on CPU 4 to me.

I would run the offline diagnostics on the box and see if it shows a problem with CPU 4.

rccmum
Super Advisor

Re: DL585 G2 MCE

Ran the insight offline diagnostics but it came up clean.

Do you know what is STATUS , MCGSTATUS , syndrome 2a ?
SMR
Valued Contributor

Re: DL585 G2 MCE

That looks indeed like a cpu cache problem, so typically an MCE shows the error reporting bank on its output.. is this a log file ran through mcelog? I would swap the cpu and check if the error follows the cpu. You can also try AMD's mcat tool to have a second opinion.
Sameer_Nirmal
Honored Contributor

Re: DL585 G2 MCE

I think the mce log should have a panic string above "HARDWARE ERROR".

In that case, you may use AMD's MCAT tool for the interpretation.
rccmum
Super Advisor

Re: DL585 G2 MCE

Thanks for the help.

Replaced the CPU 4 and that fixed the problem.
rccmum
Super Advisor

Re: DL585 G2 MCE

Replaced CPU4 and that fixed the problem.