Integrity Servers
cancel
Showing results for 
Search instead for 
Did you mean: 

itanium rx2620 error mca

 
Homsan
Occasional Advisor

itanium rx2620 error mca

We are getting error logs (from dmesg) in a few of our itanium rx2620. I
would like to know if the error is a 2-bit ECC error or 1-bit and which is
the faulty DIMM. I did look at various manuals of intel and i believe it is
a 1-bit error but I am not certian. And I do not know how to decifer the
card and module etc to find which is the faulty DIMM so as to replace it.

Thanks very much in advance! Stelios

Here is the error log:

+BEGIN HARDWARE ERROR STATE AT CPE
+Err Record ID: 522987 SAL Rev: 0.02
+Time: 10/02/2007 13:26:34 Severity 0
+Platform Memory Device Error Info Section
+ Mem Error Detail: Error Status: 0x441000, Physical Address: 0x40407ecd60, Card: 66, Module: 48, ,Responder Address: 0xfed00000, Bus Specific Data: 0x20,
OEM Memory Controller ID: 2b 12 00 00 00 00 00 00 00 00 00 00 00 00 00 00
OEM Specific Data: 00 00 00 00 00 00 00 00 aa ac 73 14 93 dc 8d 47 97 42 aa 85 4f 51 0c fb ff 1d 00 00 00 00 00 00 17 01 00 00 00 00 00 00 80 00 00 00 00 00 00 00 80 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 64 cd 7e 40 80 18 00 ff 00 00 08 00 00 00 00 00 00 00 00 00 00 00 00 00 c0 c4 79 e4 40 00 00 e0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 c0 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ce 4e d5 40 00 70 08 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
+END HARDWARE ERROR STATE AT CPE
+BEGIN HARDWARE ERROR STATE AT CMC
+Err Record ID: 71961197 SAL Rev: 0.02
+Time: 10/02/2007 13:26:34 Severity 2
+Processor Device Error Info Section
Processor Error Map: 0x1000000
Processor State Param: 0x0
Processor LID: 0x1000000
+ BUS Check Info [0]
+ Status Info: 0 ,Severity: 0 ,Transaction Type: 3 ,Transaction Size: 0 ,Error: External
+END HARDWARE ERROR STATE AT CMC
5 REPLIES 5
Torsten.
Acclaimed Contributor

Re: itanium rx2620 error mca

Please run

echo "selclass qualifier memory;info;wait;infolog" | /usr/sbin/cstm

and post the results.

Hope this helps!
Regards
Torsten.

__________________________________________________
There are only 10 types of people in the world -
those who understand binary, and those who don't.

__________________________________________________
No support by private messages. Please ask the forum!

If you feel this was helpful please click the KUDOS! thumb below!   
Homsan
Occasional Advisor

Re: itanium rx2620 error mca

hi...
i canot find file name cstm in /usr/sbin
my OS is RHAS 2.1 , where i can find this file ..??

thank's
Torsten.
Acclaimed Contributor

Re: itanium rx2620 error mca

The command was for hp-ux, but you can try this from efi:

> pdt
> info mem
> info warning

Hope this helps!
Regards
Torsten.

__________________________________________________
There are only 10 types of people in the world -
those who understand binary, and those who don't.

__________________________________________________
No support by private messages. Please ask the forum!

If you feel this was helpful please click the KUDOS! thumb below!   
Sameer_Nirmal
Honored Contributor

Re: itanium rx2620 error mca

Within running OS environment, you can run "salinfo" to decode the SAL information. If this utility is not installed, you can download it from RedHat website.

In the offline mode, you can use IPF ODE CD ,E-diag tool "memdiag.efi"
From EFI shell prompt shell> errdump cpe
Homsan
Occasional Advisor

Re: itanium rx2620 error mca

i have replace my memory.