1828340 Members
3231 Online
109976 Solutions
New Discussion

Re: Error Log

 
Daniel Duarte
Frequent Advisor

Error Log

Please, All

When I run (TAR XVF SHCTRU64V4-V351-build606.tar) the hardware entry in Halt.
Show in console these messages.
Too Many processor correct error detected on CPU0.
Machine Check processor error Abort
Machine Fatal Error 0x100000098

Verify error log (binary.errlog) the show results.
Logging OS 2. Digital UNIX
System Architecture 2. Alpha
Event sequence number 2.
Timestamp of occurrence 05-APR-2004 15:38:33
Host name ds10

System type register x00000022 Systype 34. (Regatta Family)
Number of CPUs (mpnum) x00000001
CPU logging event (mperr) x00000000

Event validity 1. O/S claims event is valid
Event severity 5. Low Priority
Entry type 310. Time Stamp
-1. - (minor class)

stdin: END
scsi2 at itpsa0 slot 0
rz16 at scsi2 target 0 lun 0 (LID=1) (COMPAQ BD0366459B B016)
(Wide16)
rz17 at scsi2 target 1 lun 0 (LID=2) (COMPAQ BF01864663 3B0B)
(Wide16)
Created FRU table binary error log packet
lvm0: configured.
lvm1: configured.
kernel console: ace0
dli: configured
ATM Subsystem configured with 1 restart threads
ATM IFMP: configured
i2c: Server Management Hardware Present
ATMUNI: configured
ATMSIG: 3.x (module=uni3x) configured
ILMI: 3.x (module=ilmi) configured
ATM IP: configured
ATM LANE: configured.

I already re-did memory bank, although I still having the same problem.

Thank you in advance for any HELP.

Daniel Duarte

13 REPLIES 13
Michael Schulte zur Sur
Honored Contributor

Re: Error Log

Hi,

please reseat the cpu(s),

greetings,

Michael
Daniel Duarte
Frequent Advisor

Re: Error Log

The equipament that I'm using is DS10, and de processor is fixed on the motherboard.

Michael Schulte zur Sur
Honored Contributor

Re: Error Log

Oops,

I hope, you have a maintenance contract, cause it may be the cpu.

do you have decevent?
You are running 4.0X, right?

hth,

Michael
Daniel Duarte
Frequent Advisor

Re: Error Log

Michael

I'm not sure that it's realeted with CPU.
The error show when I run the command tar in a file greater than 30 MB.
I have garantee in hardware, that I must have sure before open a ticket.
I have DecEvent, any command in special?

Daniel Duarte





Michael Schulte zur Sur
Honored Contributor

Re: Error Log

Hi,

use dia -R
to see the details of the error
Could you post that?

In my oppinion it is either cpu or memory.
Please open a call.

greetings,

Michael
Daniel Duarte
Frequent Advisor

Re: Error Log

Michael

I was running TEST in the HALT prompt and this error message appear.

Machine Check Logout Frame @ 0x6000 code = 0x86

Alpha 21264 iprs (cpu0):
...
Processor corrrectable error througth vector 630

Attached is file uerf.tst

Thanks
Michael Schulte zur Sur
Honored Contributor

Re: Error Log

Hi,

you don't seem to have the dia command, right? I would open a call with HP. This is occurring to often to garuantee you a good and safe performance.

greetings,

Michael
Daniel Duarte
Frequent Advisor

Re: Error Log

Michael

Excuse me, following is attached in file dia.tst.

Only last events.

Thanks
Michael Schulte zur Sur
Honored Contributor

Re: Error Log

Daniel,

unfortunately the panic was not in that listing. I could just see the last boot and configuration. can you please try again?
Just entries with errors.

thanks,

Michael
Daniel Duarte
Frequent Advisor

Re: Error Log

Michael

Now new file dia.tst

Thanks
Johan Brusche
Honored Contributor

Re: Error Log

Daniel,

Since this is a DS10, nor uerf or dia will be able to decode the binary.errlog info in this errorlog packet.

Only Compaq Analyze ie. "ca", nowadays also called "wsea" by the newHP, will be able to tell us anything sensible. These commands are part of the WEBES software kit on your Associated Products CD's.

This tar-file is undoubtfully checking the health of your system ;-)

Cheers,
Johan.

_JB_
Mobeen_1
Esteemed Contributor

Re: Error Log

Daniel,
I have seen these errors many a times and they seem to be related to some form of correctable errors on your CPU.

Too Many processor correct error detected on CPU0.
Machine Check processor error Abort
Machine Fatal Error 0x100000098


If i were you i would log a call with HP and have them analyse the logs and then let them determine if any CPU replacement is needed.

All the best

regards
Mobeen
Mohamed  K Ahmed
Trusted Contributor

Re: Error Log

Daniel,
I agree, there is an entry that says
panic (cpu 0): Processor Machine Check

you might have correctable errors "detected" by the cpu0, and it means it is eaither the CPU or memory having some kind of problem.

Log a call and getit replaced

Mohamed