Operating System - OpenVMS
1753419 Members
4848 Online
108793 Solutions
New Discussion юеВ

Re: CPU error for MICROVAX 3100

 
prasad_35
Occasional Advisor

CPU error for MICROVAX 3100


Hi when I do a "show err " I did notice couple of errors as below :

Device Error Coun
CPU 1
PAA0: 2
PAB0: 1
PTA0: 1
$5$MIA5: (TF867B) 22

Upon searching for the CPU error this what I could see.. Can some pleasae let me know how critical this error is and how to resolve this.



******************************* ENTRY 55. *******************************
ERROR SEQUENCE 16784. LOGGED ON: SID 13000202
DATE/TIME 12-OCT-2006 05:06:33.79 SYS_TYPE 05230E01
SYSTEM UPTIME: 43 DAYS 11:48:31
SCS NODE: ASCENT VAX/VMS V6.2

INT54 ERROR KA681-AA CPU Microcode Rev # 2. CONSOLE FW REV# 2.3
Standard Microcode Patch Patch Rev # 1.

REVISION 00000000
SYSTAT 00000001
ATTEMPTING RECOVERY
FLAGS 00000002
KA681-AA subpacket

KA681 REGISTER SUBPACKET

BPCR ECC80024
PAMODE 00000000
30 bit physical address mode
MMEPTE 8995D314
MMESTS 1C008000
PCSCR 01000200
Patchable control store enabled
standard microcode patch
CPU microcode Patch Rev # = 1.
ICSR 00000001
virtual instruction cache enabled
ECR 000000CA
full external interval timer enabled
TBSTS 800001D0
PCCTL FFFFFE13
pcache enabled for D-stream reference
pcache enabled for I-stream reference
pcache parity error detection enabled
pcache redundancy elements enabled
PCSTS FFFFF800
CCTL 00000007
backup cache enabled
bcache tag ram speed:
read = 4 cycles, write = 4 cycles
bcache data ram speed:
read = 3 cycles, write = 4 cycles
128 kilobyte backup cache
BCEDSTS 00000000
BCETSTS 000000E2
BCACHE TAG CORRECTABLE ECC ERROR
data stream tag lookup
MESR 000CB000
MMCDSR 0994DE00
2600 cycles before disown write tmeout
enable logging soft errors
CQBIC on CP_I02
CESR 00000000
CMCDSR 0000C108
DMA prefetching enabled
3200 Cycles Before NDAL Timeout
144 cycles before cp1 mt timeout
144000 cycles before cp2 mt timeout
cp1 interrupts pending:
none
cp2 interrupts pending:
none
CEFSTS 00019200
NESTS 00000000
NEOCMD 0000F004
NEICMD 00000000
DSER 00000000
CBTCR 00004000
CDAL Bus Timeout Interval = 16.384 MSEC
MMEADR 00008AFC
VMAR 000007E0
TBADR 00000000
PCADR FFFFFFF8
BCEDIDX 00000000
BCEDECC 00000000
BCETIDX 07A3CFA0
bcache tag error address
Index = 0E7D
Tag = 3D1(X)
BCETAG 07AB1200
Tag block in error valid
bcache tag: stored check bits = 22(X)
bcache error tag = 3D5(X)
MEAR 08406010
MOAMR 00000000

CSEAR1 00000000
CSEAR2 00000000
CIOEAR1 0000465C
CIOEAR2 00000300
CNEAR 00000000
CEFADR E0000300
NEOADR 07B69CE0
NEDATHI 00000000
NEDATLO 00000000
QBEAR 0000000C
DEAR 00000000
IPCR0 0020
Local Memory External Access Enabled
4 REPLIES 4
John Gillings
Honored Contributor

Re: CPU error for MICROVAX 3100

Prasad,

Here's the simple description:

> BCACHE TAG CORRECTABLE ECC ERROR

So, some data path in a cache has dropped a bit, which has been detected and corrected by ECC.

Since it's a VAX, this is very old hardware. You have to expect the occasional error. The hardware itself has fixed the error, so, from an application perspective there is no problem.

The criticality depends on what the system does. If it's running a nuclear power plant, I'd be concerned, and would look at upgrading the hardware. On the other hand if the system has a less critical role, I'd make a note that it's happened and watch for repeats. It may be a proverbial cosmic ray, nothing to do with the age of the hardware, just a fluke event which will never be repeated. On the other hand, it might be the first indicator of the hardware failing through old age.

About the only way to "fix" this type of problem in a 3100 is to find another one and replace the hardware, but any 3100 you find will probably be in a similar condition and may well suffer the same fault (on the other hand, they're likely to be cheap, if not free!). Another option is to upgrade the hardware, either to Alpha or Integrity, or, if you're stuck on an OpenVMS version as ancient as your hardware, maybe consider one of the VAX emulators?
A crucible of informative mistakes
Doug Phillips
Trusted Contributor

Re: CPU error for MICROVAX 3100

If this system has been up all the time since then, it's probably a transient (like John suggested.)

I have a 3100 that just sits around serving up some stuff that I don't have anyplace else (because of licensing) and it does things like that occasionally. (I do have a spare 3100 just in case.)

Maybe a power or temperature glitch. If you haven't had the cover off for a while, you could have a dust-bunny infestation. One time I had a memory error and found some books stacked up against the one of the vents. That was years ago and the old beast is still humming away.
prasad_35
Occasional Advisor

Re: CPU error for MICROVAX 3100


Thanks for the responses.

Prasad
Jan van den Ende
Honored Contributor

Re: CPU error for MICROVAX 3100

Prasad,

To say "thanks" the Forum way:


http://forums1.itrc.hp.com/service/forums/helptips.do?#33

Mind, I do NOT say you necessarily need to give lots of points. It is fully up to _YOU_ to decide how many. If you consider an answer is not deserving any points, you can also assign 0 ( = zero ) points, and then that answer will no longer be counted as unassigned.
Consider, that every poster took at least the trouble of posting for you!

To easily find your streams with unassigned points, click your own name somewhere.
This will bring up your profile.
Near the bottom of that page, under the caption "My Question(s)" you will find "questions or topics with unassigned points " Clicking that will give all, and only, your questions that still have unassigned postings.
If you have closed some of those streams, you must "Reopen" them to "Submit points". (After which you can "Close" again)

Do not forget to explicitly activate "Submit points", or your effort gets lost again!!

Thanks on behalf of your Forum colleagues.

PS. - Zero points for this.

Proost.

Have one on me.

jpe
Don't rust yours pelled jacker to fine doll missed aches.