
tombstone

 
JOHN TURNER_2
Frequent Advisor


Hi all

I have just received a tombstone that needs analysis. I think the problem is memory related, but can someone have a quick look and advise?

HPMC Chassis Codes

Chassis Code Extension
------------ ---------
0x0000082000ff6242 0x0000000000000000
0x1800082011006342 0xcb83000000000000
0x0000087000ff6292 0x0000000000000000
0x6000082070006062 0x0000000000000010
0x7000082070006082 0x0000000000392400
0x7000082379006133 0xc1bff0fffed08040
0x0000080080006310 0x0000000000000001
0x000008008000631f 0x0000000000000000
0x0000082000ff6462 0x0000000000000000
0x0000080080006300 0x0000000000000001
0x7000082382006343 0x0000000000070200
0x7000082382016343 0x0000000000070200
0x7000082382026343 0x0000000000070200
0x7000082382036343 0x0000000000070200
0x7000082382046343 0x0000000000070200
0x7000082382056343 0x0000000000070200
0x7000082382066343 0x0000000000070200
0x7000082382076343 0x0000000000070200
0x0000080089006200 0x0000000000000000
0x0000080086006200 0x0000000000000000
0x000008008000630f 0x0000000000000000
0x0000080080006360 0x0000000000000000
0x000008008000636f 0x0000000000000000


General Registers 0 - 31
00-03 0000000000000000 0000000000000010 000000000016edb4 0000000043f2c100
04-07 0000000043fe5800 0000000000000002 0000000000000000 0000000000000000
08-11 00000000441cc000 000000007f780000 0000000000001000 000000007f781000
12-15 0000000000000002 0000000000000002 00000000006e4140 0000000000000002
16-19 0000000000000002 0000000000000002 0000000000000002 0000000000000018
20-23 0000000000000020 0000000000000040 0000000000000060 000000007f780840
24-27 0000000000000010 00000000441cc840 000000000800001f 0000000000895488
28-31 00000000000fee78 0000000000000000 400003ffffff1510 000000000001fab0


Control Registers 0 - 31
00-03 000000007feaf5b3 0000000000000000 0000000000000000 0000000000000000
04-07 0000000000000000 0000000000000000 0000000000000000 0000000000000000
08-11 000000000000c8b3 0000000000005d5e 00000000000000c0 000000000000003f
12-15 0000000000000000 0000000000000000 000000000002c000 fffffff0ffffffff
16-19 0000fa8de5b8b3e3 0000000000000000 0000000000037118 000000002f209004
20-23 0000000000340019 00000000561cd840 000000000804001b 0000000000000000
24-27 000000000080da80 000000000465d1d6 000000007f7807a8 000000007f7ac038
28-31 0000000000000003 0000fa8de5b41d63 00000000c0027403 000000007f7f32c0

Space Registers 0 - 7
00-03 090efc00 0cfb7000 00000000 00000000
04-07 00000000 07990c00 0b938800 00000000


IIA Space (back entry) = 0x0000000000000000
IIA Offset (back entry) = 0x000000000003711c
Check Type = 0x20000000
CPU State = 0x9e000004
Cache Check = 0x00000000
TLB Check = 0x00000000
Bus Check = 0x00105035
Assists Check = 0x00000000
Assist State = 0x00000000
Path Info = 0x00000000
System Responder Address = 0x0000000000000000
System Requestor Address = 0xfffffffffffa0000


Floating Point Registers 0 - 31
00-03 0800000000000000 0000000000000000 0000000000000000 0000000000000000
04-07 4000355a00000000 0000000000000000 0000000000000000 0000000000000000
08-11 0000000000000000 0000000000000000 0000000000000000 0000000000000000
12-15 5555555555555555 5555555555555555 5555555555555555 5555555555555555
16-19 5555555555555555 5555555555555555 5555555555555555 5555555555555555
20-23 5555555555555555 5555555555555555 000000004003c000 0000000000000000
24-27 0000200000000000 0000000000000000 0000000000000401 0000000000000000
28-31 0000000000000000 0000000000000000 8000000000000000 3ff0000000000000


Check Summary = 0xcb83000000000000
Available Memory = 0x0000000040000000
CPU Diagnose Register 2 = 0x0204000004802204
CPU Status Register 0 = 0x6420c24000000000
CPU Status Register 1 = 0x8000000000000000
SADD LOG = 0x0000000000000080
Read Short LOG = 0xc18080fff8020014


-------------- Memory Error Log Information --------------

Bus 0 Log Information

Timestamp = Fri Jul 7 16:45:12 GMT 2006 (20:06:07:07:16:45:12)

OV RQ RS ESTAT A C D corr unc fe cw pf
-- -- -- ----- - - - ---- --- -- -- --
X ERR_ERROR X X

Bus Requestor Address = 0xfffffffffffa0000
Bus Target Address = 0x0000000000000000
Bus Responder Address = 0xfffffffffed00000

Error Status Reg = 0x0000000000000010
Runway Control Reg = 0x0000021c00000018
Runway Address Reg = 0xc1bff0fffed08040
Runway Data High Reg = 0xf8018a1cf820ca01
Runway Data Low Reg = 0xf8018a1cf820ca01
Memory Address Reg = 0x000001ff3fffffff
Memory Address Corr Reg = 0x000001ff3fffffff
Memory Syndrome Reg = 0x0000000000000000
Memory Syndrome Corr Reg = 0x0000000000000000



Address/Control Parity Error Registers

Address/Control Parity Error Bit (mem_addr_par_stat) Not Set



------------ I/O Module Error Log Information ------------

Summary of IO subsystem log entries
-----------------------------------
Phys Loc Vendor Device Severity
Description (hex) Id Id CORR UNC FE CW
----------- ----- ------ ------ ----------------
System Bus Adapter RP 0x000000ffff01ff83 0x103c 0x1051 X
System Bus Adapter RP 0x000000ffff06ff83 0x103c 0x1051 X
System Bus Adapter RP 0x000000ffff08ff83 0x103c 0x1051 X
System Bus Adapter RP 0x000000ffff0aff83 0x103c 0x1051 X
System Bus Adapter RP 0x000000ffff0cff83 0x103c 0x1051 X
System Bus Adapter RP 0x000000ffff07ff83 0x103c 0x1051 X
System Bus Adapter RP 0x000000ffff09ff83 0x103c 0x1051 X
System Bus Adapter RP 0x000000ffff0bff83 0x103c 0x1051 X


Detail display of IO subsystem log entries
------------------------------------------

System Bus Adapter -- Rope Interface
------------------------------------------

Timestamp = Fri Jul 7 16:45:21 GMT 2006 (20:06:07:07:16:45:21)

OV RQ RS ESTAT A C D corr unc fe cw pf
-- -- -- ----- - - - ---- --- -- -- --
ERR_FUNCTION X

IO Requestor Address = 0x0000000000000000
IO Target Address = 0x0000000000000000
IO Responder Address = 0x0000000000000000
IO Physical Location = 0x000000ffffffff82
IO Hardware Path = 0x00ffffffffffff00

Module Error Register = 0x0000000000000000
Rope Physical Location = 0x000000ffff01ff83

System Bus Adapter -- Rope Interface
------------------------------------------

Timestamp = Fri Jul 7 16:45:21 GMT 2006 (20:06:07:07:16:45:21)

OV RQ RS ESTAT A C D corr unc fe cw pf
-- -- -- ----- - - - ---- --- -- -- --
ERR_FUNCTION X

IO Requestor Address = 0x0000000000000000
IO Target Address = 0x0000000000000000
IO Responder Address = 0x0000000000000000
IO Physical Location = 0x000000ffffffff82
IO Hardware Path = 0x00ffffffffffff00

Module Error Register = 0x0000000000000000
Rope Physical Location = 0x000000ffff06ff83

System Bus Adapter -- Rope Interface
------------------------------------------

Timestamp = Fri Jul 7 16:45:21 GMT 2006 (20:06:07:07:16:45:21)

OV RQ RS ESTAT A C D corr unc fe cw pf
-- -- -- ----- - - - ---- --- -- -- --
ERR_FUNCTION X

IO Requestor Address = 0x0000000000000000
IO Target Address = 0x0000000000000000
IO Responder Address = 0x0000000000000000
IO Physical Location = 0x000000ffffffff82
IO Hardware Path = 0x00ffffffffffff00

Module Error Register = 0x0000000000000000
Rope Physical Location = 0x000000ffff08ff83

System Bus Adapter -- Rope Interface
------------------------------------------

Timestamp = Fri Jul 7 16:45:21 GMT 2006 (20:06:07:07:16:45:21)

OV RQ RS ESTAT A C D corr unc fe cw pf
-- -- -- ----- - - - ---- --- -- -- --
ERR_FUNCTION X

IO Requestor Address = 0x0000000000000000
IO Target Address = 0x0000000000000000
IO Responder Address = 0x0000000000000000
IO Physical Location = 0x000000ffffffff82
IO Hardware Path = 0x00ffffffffffff00

Module Error Register = 0x0000000000000000
Rope Physical Location = 0x000000ffff0aff83

System Bus Adapter -- Rope Interface
------------------------------------------

Timestamp = Fri Jul 7 16:45:21 GMT 2006 (20:06:07:07:16:45:21)

OV RQ RS ESTAT A C D corr unc fe cw pf
-- -- -- ----- - - - ---- --- -- -- --
ERR_FUNCTION X

IO Requestor Address = 0x0000000000000000
IO Target Address = 0x0000000000000000
IO Responder Address = 0x0000000000000000
IO Physical Location = 0x000000ffffffff82
IO Hardware Path = 0x00ffffffffffff00

Module Error Register = 0x0000000000000000
Rope Physical Location = 0x000000ffff0cff83

System Bus Adapter -- Rope Interface
------------------------------------------

Timestamp = Fri Jul 7 16:45:21 GMT 2006 (20:06:07:07:16:45:21)

OV RQ RS ESTAT A C D corr unc fe cw pf
-- -- -- ----- - - - ---- --- -- -- --
ERR_FUNCTION X

IO Requestor Address = 0x0000000000000000
IO Target Address = 0x0000000000000000
IO Responder Address = 0x0000000000000000
IO Physical Location = 0x000000ffffffff82
IO Hardware Path = 0x00ffffffffffff00

Module Error Register = 0x0000000000000000
Rope Physical Location = 0x000000ffff07ff83

System Bus Adapter -- Rope Interface
------------------------------------------

Timestamp = Fri Jul 7 16:45:21 GMT 2006 (20:06:07:07:16:45:21)

OV RQ RS ESTAT A C D corr unc fe cw pf
-- -- -- ----- - - - ---- --- -- -- --
ERR_FUNCTION X

IO Requestor Address = 0x0000000000000000
IO Target Address = 0x0000000000000000
IO Responder Address = 0x0000000000000000
IO Physical Location = 0x000000ffffffff82
IO Hardware Path = 0x00ffffffffffff00

Module Error Register = 0x0000000000000000
Rope Physical Location = 0x000000ffff09ff83

System Bus Adapter -- Rope Interface
------------------------------------------

Timestamp = Fri Jul 7 16:45:21 GMT 2006 (20:06:07:07:16:45:21)

OV RQ RS ESTAT A C D corr unc fe cw pf
-- -- -- ----- - - - ---- --- -- -- --
ERR_FUNCTION X

IO Requestor Address = 0x0000000000000000
IO Target Address = 0x0000000000000000
IO Responder Address = 0x0000000000000000
IO Physical Location = 0x000000ffffffff82
IO Hardware Path = 0x00ffffffffffff00

Module Error Register = 0x0000000000000000
Rope Physical Location = 0x000000ffff0bff83


Module Revision
------ --------
System Board A43938
PA 8500 CPU Module 2.4

cheers

john
GUI's are for wimps!
Sandman!
Honored Contributor

Re: tombstone

Hi John,

You can verify the memory errors by going through the memory log file in cstm as follows:

1. go into cstm
# /usr/sbin/cstm
2. run logtool utility
cstm> ru logtool
3. view the memory log file
Logtool Utility> vd

~hope it helps
Bill Hassell
Honored Contributor

Re: tombstone

Here's a 1-liner to show all the current memory details:

echo "selclass qualifier memory;info;wait;infolog" | cstm
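
If you want to keep a copy each time you run it, the one-liner can be wrapped roughly as below. This is only a sketch: the CSTM and OUTDIR variables and the collect_mem_info name are made up for illustration and are not cstm settings, and the output file name is arbitrary.

```shell
#!/bin/sh
# Sketch: run the memory one-liner and save the output to a timestamped file.
# CSTM and OUTDIR are illustrative variables, not part of cstm itself.
CSTM=${CSTM:-/usr/sbin/cstm}
OUTDIR=${OUTDIR:-/tmp}

collect_mem_info() {
    # Bail out early if cstm is not installed where we expect it.
    if [ ! -x "$CSTM" ]; then
        echo "cstm not found at $CSTM" >&2
        return 1
    fi
    out="$OUTDIR/cstm_mem.$(date +%Y%m%d%H%M%S).out"
    # Same command string as the one-liner above.
    echo "selclass qualifier memory;info;wait;infolog" | "$CSTM" > "$out" || return 1
    # Print the name of the file that was written.
    echo "$out"
}
```

Calling collect_mem_info prints the path of the file it wrote, so it can be mailed or attached to a support call.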


Bill Hassell, sysadmin
Mridul Shrivastava
Honored Contributor

Re: tombstone

The following commands can also be used to collect memory errors:

# echo "gop cstmpager cat;scl type mem;info;wait;il"|cstm > /tmp/mem.out
# echo "gop cstmpager cat;ru l\nvd\n"|cstm >> /tmp/mem.out
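
Alongside the cstm output, it may be worth checking which of the check words in the tombstone itself are non-zero. A minimal sketch, assuming the tombstone has been saved to a text file with lines of the form "Name = 0xVALUE" as in the dump posted above (the function name and the file name are hypothetical):

```shell
#!/bin/sh
# Sketch: print the Cache/TLB/Bus/Assists check words that are non-zero.
# Assumes "Name = 0xVALUE" lines, as in the tombstone posted in this thread.
nonzero_checks() {
    # $1: path to a saved tombstone text file (hypothetical, e.g. /tmp/tombstone.txt)
    awk -F' = ' '
        /^(Cache|TLB|Bus|Assists) Check = 0x/ {
            v = $2
            # Drop the 0x prefix, every zero digit and any whitespace;
            # whatever is left means the word is non-zero.
            gsub(/[0x \t\r]/, "", v)
            if (v != "") print $1 " = " $2
        }' "$1"
}
```

Run as nonzero_checks /tmp/tombstone.txt against the dump in the first post, this prints only the Bus Check line (0x00105035), since the Cache, TLB and Assists check words are all zero there.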
Time has a wonderful way of weeding out the trivial