1833016 Members
3285 Online
110048 Solutions
New Discussion

Memory Problem

 
SOLVED
Go to solution
yyghp
Super Advisor

Memory Problem

I got the following when i used:
# echo "selclass qualifier memory;info;wait;infolog" | cstm


Hardware path: 8


Basic Memory Description

Module Type: MEMORY
Total Configured Memory : 4096 MB
Page Size: 4096 Bytes

Memory interleaving is supported on this machine and is ON.

Memory Board Inventory

DIMM Slot Size (MB)
--------- ---------
01 1024
02 1024
03 512
04 512
05 512
06 512
--------- ---------
System Total (MB): 4096

Memory Error Log Summary

Error
Board Error Address Error Type Page Count
------------- ----------------- ---------- --------- -----
04 0x0000000019384f00 Single-Bit 0x0019384 1
04 0x0000000018395f00 Single-Bit 0x0018395 1
04 0x0000000019389f00 Single-Bit 0x0019389 1
04 0x000000001cf45f00 Single-Bit 0x001cf45 1
04 0x000000001931f080 Single-Bit 0x001931f 2
04 0x00000000793c99c0 Single-Bit 0x00793c9 1
04 0x00000000793df9c0 Single-Bit 0x00793df 1
04 0x00000000793cb9c0 Single-Bit 0x00793cb 1
04 0x000000001cf019c0 Single-Bit 0x001cf01 1
04 0x00000000793c7280 Single-Bit 0x00793c7 1
04 0x000000007bf88fc0 Single-Bit 0x007bf88 48
04 0x00000000793da980 Single-Bit 0x00793da 1
04 0x00000000793cf200 Single-Bit 0x00793cf 1
04 0x00000000793ca1c0 Single-Bit 0x00793ca 1
04 0x00000000793dd240 Single-Bit 0x00793dd 1
04 0x000000001831cec0 Single-Bit 0x001831c 1
04 0x00000000793ce1c0 Single-Bit 0x00793ce 1
04 0x0000000019317fc0 Single-Bit 0x0019317 2
04 0x0000000018345d80 Single-Bit 0x0018345 18
04 0x00000000793c21c0 Single-Bit 0x00793c2 1
04 0x000000001cf44080 Single-Bit 0x001cf44 3
04 0x000000007b3c24c0 Single-Bit 0x007b3c2 1
04 0x0000000019344480 Single-Bit 0x0019344 1
04 0x0000000018398440 Single-Bit 0x0018398 1
04 0x000000007bf580c0 Single-Bit 0x007bf58 13
04 0x00000000793c59c0 Single-Bit 0x00793c5 1
04 0x00000000793d4240 Single-Bit 0x00793d4 1
04 0x000000001934cd40 Single-Bit 0x001934c 10
04 0x000000001931d540 Single-Bit 0x001931d 4
04 0x000000001cb46f00 Single-Bit 0x001cb46 1
04 0x00000000793dc980 Single-Bit 0x00793dc 1
04 0x000000001834b6c0 Single-Bit 0x001834b 40
04 0x000000001cf43fc0 Single-Bit 0x001cf43 13
04 0x0000000019343f00 Single-Bit 0x0019343 1
04 0x000000001cb49f00 Single-Bit 0x001cb49 1
04 0x000000001938e480 Single-Bit 0x001938e 1
04 0x000000001838ef00 Single-Bit 0x001838e 1
04 0x000000001838ff00 Single-Bit 0x001838f 1
04 0x000000001cf0dec0 Single-Bit 0x001cf0d 1
04 0x00000000793c6900 Single-Bit 0x00793c6 1
04 0x00000000793c0140 Single-Bit 0x00793c0 3
04 0x00000000793110c0 Single-Bit 0x0079311 482
04 0x0000000010f98280 Multi-Bit 0x0010f98 0
04 0x0000000000389000 Single-Bit 0x0000389 3
04 0x000000001478f9c0 Single-Bit 0x001478f 5
04 0x0000000007f9bf00 Single-Bit 0x0007f9b 1
04 0x0000000000306f00 Single-Bit 0x0000306 311
04 0x0000000000b97880 Single-Bit 0x0000b97 13
04 0x0000000000b9bb00 Single-Bit 0x0000b9b 6
04 0x0000000000394100 Single-Bit 0x0000394 7
04 0x0000000003f54600 Single-Bit 0x0003f54 1
04 0x000000007df80000 Single-Bit 0x007df80 1
01 000000000000000000 Single-Bit 000000000 1

System start: Wed Jun 30 13:05:44 2004.
Last error check: Mon Jul 5 15:58:54 2004.
Logging interval: 3600 seconds.
53 address(es) with errors logged by memory logging daemon.

The Logtool Utility provides full details about the memory error log.

Page Deallocation Table (PDT)

Slot/Set Error Address Error Type Page
---------------- --------------- ---------- ---------
04 0x0000000000389001 Single-Bit 0x0000389
04 0x0000000000306f01 Single-Bit 0x0000306
04 0x0000000000b97881 Single-Bit 0x0000b97
04 0x0000000000394101 Single-Bit 0x0000394
04 0x000000001478f9c1 Single-Bit 0x001478f
04 0x0000000000b9bb01 Single-Bit 0x0000b9b
04 0x0000000010f98280 Multi-Bit 0x0010f98
04 0x00000000793110c1 Single-Bit 0x0079311
04 0x000000001cf43fc1 Single-Bit 0x001cf43
04 0x000000001931d501 Single-Bit 0x001931d
04 0x000000001834b6c1 Single-Bit 0x001834b
04 0x000000007bf580c1 Single-Bit 0x007bf58
04 0x000000001cf449c1 Single-Bit 0x001cf44
04 0x000000001934cd41 Single-Bit 0x001934c
04 0x0000000018345d81 Single-Bit 0x0018345
04 0x00000000793c0981 Single-Bit 0x00793c0
04 0x000000007bf88fc1 Single-Bit 0x007bf88
04 0x000000001931f2c1 Single-Bit 0x001931f
04 0x0000000019317fc1 Single-Bit 0x0019317

PDT Entries Used: 19
PDT Entries Free: 31
PDT Total Size: 50



It seems something wrong with Slot 4, I switched the memory between Slot 4 and Slot 3, still have the same result, I guess something wrong with Slot 4, or mainboard problem...

Any suggestion for me ?
Thanks!
6 REPLIES 6
Geoff Wild
Honored Contributor

Re: Memory Problem

Yes - place a call with HP to investigate the hardware.

Rgds...Geoff
Proverbs 3:5,6 Trust in the Lord with all your heart and lean not on your own understanding; in all your ways acknowledge him, and he will make all your paths straight.
Anand_20
Advisor

Re: Memory Problem

Hi,

Can u give me the model # of the machine and also the copy of /var/tombstones/ts99 file - I shall decode for you and analyze the root cause.

anand
yyghp
Super Advisor

Re: Memory Problem

hi Anand:

Model# rp2470

and I have attached the file ts99 within this post, please check.

Thanks a lot !
Anand_20
Advisor
Solution

Re: Memory Problem

Here is the report

Looks like the problem with AMC Bus0 problem detected.

Report
------

PDC version has been found to be : 42.19
This HPMC appears to come from a Lclass system. No System Bus Adapter (IKE) has been found in the PIMDUMP
Because we have no IKE found, I will assume that this is a L1xx or L2xx.
Looks like this is a 2 CPUs system
All timestamps are valid and within each others ..
There is at least one chassis code different from 6322. Let s look at it
The CPU 1 has a chassis code different from 6322.
The second chassis code is equal to 6302 : Runway Path error.
Most likely we have a DIMM problem on this system .
Estat in CPU status0 indicates a Master Path Error
Most likely the problem comes from a failing DIMM. Let's try to decode some additional chassis code to identify the DIMM.
We could not find a chassis code to decode directly the DIMM. Let's look at the other chassis codes .
Found one chassis code describing a bad DIMM .
The chassis extension is : 0x0000ff000004ff74
The failing DIMM is : Carrier 00 Slot 04
Found one chassis code describing a bad DIMM .
The chassis extension is : 0x0000ff000004ff74
The failing DIMM is : Carrier 00 Slot 04
The most suspect part in this HPMC is : DIMM0004DIMM0004
yyghp
Super Advisor

Re: Memory Problem

thanks Anand !
So, what can i do now ?
And what kind of tool did you use to generate such report ?

thanks again !
Anand_20
Advisor

Re: Memory Problem

These results are decoded from wtec site- which is accessible only through HP network.

The next step is to replace the part.

Anand