Integrity Servers

RX4640 Memory Config Error

 
Mark_Woodward
Occasional Contributor

I recently powered on my RX4640 server and found that it halts with the system error LED flashing red and the "Memory Config Error" diagnostic LED lit. No other diagnostic LEDs are lit (except "check event log").

No memory has been added or removed. I have tried reseating all 4 DIMMs and even changing their order, but the same error persists.

The output from the MP system event log is shown below. It stays exactly the same irrespective of the order of the DIMMs: the MEM_CHIPSPARE_DEALLOC_RANK message always refers to the DIMM in slot 0D, even when I change the order of the DIMMs.

The only other thing that has happened is that the battery on the main board failed. It has now been replaced, but the error persists. Could the NVRAM be corrupt, and if so, how can I reset it?

Does anyone have any ideas? The system event log output follows.

0     SFW  1   2  0x5480006301E007E0 0000000000000000 BOOT_START    00:00:14
1     SFW  1   0  0x1600001D01E00000 001F020200000001 BOOT_CPU_CONFIG
2     SFW  0   0  0x1600001D00E00000 001F020200000000 BOOT_CPU_CONFIG
3     SFW  1   0  0x1600000E01E00000 0000000000032000 BOOT_CELL_MONARCH_SEL_START
4     SFW  0   0  0x1600000E00E00000 0000000000032000 BOOT_CELL_MONARCH_SEL_START
5     SFW  0   1  0x3600000C00E00000 0000000000000000 BOOT_CELL_MONARCH
6     SFW  0   1  0x3600026100E00000 0000000000000000 BOOT_CPU_PRESENT
7     SFW  1   1  0x3600026101E00000 0000000000000001 BOOT_CPU_PRESENT
8     SFW  0   0  0x160015B200E00000 0000000001860729 BOOT_TIME_EVENT
9     SFW  0   0  0x0000000800E00000 0000000000000000 BOOT_CELL_CONFIG_START
10    SFW  0   0  0x0000005600E00000 0000000000000000 BOOT_SCR_TEST_START
11    SFW  1   0  0x0300005D01E00000 0000000000000002 BOOT_SLAVE_RENDEZ_HANDLER_START
12    SFW  0   0  0x0000024B00E00000 0000000000000000 BOOT_EARLY_PLATFORM_CHECK
13    BMC      2  0x20441B81D7020800 FFFF0103FDC00300 Type-02 c00301 12583681
                                                      18 Mar 2006 03:43:19
14    BMC      2  0x20441B81D8020810 FFFF010B4F090300 Type-02 090b01 592641
                                                      18 Mar 2006 03:43:20
15    SFW  0   0  0x0300000600E00000 0000000000000000 BOOT_BUS_CONFIG_VALUE
16    SFW  0   0  0x1600004400E00000 01000000001E0400 BOOT_NEW_BUS_CONFIG_VALUE
17    BMC      2  0x20441B81C3020820 FFFF027000120300 Type-02 127002 1208322
                                                      18 Mar 2006 03:42:59
18    SFW  0   0  0x168002C500E00830 0000000000000000 BOOT_REBOOT
                                                      18 Mar 2006 03:43:01
19    SFW  1   0  0x160002C501E00000 0000000000000000 BOOT_REBOOT
20    SFW  1   0  0x1600001D01E00000 001F020200000001 BOOT_CPU_CONFIG
21    SFW  0   0  0x1600001D00E00000 001F020200000000 BOOT_CPU_CONFIG
22    SFW  1   0  0x1600000E01E00000 0000000000032000 BOOT_CELL_MONARCH_SEL_START
23    SFW  0   0  0x1600000E00E00000 0000000000032000 BOOT_CELL_MONARCH_SEL_START
24    SFW  0   1  0x3600000C00E00000 0000000000000000 BOOT_CELL_MONARCH
25    SFW  0   1  0x3600026100E00000 0000000000000000 BOOT_CPU_PRESENT
26    SFW  1   1  0x3600026101E00000 0000000000000001 BOOT_CPU_PRESENT
27    SFW  1   0  0x0300005D01E00000 0000000000000002 BOOT_SLAVE_RENDEZ_HANDLER_START
28    SFW  0   0  0x160015B200E00000 0000000001195652 BOOT_TIME_EVENT
29    SFW  0   0  0x0000000800E00000 0000000000000000 BOOT_CELL_CONFIG_START
30    SFW  0   0  0x0000024B00E00000 0000000000000000 BOOT_EARLY_PLATFORM_CHECK
31    SFW  0   0  0x0300000600E00000 01000000001E0400 BOOT_BUS_CONFIG_VALUE
32    SFW  0   0  0x0000024C00E00000 0000000000000000 BOOT_PLATFORM_CHECK
33    SFW  0   0  0x160002AF00E00000 000000000000001F SETTING_PROC_TIMEOUT
34    SFW  0   0  0x160015B200E00000 0000000008743418 BOOT_TIME_EVENT
35    SFW  0   0  0x000000B100E00000 0000000000000000 MEM_DISCOVERY
36    SFW  0   0  0x000000C600E00000 0000000000000000 MEM_INIT_SCR_TABLES
37    SFW  0   1  0x200000FE00E00000 0000000000000000 MEM_WARN_REG_TEST_BYPASS
38    SFW  0   0  0x000000EC00E00000 0000000000000000 MEM_SPD_START
39    SFW  0   0  0x000000A600E00000 0000000000000000 MEM_CONFIG_FROM_NVM
40    SFW  0   0  0x040000E300E00000 FFFFFFFF000AFF74 MEM_SPD_1G_DIMM_FOUND
41    SFW  0   0  0x040000E300E00000 FFFFFFFF000BFF74 MEM_SPD_1G_DIMM_FOUND
42    SFW  0   0  0x040000E300E00000 FFFFFFFF000CFF74 MEM_SPD_1G_DIMM_FOUND
43    SFW  0   0  0x040000E300E00000 FFFFFFFF000DFF74 MEM_SPD_1G_DIMM_FOUND
44    SFW  0   0  0x0000020500E00000 0000000000000000 MEM_LOADING_ORDER
45    SFW  0   0  0x000000B200E00000 0000000000000000 MEM_DISCOVERY_EXIT
46    SFW  0   0  0x200012DC00E00000 0000000000000000 MEM_EARLY_CONFIG
47    SFW  0   0  0x000000C900E00000 0000000000000000 MEM_MAIN_MEM
48    SFW  0   0  0x000000C200E00000 0000000000000000 MEM_GENERATE_INTERLEAVING
49    SFW  0  *3  0x64800FA000E00850 FFFFFFFF000DFF74 MEM_CHIPSPARE_DEALLOC_RANK
                                                      18 Mar 2006 03:43:10
50    SFW     *5  0xC1441B81CE020870 FF3F4070000F0300 Type-02 0f7000 1011712
                                                      18 Mar 2006 03:43:10
51    SFW  0  *7  0xE08000D100E00880 0000000000000000 MEM_NO_MEM_FOUND
                                                      18 Mar 2006 03:43:10
52    SFW     *5  0xC1441B81CE0208A0 FF3F4070000F0300 Type-02 0f7000 1011712
                                                      18 Mar 2006 03:43:10
53    SFW  0  *7  0xF480003700E008B0 000000000000000F BOOT_HALT_CELL
                                                      18 Mar 2006 03:43:10
54    SFW  1   0  0x1600005E01E00000 00000009FEF208BF BOOT_SLAVE_RENDEZ_INT_RECEIVED
55    MP   0   2  0x5E800A7A00E008D0 0000000000000003 MP_SELFTEST_RESULT
                                                      18 Mar 2006 03:57:18

   -> This is the last entry in the selected log.

Thanks

Mark

3 REPLIES 3
hvhari
Esteemed Contributor
Solution

Re: RX4640 Memory Config Error

This indicates that a memory problem caused the firmware to de-allocate the rank. Re-seating is the first thing to try, and you have already done that. One of the four modules may be faulty. If you have spare DIMMs, try swapping them in one at a time.

You could also try the HP Guided Troubleshooting information at

 http://h20180.www2.hp.com/apps/Nav?h_audiencerestrict=true&h_client=s-h-e010-1&h_product=top&lang=en&cc=in&h_cc=in&h_lang=en&h_audience=smb&h_pagetype=s-905

Regards,
Hari

Robert_Jewell
Honored Contributor

Re: RX4640 Memory Config Error

I think this is more likely a memory carrier problem (a hardware fault that is fixed by replacing that component). The fact that you can move the DIMMs around without the error location changing supports this. Since the whole quad is being deconfigured and you only have the four DIMMs, the system is left with no usable memory.
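
If you want to confirm that the error location really does stay put, one option is to save the MP log after each DIMM shuffle and pull out just the flagged entries. Below is a rough Python sketch of that idea, not an HP tool. The column layout, the reading of the asterisk as "alert level 3 or higher", and the position of the slot byte in the data word are all assumptions inferred from the log posted in this thread (where the 0A to 0D bytes line up with the four DIMMs and with Mark's reading of slot 0D). The names `parse_log`, `flagged_events` and `slot_guess` are made up for illustration.

```python
import re
from typing import List, NamedTuple

# Layout inferred from the log in this thread (an assumption, not HP documentation):
# entry number, source, optional CPU number, optional '*', alert level,
# two 64-bit hex words, then the event keyword.
ENTRY = re.compile(
    r"^\s*(\d+)\s+(\w+)\s+(?:(\d+)\s+)?(\*?)(\d+)\s+"
    r"0x([0-9A-Fa-f]{16})\s+([0-9A-Fa-f]{16})\s+(.*\S)\s*$"
)


class Event(NamedTuple):
    number: int
    source: str
    alert: int
    flagged: bool   # True when the alert level carries a '*'
    data: str       # second hex word, as printed
    keyword: str


def parse_log(text: str) -> List[Event]:
    """Return one Event per log entry; timestamp-only lines are skipped."""
    events = []
    for line in text.splitlines():
        m = ENTRY.match(line)
        if m:
            num, src, _cpu, star, alert, _hdr, data, kw = m.groups()
            events.append(Event(int(num), src, int(alert), star == "*", data, kw))
    return events


def flagged_events(events: List[Event]) -> List[Event]:
    """Keep only the entries the MP marked with an asterisk."""
    return [e for e in events if e.flagged]


def slot_guess(data: str) -> str:
    """Assumed slot byte: hex digits 11-12 of the data word (e.g. '0D')."""
    return data[10:12].upper()


# A few entries copied from the log above.
SAMPLE = """\
43    SFW  0   0  0x040000E300E00000 FFFFFFFF000DFF74 MEM_SPD_1G_DIMM_FOUND
49    SFW  0  *3  0x64800FA000E00850 FFFFFFFF000DFF74 MEM_CHIPSPARE_DEALLOC_RANK
                                                      18 Mar 2006 03:43:10
51    SFW  0  *7  0xE08000D100E00880 0000000000000000 MEM_NO_MEM_FOUND
53    SFW  0  *7  0xF480003700E008B0 000000000000000F BOOT_HALT_CELL
"""

if __name__ == "__main__":
    for e in flagged_events(parse_log(SAMPLE)):
        print(e.number, e.alert, e.keyword, e.data)
```

Running this over the log from each DIMM arrangement and comparing `slot_guess` for the MEM_CHIPSPARE_DEALLOC_RANK entry would show whether the fault follows a DIMM or stays with the slot. If it stays with the slot, that points at the carrier rather than a module.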

 

If you don't have maintenance for this system, a cheaper workaround might be to install four more DIMMs. Bank 0 would still fail, but the system may be able to boot with bank 1 operational.

 

-Bob

 

 

Mark_Woodward
Occasional Contributor

Re: RX4640 Memory Config Error

Thanks all,

I've managed to get hold of another 4 GB of memory and installed it in place of the old DIMMs. It all works now.
Mark