- Community Home
- >
- Servers and Operating Systems
- >
- HPE BladeSystem
- >
- BladeSystem - General
- >
- Server(s) crash after mem upgrade, varying results
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Forums
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО01-18-2010 10:58 AM
тАО01-18-2010 10:58 AM
Server(s) crash after mem upgrade, varying results
SYSTEM_FIRMWARE_ERROR
MEM_ECC_ERROR_UNCORRECTABLE
Thinking a bad dimm, I removed the new memory and restarted the server. No more problems with that server. The next day, a 2nd blade reacted the same, so I removed the new memory from that server, and since the other 2 in that chassis with new memory were also displaying memory errors, but hadn't crashed. I shut them down and removed the new memory as well. The 5th server, in another chassis, was showing only the common single-bit errors, but I removed that new memory as well. I opened a case with HP and was told that either ALL the new memory was bad and that the current firmware on the blades had a known power distribution issue with memory and either or both could be the culprit.
I was given the 5th server, the one in a different chassis to use as a test. It had firmware older than the other 4. Also, I noticed that the 2 I had upgraded successfully months ago, had NEWER firmware than these. I loaded up the server in the separate chassis with 48GB memory, fully populated and exercised it for days with STM, no problems, no errors other than the single-bit correctable errors I see commonly. I then upgraded this server to the same firmware as the problem servers and exercised for days, still no issues. So, now I'm perplexed and don't know how to proceed.
I will definitely upgrade all blades to the current firmware and I will replace all the new memory from the vendor, but since I could not duplicate the problem with the test server, I have an uneasy feeling about giving this my stamp of approval, especially since these are production servers.
I will add a reply with the firmware versions of each below.
Any ideas?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО01-18-2010 11:03 AM
тАО01-18-2010 11:03 AM
Re: Server(s) crash after mem upgrade, varying results
MP FW : T.02.17
BMC FW : 05.20
EFI FW : ROM A 06.20, ROM B 06.20
System FW : ROM A 03.02, ROM B 03.02, Boot ROM A
PDH FW : 50.07
UCIO FW : 03.0b
PRS FW : 00.08 UpSeqRev:02,DownSeqRev: 05
PIC FW : 00.05
4 blades in same chassis as servers above, 2 crashed, 2 were about to crash
MP FW : T.02.17
BMC FW : 05.20
EFI FW : ROM A 06.20, ROM B 06.20
System FW : ROM A 03.01, ROM B 03.01, Boot ROM A
PDH FW : 50.07
UCIO FW : 03.0b
PRS FW : 00.08 UpSeqRev:02,DownSeqRev: 05
PIC FW : 00.05
1 blade in separate chassis, used to test with no issues, older firmware
MP FW : T.02.17
BMC FW : 05.20
EFI FW : ROM A 05.67, ROM B 06.20
System FW : ROM A 01.01, ROM B 03.01, Boot ROM B
PDH FW : 50.07
UCIO FW : 03.0b
PRS FW : 00.08 UpSeqRev:02,DownSeqRev: 05
PIC FW : 00.05
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО01-18-2010 11:41 AM
тАО01-18-2010 11:41 AM
Re: Server(s) crash after mem upgrade, varying results
Added support for "B" memory DIMMs.
Maybe this is the reason.
Upgrade and test.
What memory was installed (product number)?
Hope this helps!
Regards
Torsten.
__________________________________________________
There are only 10 types of people in the world -
those who understand binary, and those who don't.
__________________________________________________
No support by private messages. Please ask the forum!
If you feel this was helpful please click the KUDOS! thumb below!
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО01-18-2010 01:08 PM
тАО01-18-2010 01:08 PM
Re: Server(s) crash after mem upgrade, varying results
Part NO. AB566AX
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-18-2010 08:03 AM
тАО02-18-2010 08:03 AM
Re: Server(s) crash after mem upgrade, varying results
Did the upgrade of your firmware resolved the problem? I'm having the same problem with new blades we purchased.