BladeSystem - General
cancel
Showing results for 
Search instead for 
Did you mean: 

ProLiant BL460c unexpected/random reboots with Redhat Linux 4.5 64bit

SOLVED
Go to solution
MJ99
Occasional Visitor

ProLiant BL460c unexpected/random reboots with Redhat Linux 4.5 64bit

Hi Everyone,

We are experiencing an issue with our one blade rebooting - its rebooted 7 times in the last two months unexpectedly. I am trying to find the root cause because we have the same blade setup for our production servers.

OS Details:
command: uname -a
Linux dbserver.domainname.com 2.6.9-55.0.0.0.2.ELsmp #1 SMP Wed May 2 15:06:32 PDT 2007 x86_64 x86_64 x86_64 GNU/Linux
command: cat /etc/redhat-release
Enterprise Linux Enterprise Linux AS release 4 (October Update 5)

Rack: BladeSystem c7000
- Firmware Version 2.25

Blade: Slot 9 - ProLiantBL460c G1
- ROM Version: I15 04/01/2008
- iLO Model: iLO2
- iLO Firmware Version 1.50 Mar 12 2008
- 2 CPUs Quad-Core Intel Xeon, 2333 MHz
- Memory 16384 MB
- Mezzanine Card Information: QLogic QMH2462 4Gb FC HBA for HP c-Class BladeSystem (Slot 2)


I am attaching the latest messages log file. The reboot happened around Aug 19 4:06 AM. The log doesn't show anything related to the system going down for reboot.

In the iLO 2 Log for this device I only see two events around the 4 AM timeframe.

Informational iLO 2 08/19/2009 04:04 08/19/2009 04:04 1 Server power restored.
Caution iLO 2 08/19/2009 04:04 08/19/2009 04:04 1 Server reset.

Any insight / comments would be greatly appreciated.

Thanks. MJ

4 REPLIES
JKytsi
Honored Contributor
Solution

Re: ProLiant BL460c unexpected/random reboots with Redhat Linux 4.5 64bit

Some user in server room forgot that it was in your use ?

Well ... the standard troubleshooting with x86 servers starts with updating lates firmwares etc.

Start with ILO firmware (to all blades)
server BIOS and rest of server firmwares (download FDT 1.60 from HP)
update latest OA 2.52

And maybe an updated OS someday (update 8 maybe).
Remember to give Kudos to answers! (click the KUDOS star)

You can find me from Twitter @JKytsi
MJ99
Occasional Visitor

Re: ProLiant BL460c unexpected/random reboots with Redhat Linux 4.5 64bit

Jarkko,

Thanks for the info. I was placed on finding the root cause of this issue just today so I am catching up to speed in regards to this server and blade configuration.

I am getting the question of why this box goes down when the production box is up despite having identical configuration.

Once I update the firmware I will see what happens.

Thanks for the comment.
Jase4772
Regular Advisor

Re: ProLiant BL460c unexpected/random reboots with Redhat Linux 4.5 64bit

Hi,

I had an issue with our BL460c G1's with Linux and they'd reboot themselves. This was driven by the ASR receiving a prompt from the IPMI Null Driver which was crashing.

My fix was to update the Management Agents through the VC Agent and this was resolved. The readme did indicate this as an error.

HTHs
JKytsi
Honored Contributor

Re: ProLiant BL460c unexpected/random reboots with Redhat Linux 4.5 64bit

And also ..if You are using anykind of cluster software (you have a cluster?), please disable ASR (from server ILO web page or from RBSU=BIOS settings).
Remember to give Kudos to answers! (click the KUDOS star)

You can find me from Twitter @JKytsi