
DL360:BMC IPMI Watchdog Timer Timeout: Action=System Power Reset, TimerUse: 0x44, timerActions: 0x01

 
Balaramesh
Occasional Visitor


Hi,

I am facing a restart issue with ProLiant DL360 G7 servers. All 4 DL360 servers on the same network have similar restarts. Sometimes all 4 servers restart at the same time.

The iLO log has the information below:

BMC IPMI Watchdog Timer Timeout: Action=System Power Reset, TimerUse: 0x44, timerActions: 0x01

Watchdog reset on smif_domain, pc 01780a28, sp 0000ec98.
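
For reference, the two hex fields in the log entry can be decoded with the bit layout of the IPMI v2.0 "Set Watchdog Timer" command. The sketch below is only an illustration: the function name `decode_watchdog` and the dictionary keys are made up for this example, and the meaning of bit 6 of the timer-use byte depends on the command (it is "don't stop timer" in a Set request and "timer running" in a Get response), so it is reported only as a raw flag.

```python
# Lookup tables taken from the IPMI v2.0 watchdog timer field definitions.
TIMER_USE = {1: "BIOS FRB2", 2: "BIOS/POST", 3: "OS Load", 4: "SMS/OS", 5: "OEM"}
PRE_TIMEOUT = {0: "None", 1: "SMI", 2: "NMI / diagnostic interrupt", 3: "Messaging interrupt"}
TIMEOUT_ACTION = {0: "No action", 1: "Hard reset", 2: "Power down", 3: "Power cycle"}


def decode_watchdog(timer_use, timer_actions):
    """Decode the TimerUse and timerActions bytes (hypothetical helper)."""
    return {
        "dont_log": bool(timer_use & 0x80),            # bit 7
        "bit6_flag": bool(timer_use & 0x40),           # bit 6, meaning depends on command
        "timer_use": TIMER_USE.get(timer_use & 0x07, "reserved"),
        "pre_timeout_interrupt": PRE_TIMEOUT.get((timer_actions >> 4) & 0x07, "reserved"),
        "timeout_action": TIMEOUT_ACTION.get(timer_actions & 0x07, "reserved"),
    }


if __name__ == "__main__":
    # Values from the iLO log entry above.
    print(decode_watchdog(0x44, 0x01))
```

Decoded this way, TimerUse 0x44 is the SMS/OS timer (the one normally armed and reset by software running under the operating system) and timerActions 0x01 is a hard reset with no pre-timeout interrupt, which matches the "Action=System Power Reset" text in the log.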

Software Information:

The servers have CentOS release 5.11 (Final) installed.

iLO Firmware Version 1.82 Jan 15 2015

BIOS Version : P68

VPR1
HPE Pro

Re: DL360:BMC IPMI Watchdog Timer Timeout: Action=System Power Reset, TimerUse: 0x44, timerActions:

Hi,

This is regarding the DL360 G7 server reboot with BMC IPMI Watchdog Timer Timeout: Action=System Power Reset, TimerUse: 0x44, timerActions: 0x01.

 

Please provide the information below for a better understanding.

1. Since when has the issue been occurring?

2. Were the servers working fine earlier? If yes, were any changes made before the issue started?

3. Did all the servers reboot at the same time?

4. Was there any power outage at the site?

Note:

Since all 4 servers are going down at the same timestamp, I would not suspect a hardware issue.
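
To answer question 3 with data, the reset timestamps from each server's iLO log can be compared for overlap. Below is a minimal sketch, assuming the timestamps have already been copied out of the logs by hand; the host names, the sample times, and the helper `coincident_resets` are all hypothetical and not part of any HPE tool.

```python
from datetime import datetime, timedelta


def coincident_resets(events, window=timedelta(minutes=1)):
    """Group resets from different hosts that fall within `window` of each other.

    events: dict mapping host name -> list of datetime reset times.
    Returns a list of (first_time, sorted_host_names) for groups with 2+ hosts.
    """
    flat = sorted((t, host) for host, times in events.items() for t in times)
    groups = []
    i = 0
    while i < len(flat):
        start = flat[i][0]
        hosts = set()
        j = i
        # Collect every reset that happened within the window of the first one.
        while j < len(flat) and flat[j][0] - start <= window:
            hosts.add(flat[j][1])
            j += 1
        if len(hosts) >= 2:
            groups.append((start, sorted(hosts)))
        i = j
    return groups


if __name__ == "__main__":
    # Made-up sample data; replace with the real iLO event times.
    sample = {
        "srv1": [datetime(2015, 6, 1, 3, 0, 5), datetime(2015, 6, 3, 12, 0, 0)],
        "srv2": [datetime(2015, 6, 1, 3, 0, 20)],
        "srv3": [datetime(2015, 6, 1, 3, 0, 40)],
    }
    for start, hosts in coincident_resets(sample):
        print(start.isoformat(), hosts)
```

If most resets land in groups that span several servers, a common external trigger (network, power, or a shared software event) is more likely than four independent hardware faults.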

 

I would suggest the actions below for now.

1. Update the servers to the latest BIOS version.

2. Update the iLO firmware to version 1.91.

3. Update the MCP for CentOS 5:

https://support.hpe.com/hpsc/swd/public/detail?sp4ts.oid=4091411&swItemId=MTX_96d45eb6465b4c738de3d85ef3&swEnvOid=4184

4. Monitor the servers.

If the issue is still seen, capture the logs below and open a support case with HPE.

- IML/iLO logs

- Complete offline survey report

- OS logs

Regards,

Vijaya

 

Where there is a will, there is a way..!!

I am an HPE Employee