ProLiant Servers (ML,DL,SL)

Re: DL180G6 on VMWARE ESX4 problem

 
DariuszLyskoit
Occasional Advisor

DL180G6 on VMWARE ESX4 problem

I have one DL180G6 server with an internal disk array, and I have installed the ESX4U1 operating system on it.
The problem is that it has not worked correctly since it was deployed. It reboots at random times, roughly every two days.
My server specification is:
4x SATA HDD 1TB (configured with RAID6)
6x DDR3 4GB ECC MEMORY (24GB)
2x 460W power supplies

I found these lines in my log file ("/var/log/messages"):
-------------------------------------
Mar 31 23:58:13 esx1 hpasmpld[16076]: OsBmcGetSelEntry Reserve SEL Failed: 0xD2
Mar 31 23:58:24 esx1 hpasmpld[16076]: WARNING: System Power Supplies Not Redundant
Mar 31 23:58:24 esx1 hpasmpld[16076]: CRITICAL: System Fan Removed (Fan 1, Location System)
Mar 31 23:58:24 esx1 hpasmpld[16076]: CRITICAL: System Fan Removed (Fan 2, Location System)
Mar 31 23:58:24 esx1 hpasmpld[16076]: CRITICAL: System Fan Removed (Fan 3, Location System)
Mar 31 23:58:24 esx1 hpasmpld[16076]: CRITICAL: System Fan Removed (Fan 4, Location System)
Mar 31 23:58:34 esx1 hpasmpld[16076]: CRITICAL: System Power Supply Removed (Power Supply 1)
Mar 31 23:58:34 esx1 hpasmpld[16076]: A System Reboot has been requested by the management processor in 60 seconds.
Mar 31 23:58:35 esx1 hpasmpld[16076]: CRITICAL: System Power Supply Removed (Power Supply 2)
Mar 31 23:58:35 esx1 hpasmpld[16076]: Sensor 0x80 Not Redundant FtrFlags 1 (OsHlthfncUpdateFtrRedundancy)
Mar 31 23:58:35 esx1 hpasmpld[16076]: BMC Communications Error - Attempting synchronization!
Mar 31 23:59:20 esx1 hpasmpld[16076]: The BMC has responded to reset request . . .
Mar 31 23:59:20 esx1 hpasmpld[16076]: Stopping the IPMI Watchdog Timer . . .
Mar 31 23:59:20 esx1 hpasmpld[16076]: Resetting Internal Data structures . . .
Mar 31 23:59:20 esx1 hpasmpld[16076]: Initializing Internal Data structures from the BMC . . .
Mar 31 23:59:21 esx1 hpasmpld[16076]: The BMC reset / synchronization has completed successfully
Apr 1 00:00:21 esx1 hpasmpld[16076]: A System Reboot has been initiated by the management processor.
Apr 1 00:00:21 esx1 shutdown[21399]: shutting down for system reboot
Apr 1 00:00:25 esx1 hpasmpld[16076]: The poll API returned unexpected error:
Apr 1 00:00:25 esx1 hpasmpld[16076]: Error Code: 29 - Interrupted system call
-------------------------------------

Can someone help me to understand what is going on with it?
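
A log excerpt like the one above can be triaged with a short script. What follows is a minimal Python sketch, not an HP or VMware tool. It parses syslog-style lines, keeps the `hpasmpld` entries, and reports whether several "Removed" events arrived in one short burst next to a BMC communications error and a reboot request. The function names, the `year` default (syslog lines carry no year), and the 30-second window are assumptions made for illustration, and the sketch expects an excerpt covering a single incident. A burst of this kind only hints that the sensor readings may be unreliable; it does not prove the cause.

```python
import re
from datetime import datetime

# Matches lines such as:
# "Mar 31 23:58:24 esx1 hpasmpld[16076]: CRITICAL: System Fan Removed (...)"
LINE_RE = re.compile(
    r"^(?P<mon>\w{3})\s+(?P<day>\d{1,2})\s+(?P<time>\d{2}:\d{2}:\d{2})\s+"
    r"(?P<host>\S+)\s+(?P<proc>[\w.-]+)\[(?P<pid>\d+)\]:\s+(?P<msg>.*)$"
)


def parse_line(line, year=2010):
    """Return (timestamp, process, message), or None if the line does not match.

    The year is an assumption: classic syslog timestamps omit it.
    """
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    stamp = f"{year} {m['mon']} {m['day']} {m['time']}"
    ts = datetime.strptime(stamp, "%Y %b %d %H:%M:%S")
    return ts, m["proc"], m["msg"]


def summarize(lines, window_seconds=30):
    """Summarize hpasmpld events in a single-incident log excerpt."""
    parsed = (parse_line(line) for line in lines)
    events = [e for e in parsed if e is not None and e[1] == "hpasmpld"]

    removed = [ts for ts, _, msg in events if "Removed" in msg]
    bmc_errors = [ts for ts, _, msg in events if "BMC Communications Error" in msg]
    reboots = [ts for ts, _, msg in events if "System Reboot has been requested" in msg]

    # Hypothetical heuristic: three or more "Removed" events whose first and
    # last timestamps lie within window_seconds count as a burst.
    burst = False
    if len(removed) >= 3:
        span = (max(removed) - min(removed)).total_seconds()
        burst = span <= window_seconds

    return {
        "removed_count": len(removed),
        "burst": burst,
        "bmc_comm_errors": len(bmc_errors),
        "reboot_requests": len(reboots),
    }


if __name__ == "__main__":
    try:
        with open("/var/log/messages") as fh:
            print(summarize(fh))
    except OSError as exc:
        print(f"could not read log: {exc}")
```

Run against the excerpt above, the sketch would count every fan and power supply "Removed" line as part of one burst, which is the pattern worth mentioning when asking for help.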

Fran Garcia
Frequent Advisor

Re: DL180G6 on VMWARE ESX4 problem

Update the iLO firmware to the latest version; actually, I would update all of the firmware ;-)

Try the Firmware Update CD (v8.70) if you want the quickest route.
DariuszLyskoit
Occasional Advisor

Re: DL180G6 on VMWARE ESX4 problem

Thank you for your reply.
I have updated all of my firmware to the latest versions, but that did not help.

PS: I can't use the "Firmware Update CD" on DL180 series servers, because it is not supported.
Creek
Frequent Advisor

Re: DL180G6 on VMWARE ESX4 problem

Can you post your current BIOS and BMC versions?
DariuszLyskoit
Occasional Advisor

Re: DL180G6 on VMWARE ESX4 problem

BMC Firmware Version: 4.06;
BIOS: HP ProLiant DL180 G6/DL160se G6 System ROM - O20;
Creek
Frequent Advisor

Re: DL180G6 on VMWARE ESX4 problem

And what is the date of the O20 BIOS? It will be displayed at POST.
DariuszLyskoit
Occasional Advisor

Re: DL180G6 on VMWARE ESX4 problem

BIOS date is 2010.02.04
anis.h
Occasional Visitor

Re: DL180G6 on VMWARE ESX4 problem

I have the same issue with:
> - DL160 G6
> - Operating system : Suse SLES 11
> - Bios : 04/02/2010
> - IPMI : 4.04c
> - PSP : 8.30
Creek
Frequent Advisor

Re: DL180G6 on VMWARE ESX4 problem

This could be a problem with the IPMI driver. The comments below speak to Linux, but may also apply to VMware:

DESCRIPTION
On an HP ProLiant ML350 G5 server configured with a single processor, if
the HP System Health Application and Insight Management Agents for Linux
are installed without the HP OpenIPMI (hp-OpenIPMI) device driver
loaded, a console message is displayed indicating that there is a
problem with the system fan and that the server will shut down in 60
seconds. After 60 seconds has passed, the server reboots. When this
occurs, the following message is written to the /var/log/messages file:

hpasmlited: WARNING: System Fan Removed (Fan 6, Location CPU)

The HP OpenIPMI (hp-OpenIPMI) device driver reports information on the
status of the system fan to the HP System Health Application. If the HP
OpenIPMI (hp-OpenIPMI) device driver is not loaded, then the OpenIPMI
device driver that is included with the Linux kernel is loaded by
default. The hpasmlited application uses the IPMI Sensor Device Records
(SDRs) to determine what devices are present and working and what action
to take. The SDR for Fan 6 is delivered to the hpasmlited application
with instructions to shut down the server if the fan is missing or
failed. As a result, the HP System Health Application flags the system
fan as not being present and shuts down the server. The HP OpenIPMI
(hp-OpenIPMI) device driver relies on OEM messages sent by the Baseboard
Management Controller (BMC) to shut the server down. The IPMI 2.0 OEM
messages are not supported by the standard Linux drivers shipped with
the Linux kernel.

When the ProLiant Support Pack for Linux is loaded, the HP OpenIPMI
(hp-OpenIPMI) device driver is loaded by default. However, if a user
chooses to not load the HP OpenIPMI (hp-OpenIPMI) device driver, then
the OpenIPMI device driver that is included with the Linux kernel is
loaded.
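
As a rough first check on a Linux host, the loaded IPMI kernel modules can be listed by reading `/proc/modules`. The sketch below is a minimal Python illustration with a hypothetical helper name. It assumes that IPMI-related module names begin with `ipmi` (for example `ipmi_si`), and it cannot tell whether a module came from the hp-OpenIPMI package or from the stock kernel; answering that needs the system's package manager.

```python
def loaded_ipmi_modules(proc_modules_text):
    """Return the names of loaded modules whose name starts with 'ipmi'.

    proc_modules_text is the content of /proc/modules, where the first
    whitespace-separated field of each line is the module name.
    """
    names = []
    for line in proc_modules_text.splitlines():
        fields = line.split()
        if fields and fields[0].startswith("ipmi"):
            names.append(fields[0])
    return names


if __name__ == "__main__":
    try:
        with open("/proc/modules") as fh:
            print(loaded_ipmi_modules(fh.read()))
    except OSError as exc:
        print(f"could not read /proc/modules: {exc}")
```

An empty list means no module matching that prefix is loaded, which is worth mentioning alongside the agent and firmware versions when reporting the problem.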
DariuszLyskoit
Occasional Advisor

Re: DL180G6 on VMWARE ESX4 problem

My problem disappeared after I updated the BMC firmware to version 4.20A.