
Re: CentOS 6.2 + hp-health = fail

 
busche
Visitor

Re: CentOS 6.2 + hp-health = fail (something else broke hp-health!)

Sadly, same issue here with CentOS 6.2 on a DL180 G6 without iLO.

Initial install 3-4 months ago: all tools worked perfectly.

A routine OS update last week (done after hot-swapping a malfunctioning disk drive) resulted in the issue everyone is reporting. This morning, I tested the following kernels:

 

2.6.32-279.1.1.el6.x86_64 #1 SMP Tue Jul 10 13:47:21 UTC 2012 x86_64 x86_64 x86_64 GNU/Linux
2.6.32-220.23.1.el6.x86_64 #1 SMP Mon Jun 18 18:58:52 BST 2012 x86_64 x86_64 x86_64 GNU/Linux

 

 

Each was tested with hp-health 8.7, 9.0 and 9.1, rebooting every time: same failures.

 

I just raised an issue on their hotline... let's see what happens.

 

One further remark: the following appears in my /var/log/messages (I just ran the test again, hence the odd timestamps):

 

Jul 20 16:20:12 host kernel: hpasmpld:22300 map pfn expected mapping type uncached-minus for 9e000-a0000, got write-back
Jul 20 16:20:12 host hpasmpld[22300]: ehpsmb_parse_SMBIOS: SMBIOSInitTable was not successful.
Jul 20 16:20:12 host kernel: hpasmpld[22300]: segfault at 0 ip 0000000000414918 sp 00007fff53881ef8 error 4 in hpasmpld[400000+2a000]
Jul 20 16:20:12 host abrtd: Directory 'ccpp-2012-07-20-16:20:12-22300' creation detected
Jul 20 16:20:12 host abrt[22301]: Saved core dump of pid 22300 (/opt/hp/hp-health/bin/hpasmpld) to /var/spool/abrt/ccpp-2012-07-20-16:20:12-22300 (516096 bytes)
Jul 20 16:20:12 host abrtd: Package 'hp-health' isn't signed with proper key
Jul 20 16:20:12 host abrtd: 'post-create' on '/var/spool/abrt/ccpp-2012-07-20-16:20:12-22300' exited with 1
Jul 20 16:20:12 host abrtd: Corrupted or bad directory /var/spool/abrt/ccpp-2012-07-20-16:20:12-22300, deleting
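
To check whether another machine is hitting the same crash, the segfault line above can be searched for directly. This is only a minimal sketch: the function name find_hpasmpld_segfaults is made up for illustration, and it assumes the kernel logs the crash in the same "hpasmpld[PID]: segfault" form as in my log.

```shell
#!/bin/sh
# Sketch: print hpasmpld segfault entries from a syslog file.
# find_hpasmpld_segfaults is a hypothetical helper name, not part of hp-health.
# Usage: find_hpasmpld_segfaults [logfile]   (defaults to /var/log/messages)
find_hpasmpld_segfaults() {
    logfile=${1:-/var/log/messages}
    # Match "hpasmpld[<pid>]: segfault" as written by the kernel.
    grep 'hpasmpld\[[0-9]*]: segfault' "$logfile"
}
```

Reading /var/log/messages normally requires root. The function returns a non-zero status when no matching line is found.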

 

The package-signing problem is also reported (for a different package) in the CentOS forums.

However, I doubt that in my case it is related to additional/alternate repositories.

 

btw: for me, the command that directly causes the segfault is

/opt/hp/hp-health/bin/hpasmpld -f /dev/ipmi0

 

Best,

  /b

 

 

busche
Visitor

Re: CentOS 6.2 + hp-health = fail (something else broke hp-health!)

OK, first thing: I accidentally upgraded my CentOS to version 6.3. I am fairly sure that this is, somehow, the root cause of my troubles.

 

After some discussions with HP we came to the following conclusion:

* hp-health should automatically disable IPMI on the DL180 G6 (since hp-health uses the iLO, which is present in all G6s).

Unfortunately, the IPMI modules are loaded automatically when /etc/init.d/hp-health start is called. Running rmmod on the IPMI modules beforehand does not help.

 

Even the most recent CentOS 6.3 (as of Mon, 23rd July 2012) does not work with either hp-health 8.7 or 9.1.

 

As a concluding remark:

1) I will give a plain new installation of 6.2 a try, sometime in the next 2 weeks.

2) HP support is really motivated to solve problems even for officially unsupported operating systems. Thank you! (And please try to continue the software support for at least one open source / freely available Linux distribution, as has been the case since CentOS 6.1.)

 

Best,

 /b

 

xonxoff
Visitor

Re: CentOS 6.2 + hp-health = fail (something else broke hp-health!)

Logged a call with HP on this.

 

They had me run cfg2html and send the output to them, which I did.

 

After reviewing the output they got back to me to say that CentOS is not a supported OS, and sent this helpful information:

 

For support on Centos and other  non supported Linux distribution.

 

Please check the below web link for more information:

http://h18004.www1.hp.com/products/servers/linux/linuxcommunity/linuxfaqs.html

 

Brilliant, thanks HP!

Fabrice Bacchella
New Member

Re: CentOS 6.2 + hp-health = fail

I'm just another poor victim of this bug.

 

Brand new DL180 G6, full firmware patch.

 

Running Scientific Linux 6.2 (another Red Hat clone) with all HP tools up to date (hp-health-9.1.0.42).

Naresh_Kumar
Occasional Visitor

Re: CentOS 6.2 + hp-health = fail

Workaround: "Start the hp-health service prior to the mcelogd service"

 

Steps (example):

 

1. List the service priorities using the command:


# ll /etc/rc.d/rc3.d/ |grep "mce\|hp-health"

lrwxrwxrwx. 1 root root 17 Aug 20 10:50 S50mcelogd -> ../init.d/mcelogd

lrwxrwxrwx. 1 root root 19 Aug 20 10:50 S91hp-health -> ../init.d/hp-health


As can be seen, hp-health has a lower priority (higher number, S91) than mcelogd (numbered S50), so it starts later.

For hp-health to start before mcelogd, the hp-health start number must be lower than the mcelogd number.

 

This can be achieved by editing the hp-health init script.

 

2. Open the hp-health init script and look at its header:

 

# vi /etc/init.d/hp-health

# (c) Copyright 2002, 2008 Hewlett-Packard Development Company, L.P.
#
# See "man chkconfig" for information on next two lines (Red Hat only)
# chkconfig: 2345 91 2
# description: hp System Health Monitor and Command line Utility Package.

 

3. Find the start number on the chkconfig line (in this case 91) and change it to a number lower than that of mcelogd (50).

 

4. For example, let's change it to 41 by overwriting the existing number: chkconfig: 2345 41 2

 

5. Save the file and exit the editor.

 

6. Run the following commands for the change to take effect:

 

# chkconfig hp-health --list
hp-health 0:off 1:off 2:on 3:on 4:on 5:on 6:off

 

# chkconfig --del hp-health

 

# chkconfig hp-health --list
service hp-health supports chkconfig, but is not referenced in any runlevel (run 'chkconfig --add hp-health')

 

# chkconfig --add hp-health

 

# chkconfig hp-health --list
hp-health 0:off 1:off 2:on 3:on 4:on 5:on 6:off

 

7. Check that the service priorities have changed, using the following command:

 

# ll /etc/rc.d/rc3.d/ |grep "mce\|hp-health"
lrwxrwxrwx. 1 root root 19 Aug 21 05:37 S41hp-health -> ../init.d/hp-health
lrwxrwxrwx. 1 root root 17 Aug 21 05:37 S50mcelogd -> ../init.d/mcelogd
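
Steps 2 to 6 can also be scripted. The following is a minimal sketch, not something HP ships: the function name set_start_priority is made up for illustration, and it assumes the init script has a "# chkconfig: <runlevels> <start> <stop>" header line like the one shown in step 2.

```shell
#!/bin/sh
# Sketch: rewrite the start priority in an init script's chkconfig header.
# set_start_priority is a hypothetical helper name, not part of hp-health.
# Usage: set_start_priority <init-script> <new-start-priority>
set_start_priority() {
    script=$1
    prio=$2
    tmp=$(mktemp) || return 1
    # Keep the runlevels and the stop priority; replace only the start priority.
    if sed "s/^\(# chkconfig:[[:space:]]\{1,\}[0-9-]\{1,\}[[:space:]]\{1,\}\)[0-9]\{1,\}\([[:space:]]\{1,\}[0-9]\{1,\}\)/\1${prio}\2/" "$script" > "$tmp"
    then
        # Write back through cat so the script keeps its owner and mode.
        cat "$tmp" > "$script"
        status=$?
    else
        status=1
    fi
    rm -f "$tmp"
    return $status
}

# Example (run as root on the affected host):
#   set_start_priority /etc/init.d/hp-health 41
#   chkconfig --del hp-health
#   chkconfig --add hp-health
#   ls -l /etc/rc.d/rc3.d/ | grep 'mce\|hp-health'
```

Take a backup copy of /etc/init.d/hp-health first. Note that a later hp-health package update may replace the init script and bring back the original number.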

Regards
NK

Gagan_Brahmi
Occasional Visitor

Re: CentOS 6.2 + hp-health = fail

The following link should help as a temporary workaround.

 

http://gaganonthenet.com/2012/08/22/fix-for-hp-health-on-dl100-series-running-centos6/

xonxoff
Visitor

Re: CentOS 6.2 + hp-health = fail

Many thanks NK.

 

All working again now.

 

Tim