ProLiant Servers (ML,DL,SL)
1748170 Members
4060 Online
108758 Solutions
New Discussion

SPP 2018.03.0 hp-health and hp-snmp-agents failed to start : hpasmlited ILO not responding

 
SOLVED
Go to solution
plecoq
Occasional Collector

SPP 2018.03.0 hp-health and hp-snmp-agents failed to start : hpasmlited ILO not responding

hp-health & hp-snmp-agents failed to start with error :

 hpasmlited check_ilo2: BMC Returned Error:  ccode  0x0,  Req. Len:  15, Resp. Len:  21
 hpasmlitedThe Integrated Lights-Out Management Processor is not responding!

Could you help us find the cause of this error ?

Thanks.

 

ProLiant DL380 Gen10 Centos 7.4 o r Redhat 7.4 kernel 3.10.0-693.21.1.el7.x86_64

SPP 2018.0.3.0 installed components :
hp-health-10.70-1846.6.rhel7.x86_64
hponcfg-5.2.0-0.x86_64
hp-smh-templates-10.7.0-1485.2.noarch
hp-snmp-agents-10.70-2962.5.rhel7.x86_64
ssa-3.25-4.0.x86_64
ssacli-3.25-4.0.x86_64
ssaducli-3.25-4.0.x86_64
sut-2.2.0-46.linux.x86_64

[root@hostDL380Gen10 ~]# systemctl status  hp-health.service
â hp-health.service - HP System Health Monitor
   Loaded: loaded (/usr/lib/systemd/system/hp-health.service; enabled; vendor preset: disabled)
   Active: failed (Result: timeout) since Thu 2018-04-19 10:03:00 CEST; 8h ago
  Process: 1257 ExecStart=/usr/lib/systemd/scripts/hp-health.sh start (code=killed, signal=TERM)

Apr 19 10:02:01 hostDL380Gen10 hpasmlited[1617]: check_ilo2: BMC Returned Error:  ccode  0x0,  Req. Len:  15, Resp. Len:  21
Apr 19 10:02:01 hostDL380Gen10 hpasmlited[1617]: The Integrated Lights-Out Management Processor is not responding!
Apr 19 10:02:01 hostDL380Gen10 hpasmlited[1617]: Sleeping 30 seconds and will retry . ..
Apr 19 10:02:31 hostDL380Gen10 hpasmlited[1617]: check_ilo2: BMC Returned Error:  ccode  0x0,  Req. Len:  15, Resp. Len:  21
Apr 19 10:02:31 hostDL380Gen10 hpasmlited[1617]: The Integrated Lights-Out Management Processor is not responding!
Apr 19 10:02:31 hostDL380Gen10 hpasmlited[1617]: Sleeping 30 seconds and will retry . ..
Apr 19 10:03:00 hostDL380Gen10 systemd[1]: hp-health.service start operation timed out. Terminating.
Apr 19 10:03:00 hostDL380Gen10 systemd[1]: Failed to start HP System Health Monitor.
Apr 19 10:03:00 hostDL380Gen10 systemd[1]: Unit hp-health.service entered failed state.
Apr 19 10:03:00 hostDL380Gen10 systemd[1]: hp-health.service failed.
[root@hostDL380Gen10 ~]# systemctl status  hp-snmp-agents.service  
â hp-snmp-agents.service - HP SNMP Agents
   Loaded: loaded (/usr/lib/systemd/system/hp-snmp-agents.service; enabled; vendor preset: disabled)
   Active: failed (Result: timeout) since Thu 2018-04-19 10:04:30 CEST; 8h ago
  Process: 1805 ExecStart=/usr/lib/systemd/scripts/hp-snmp-agents.sh start (code=killed, signal=TERM)

Apr 19 10:03:31 hostDL380Gen10 hpasmlited[1886]: check_ilo2: BMC Returned Error:  ccode  0x0,  Req. Len:  15, Resp. Len:  21
Apr 19 10:03:31 hostDL380Gen10 hpasmlited[1886]: The Integrated Lights-Out Management Processor is not responding!
Apr 19 10:03:31 hostDL380Gen10 hpasmlited[1886]: Sleeping 30 seconds and will retry . ..
Apr 19 10:04:01 hostDL380Gen10 hpasmlited[1886]: check_ilo2: BMC Returned Error:  ccode  0x0,  Req. Len:  15, Resp. Len:  21
Apr 19 10:04:01 hostDL380Gen10 hpasmlited[1886]: The Integrated Lights-Out Management Processor is not responding!
Apr 19 10:04:01 hostDL380Gen10 hpasmlited[1886]: Sleeping 30 seconds and will retry . ..
Apr 19 10:04:30 hostDL380Gen10 systemd[1]: hp-snmp-agents.service start operation timed out. Terminating.
Apr 19 10:04:30 hostDL380Gen10 systemd[1]: Failed to start HP SNMP Agents.
Apr 19 10:04:30 hostDL380Gen10 systemd[1]: Unit hp-snmp-agents.service entered failed state.
Apr 19 10:04:30 hostDL380Gen10 systemd[1]: hp-snmp-agents.service failed.
[root@hostDL380Gen10 ~]#

 

[root@hostDL380Gen10 ~]# systemctl status hp-asrd.service
â hp-asrd.service - Starts hp asrd (HP ...)
   Loaded: loaded (/usr/lib/systemd/system/hp-asrd.service; enabled; vendor preset: disabled)
   Active: active (running) since Thu 2018-04-19 10:03:00 CEST; 8h ago
  Process: 1804 ExecStart=/usr/lib/systemd/scripts/hp-asrd.sh start (code=exited, status=0/SUCCESS)
 Main PID: 1837 (hp-asrd)
   CGroup: /system.slice/hp-asrd.service
           ââ1837 /opt/hp/hp-health/bin/hp-asrd -p 1
           ââ1838 /opt/hp/hp-health/bin/hp-asrd -p 1

Apr 19 10:03:00 hostDL380Gen10 systemd[1]: Starting Starts hp asrd (HP ...)...
Apr 19 10:03:00 hostDL380Gen10 hpasrd[1838]: Starting with poll 1 and timeout -60
Apr 19 10:03:00 hostDL380Gen10 hpasrd[1838]: Setting the watchdog timer.
Apr 19 10:03:00 hostDL380Gen10 hpasrd[1838]: Found iLO memory at 0xd9b9e000.
Apr 19 10:03:00 hostDL380Gen10 hpasrd[1838]: Successfully mapped device.
Apr 19 10:03:00 hostDL380Gen10 hpasrd[1838]: WARNING: Can not open /dev/cpqhealth/casr.
Apr 19 10:03:00 hostDL380Gen10 hpasrd[1838]:
                                           ERROR: Failed to get ASR enabled state.
Apr 19 10:03:00 hostDL380Gen10 hp-asrd.sh[1804]: Starting HP Advanced Server Recovery Daemon[  OK  ]
Apr 19 10:03:00 hostDL380Gen10 systemd[1]: Started Starts hp asrd (HP ...).

 

1 REPLY 1
plecoq
Occasional Collector
Solution

Re: SPP 2018.03.0 hp-health and hp-snmp-agents failed to start : hpasmlited ILO not responding

Do not install components for HP Gen10/ILO5 (not supported) :

hp-health-10.70-1846.6.rhel7.x86_64
hp-smh-templates-10.7.0-1485.2.noarch
hp-snmp-agents-10.70-2962.5.rhel7.x86_64

asmd must be installed instead :

 amsd-1.2.0-2657.49.rhel7

and configure SMA in reverse mode :

  • AMS (forward mode) Agentless Management Service (AMS) - The standard configuration of AMS is to pass information from the OS toiLO. (default)
  • SMA (reverse mode) System Management Assistant (SMA) - When SMA is enabled, information is passed from iLO to the OS. (add option -R /etc/sysconfig/smad)

SNMP is working fine now :

example :

CPQIDA-MIB::cpqDaMibCondition.0 = INTEGER: ok(2)

CPQHLTH-MIB::cpqHeThermalCpuFanStatus.0 = INTEGER: other(1)

CPQHLTH-MIB::cpqHeThermalSystemFanStatus.0 = INTEGER: ok(2)

CPQHLTH-MIB::cpqHeResilientMemCondition.0 = INTEGER: ok(2)

CPQHLTH-MIB::cpqHeFltTolPwrSupplyCondition.0 = INTEGER: degraded(3)

CPQHLTH-MIB::cpqHeMibCondition.0 = INTEGER: degraded(3)