Server Management - Systems Insight Manager
1823156 Members
3505 Online
109647 Solutions
New Discussion юеВ

Re: Health Status Major Warning problem

 
Davide De Michino
Occasional Advisor

Health Status Major Warning problem

SIM notify me that 2 servers have a health warning. If I click on the icon, SIM opens me the System management homepage that seems to have no problem: all is correct.
What's the problem?
Thanks

P.S. I've also an other server, the same of the other two, and have no problem!!!!
10 REPLIES 10
JKytsi
Honored Contributor

Re: Health Status Major Warning problem

All NICs connected ?
Remember to give Kudos to answers! (click the KUDOS star)

You can find me from Twitter @JKytsi
Davide De Michino
Occasional Advisor

Re: Health Status Major Warning problem

yes yes....
all seems ok in the SMH!!!
I also have disabled tg remote insight information service (I saw a post as mine), but it doesn't resolve my problem.
Help Me!
:-)
NJK-Work
Honored Contributor

Re: Health Status Major Warning problem

Check the Integrate Log in SMH. I don't know at what version it started happening, but it no longers shows the IML has an error on the SMH homepage. So SIM detects an error, but SMH does not "show" it. You have to dig into the logs and then clear them.

At least, that has been my experience with this problem.

Nelson
NJK-Work
Honored Contributor

Re: Health Status Major Warning problem

Oh, and I forgot to mention:

I have also seen it where you have to reboot the managed machine to clear the error. Restarting the HP agents does not work. I think I even had to delete from SIM:

1) Delete from SIM
2) Reboot the managed node
3) Re-add to SIM

I have only had this a few times, but it is very irratating and it was the ONLY way I found to clear the alert.

Nelson
Davide De Michino
Occasional Advisor

Re: Health Status Major Warning problem

Very strange.
I clear the log. delete the machine from SIM, reboot the machine, re-add.
all seems ok, after few seconds...boom! the same...Major alert!
NJK-Work
Honored Contributor

Re: Health Status Major Warning problem

Is the iLO disconnected?

Nelson
Davide De Michino
Occasional Advisor

Re: Health Status Major Warning problem

if you say the manament processor...yes is in "ignore status".
From th SMH all is ok!
sandeep_raman
Honored Contributor

Re: Health Status Major Warning problem

What is the PSP version on the 2 servers which have a health warning?

/SRH
Davide De Michino
Occasional Advisor

Re: Health Status Major Warning problem

mmm... I don't now.
I installed all the agent from SMH


System ROM P56-08/03/2008 Deployable BIOS
HPQILO2.SYS 1.8.0.0 Running/Full Power Driver
ATI2MTAG.SYS 6.14.10.6744 Running/Full Power Driver
ipmidev_v.dll 1.0.0.0 Running/Full Power Configuration Software
BXVBDX.SYS 4.4.23.0 Running/Full Power Driver
VCAGENT.EXE 2.1.10.801 Running/Full Power Instrumentation
hpetdev_v.dll 1.0.1.0 Running/Full Power Configuration Software
3234103C 5.20 Deployable Firmware
HPONCFG.EXE 2.0.4.1 Running/Full Power Configuration Software
hpadu.exe 8.10.3.0 Running/Full Power Configuration Software
hpacucli.exe 8.10.2.0 Running/Full Power Configuration Software
P56 2008.08.03 Deployable Firmware
HPDIAGS.EXE 8.1.5.2890 Running/Full Power Configuration Software
hpcisss2.sys 6.12.4.32 Running/Full Power Driver
CPQCIDRV.SYS 1.13.0.0 Running/Full Power Driver
cissesrv.exe 6.10.0.32 Running/Full Power Configuration Software
RI7 1.60 Deployable Firmware
CPQRCMC.EXE 5.20.0.0 Running/Full Power Configuration Software
cpqacuxe.exe 8.10.1.0 Running/Full Power Configuration Software
HPSMHD.EXE 3.0.0.64 Running/Full Power Configuration Software
CPQTEAM.SYS 9.30.0.0 Running/Full Power Driver
AGENTVER.DLL 8.15.0.0 Running/Full Power Configuration Software
NJK-Work
Honored Contributor

Re: Health Status Major Warning problem

From this post: http://forums11.itrc.hp.com/service/forums/questionanswer.do?threadId=1223096

The source of this problems stems from the use of Thresholds (the Java based sliders for Disk, Server, and Processor in the SMH).

It appears the thresholds are not being reset properly when a server goes back to within normal operating parameters when running the 8.00 Management Agents. For example, if the process TH is set for 90% and the server hits 91%, it kicks off a warning idicator, but the server does not go back to "Normal" when the processor drops back below 90%.

This is componded by the use of the WBEM homepage view, which does not show this (it shows all normal even though SIM is reporting warnings). Switching to the SNMP view shows the "incorrect" errors, even though disk, server and process are all now running normally. Reseting the HP agents then resets this incorrect warning messages until the next time the thresholds become exceeded again, and then the whole process repeats.

The "fix" for this is to disable all thresholds (drag all the sliders to the far left, or 0 value).

Nelson