Server Management - Systems Insight Manager


 
Mahesh Shah_3
Frequent Advisor

Reboot and Hardware Status Polling tasks

We have over 1550 servers in HP SIM and schedule reboots every Sunday across different time zones. During the reboots we received 'System is unreachable' and then 'System is reachable' messages. I unchecked both options under Options, Events, 'Status Change Event Settings', and now we no longer receive any unreachable/reachable event messages. However, when a server is down, hung during reboot, or not pingable, we also receive no alert from the hardware status polling task. Is there any way to keep the scheduled reboots from sending unreachable/reachable event messages?
Rob Buxton
Honored Contributor

Re: Reboot and Hardware Status Polling tasks

The server reboots faster than the polling interval. So HPSIM polls, you reboot the server and it's back before the next polling cycle.
By default for servers it's every 5 minutes. More than enough time for a server to reboot.
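
A rough way to see how the polling interval and the reboot time interact: if polls fire on a fixed interval and the reboot starts at an unrelated moment, the chance that a poll lands while the server is down is roughly the downtime divided by the interval. This is a simplified model, not a description of how HP SIM schedules its polls; the function name and the 3-minute downtime are made up for illustration.

```python
def chance_poll_hits_outage(downtime_min: float, interval_min: float) -> float:
    """Probability that a fixed-interval poll falls inside the outage.

    Assumes the outage starts at a uniformly random point in the
    polling cycle (an assumption of this sketch, not HP SIM behavior).
    """
    if interval_min <= 0:
        raise ValueError("interval_min must be positive")
    return min(1.0, max(0.0, downtime_min) / interval_min)


# Hypothetical 3-minute reboot, checked against two polling intervals.
for interval in (5, 15):
    p = chance_poll_hits_outage(3, interval)
    print(f"interval {interval} min: {p:.0%} chance a poll sees the server down")
```

Under this model a 3-minute reboot is caught about 60% of the time with a 5-minute interval and about 20% of the time with a 15-minute interval, so a longer interval makes the alert less likely but does not rule it out.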
Mahesh Shah_3
Frequent Advisor

Re: Reboot and Hardware Status Polling tasks

I changed the Hardware Status Polling task from 5 to 15 minutes; we are still receiving false-positive pager/email messages for servers saying 'System is unreachable' and then 'System is reachable'. I would like to avoid these types of messages so we can get a good night's sleep.
Rob Buxton
Honored Contributor

Re: Reboot and Hardware Status Polling tasks

What do you mean by false positives?

The polling task will generate an event if it cannot access the server.
It doesn't know it's a scheduled reboot. If you're getting events when servers are actually available then you may have network issues.

There is a time filter on the event set up, you could use that to not check at certain times. But that would be difficult with a generic event handler and different time zones.
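
To show why one time filter is awkward across time zones, here is a minimal sketch that checks an event timestamp against a reboot window expressed in each server's local time. It is not HP SIM configuration; the window times, the fixed UTC offsets, and the function name are all assumptions for illustration, and fixed offsets ignore daylight saving time.

```python
from datetime import datetime, time, timedelta, timezone, tzinfo

# Hypothetical weekly reboot window, in the server's local time.
WINDOW_START = time(2, 0)
WINDOW_END = time(2, 30)
SUNDAY = 6  # datetime.weekday(): Monday is 0, Sunday is 6


def in_reboot_window(event_time: datetime, server_tz: tzinfo) -> bool:
    """Return True if the event falls inside the server's local Sunday window."""
    if event_time.tzinfo is None:
        raise ValueError("event_time must be timezone-aware")
    local = event_time.astimezone(server_tz)
    return local.weekday() == SUNDAY and WINDOW_START <= local.time() < WINDOW_END


# Two hypothetical servers with fixed offsets (no daylight saving handling).
us_east = timezone(timedelta(hours=-5))
tokyo = timezone(timedelta(hours=9))

event = datetime(2024, 1, 7, 7, 10, tzinfo=timezone.utc)  # a Sunday in UTC
print(in_reboot_window(event, us_east))  # 02:10 Sunday local time
print(in_reboot_window(event, tokyo))    # 16:10 Sunday local time
```

The same UTC instant is inside the window for one server and outside it for the other, which is why a single generic event handler with one time filter does not fit servers spread over several time zones.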

Also, you'd need to check where the events are coming from. If you've got an event that triggers a page based on a "Warm Start" or similar then that's independent of the HW Polling.
Mahesh Shah_3
Frequent Advisor

Re: Reboot and Hardware Status Polling tasks

We know that the servers are rebooted at the scheduled reboot time, and the hardware status polling task runs during that time. The server is not pingable at that moment, so it sends us a pager/email message 'System is unreachable', and then 15 minutes later we receive a 'System is reachable' message. How can we avoid this type of message? We would like to receive a message only when a server is down or has a hardware issue, not during the scheduled reboot.

You are correct about the time filter; it is very cumbersome to configure.

I did check the event messages, and they are coming from the HW status polling task.
For example:
'The current system is no longer reachable from the central management server, hardware status polling has marked this system as not responding'.

David Claypool
Honored Contributor

Re: Reboot and Hardware Status Polling tasks

I'd suggest using Suspend during your maintenance window. You can schedule it as a command using 'mxnode.'
Rob Buxton
Honored Contributor

Re: Reboot and Hardware Status Polling tasks

You'll always get the events, but you don't need to e-mail, page etc. on all events generated.

System Is Reachable is a "Normal" message, so you'd need to check why your event handler task is doing anything with these messages.
I only have event tasks that alert me to Critical or Major events. So I certainly do not get any e-mails when a Server is back.
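
As a minimal sketch of that idea (plain Python, not HP SIM's event handling): notify only on the severities you care about and let everything else be logged silently. The event dictionaries, the system names, and the Critical severity on the fan event are made up for illustration; only the "Normal" severity of System Is Reachable comes from this thread.

```python
# Severities that should page or e-mail someone.
NOTIFY_SEVERITIES = {"Critical", "Major"}


def should_notify(event: dict) -> bool:
    """Return True only for events whose severity warrants a page or e-mail."""
    return event.get("severity") in NOTIFY_SEVERITIES


# Hypothetical events; field names are assumptions of this sketch.
events = [
    {"system": "srv01", "text": "System Is Reachable", "severity": "Normal"},
    {"system": "srv02", "text": "Fan failure", "severity": "Critical"},
]

to_send = [e for e in events if should_notify(e)]
for e in to_send:
    print(f"notify: {e['system']}: {e['text']} ({e['severity']})")
```

With this filter the "server is back" event is still generated and recorded, but it never reaches the pager.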

Mahesh Shah_3
Frequent Advisor

Re: Reboot and Hardware Status Polling tasks

We don't have any problem getting email/pager messages for critical server events; it is a great tool for monitoring HP ProLiant server hardware and is working just fine for us.

Since we have 1500 servers in HP SIM and they are rebooted in different time zones, is there any way to not receive alerts when a server is rebooted at its weekly scheduled time?

You are correct that 'System is reachable' is a 'Normal' message, and we are not getting email/pager messages for it, since we have configured the alert tasks to send pager/email for critical and major events only.

Thanks.
Jimmy Rueedi
Frequent Advisor

Re: Reboot and Hardware Status Polling tasks

Same problem here (not with 1500 servers, only 150).

We have several different server groups which are rebooting at different times.
So I configured a timeout of 120 seconds and a retry count of 5.
Our idea was to get a tolerance of 10 minutes for the rebooting issue. (Same way we did in the very old Compaq Insight Manager)
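
The arithmetic behind that 10 minutes, as a small sketch. It assumes a retry count of 5 means five attempts in total and that every attempt waits out the full timeout; how HP SIM actually counts retries and paces them is not confirmed here, and the function name is made up.

```python
def tolerance_seconds(timeout_s: int, attempts: int) -> int:
    """Worst-case time spent before a system is declared unreachable,
    assuming each attempt runs for the full timeout."""
    if timeout_s < 0 or attempts < 0:
        raise ValueError("timeout_s and attempts must be non-negative")
    return timeout_s * attempts


total = tolerance_seconds(timeout_s=120, attempts=5)
print(f"{total} seconds = {total / 60:.0f} minutes")
```

This is an upper bound only: if an attempt fails immediately instead of timing out, the whole sequence can finish well before the 10 minutes are up.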

This problem seems not to be new:
http://forums1.itrc.hp.com/service/forums/questionanswer.do?threadId=1039565

Any help would kindly be appreciated.

Regards
Jimmy
Mahesh Shah_3
Frequent Advisor

Re: Reboot and Hardware Status Polling tasks

I made the same changes you described, but we are still receiving the system reachable/unreachable messages during reboots. Did you set the timeout of 120 seconds and the retry count of 5 in the Hardware Status Polling task only, or did you also change the Global Protocol Settings or the System Protocol Settings?