- Community Home
- >
- Servers and Operating Systems
- >
- HPE ProLiant
- >
- Server Management - Systems Insight Manager
- >
- Incorrect Hardware Status
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-09-2004 12:34 PM
03-09-2004 12:34 PM
Incorrect Hardware Status
In our environment occasionally the HP agents and/or SNMP stop. In IM7 the hardware status becomes Unknown to alert us to an issue with the system. HP SIM on the other hand continues to report the hardware as Normal (green). As another step in my testing, to make sure it wasn't getting a "Normal" status from a source other than SNMP, I pulled a drive on the system with the agents and SNMP stopped. HP SIM still reported it as "Normal".
Has anybody seen this?? This is an issue that would stop us from going to HP SIM. These are W2K SP4 systems running 7.0 agents.
Any help would be appreciated.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-09-2004 02:11 PM
03-09-2004 02:11 PM
Re: Incorrect Hardware Status
IM 7 would go from Green to a blue dot and Unknown, SIM would change to UnManaged but the Green Tick would remain. The difference is that there are now two states, Unknown and UnManaged. Unknown indicates SNMP is active but it doesn't recognise the device. UnManaged indicates there is no SNMP coimmunication. In both cases, the stopping of SNMP doesn't generate a Server Out alert, rather the change is identified in different places.
In both cases it would, I think, be the running of the Device Identification process that picked up the change in state, it may be the Auto-Discovery you'd need to test.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-10-2004 06:29 AM
03-10-2004 06:29 AM
Re: Incorrect Hardware Status
Unfortunetly we only run device identification once a day, during off hours, where we run hardware status poling every minute. And as you probably know, a lot can go wrong when a system basically goes unmonitored (other than ping) for a day.
This raises a couple of questions from me. Why would HP SIM, when not recieving SNMP from a system it has identified as a server still consider that server in a normal state? Why did this change from IM7?
Thanks for your help on this.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-10-2004 08:50 AM
03-10-2004 08:50 AM
Re: Incorrect Hardware Status
The IM 7 Hardware Polling Task used SNMP, the HPSIM will try a number of protocols, SNMP, Ping, DMI.
You could remove Ping from the List of methods used to contact the Server to try and see if that registered the change.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-10-2004 09:27 AM
03-10-2004 09:27 AM
Re: Incorrect Hardware Status
We think too much alike. :-)
I actually have tried that. Actually removed everything but SNMP and it is still normal. I also removed everything but DMI, HTTP, and ping just to see and they all had the same result, the server is normal.
Thanks for the ideas.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-10-2004 09:55 AM
03-10-2004 09:55 AM
Re: Incorrect Hardware Status
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-10-2004 10:03 AM
03-10-2004 10:03 AM
Re: Incorrect Hardware Status
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-10-2004 10:24 AM
03-10-2004 10:24 AM
Re: Incorrect Hardware Status
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-10-2004 10:25 AM
03-10-2004 10:25 AM
Re: Incorrect Hardware Status
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-10-2004 02:00 PM
03-10-2004 02:00 PM
Re: Incorrect Hardware Status
Ping is self-explanatory. The SNMP poll is the same poll you would make of SNMP regardless of whether the Insight agents are running or not (basically doing an SNMP get of the system name OID and seeing if you get a response).
The poll identified as the agents is a neat little thing created 10 years ago which is an SNMP get of the OID of the "ProLiant Status Array." You may have noticed that it's really only for ProLiants that you get an indication of "minor." That's because within that array are indications of things like a failed device of redundant systems like drives in an array. This is unique to the Insight agents (as we get real Insight agents for some of the other HP systems--already available for Integrity SuperDome on HP-UX--they will have that ability also. Over the next several months, we'll be getting that for the rest of the "rx" Itanium line also (it arrives a little bit at a time as newly announced systems get it first and then it trickles down to previously announced platforms like the rx2600). This is one of the efforts sometimes referred to as the "ProLiantization" of Integrity.
hpSIM has the ability to do HTTP polling, but we don't have agents with the ability to be polled via HTTP yet. This is one of the things we're pursuing that will eventually allow you to eliminate SNMP if you wish to do so in the future (an item high on the customer wish list because of the perceived lack of security of SNMP). We won't eliminate the possibility of using SNMP altogether because that is what so many third party products that provide an interface to the Insight agents rely on to get their information.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-10-2004 03:25 PM
03-10-2004 03:25 PM
Re: Incorrect Hardware Status
are you able to replicate Gene's issue?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-11-2004 06:15 AM
03-11-2004 06:15 AM
Re: Incorrect Hardware Status
Yes, I see the problem Gene is reporting, there's a change in behaviour and I thought it would be possible to rejig the polling tasks in HPSIM to replicate the old behaviour.
I shutdown SNMP plus Web Agents on a Server, the HW Status changed within about 5 minutes on CIM 7. On HPSIM the HW status never changed, despite me removing all but SNMP from the polling tasks and just went to UnManaged after the device identification task ran.
I need to reread what David has said, but I'm not sure how you can address the problem Gene has raised, if SNMP stops on a Server there's no easy way to pick it up with HPSIM.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-11-2004 06:31 AM
03-11-2004 06:31 AM
Re: Incorrect Hardware Status
I've not tried this, but you might want to follow it up.
There are two Hardware Polling tasks, Server and Non-Server. You'd need to remove ping etc. from both as when SNMP stops on the Server, the Server then moves to the non-Server list (as it is no longer recognised) where ping still reports it as okay.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-11-2004 07:25 AM
03-11-2004 07:25 AM
Re: Incorrect Hardware Status
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-11-2004 08:35 AM
03-11-2004 08:35 AM
Re: Incorrect Hardware Status
I gave that a shot, but still no luck. All my hardware polling tasks are now only polling via snmp. Yet system status is still normal with snmp stopped. I even ran a device identification to change the system type to unmanaged, and still no change in status. It seems to me if it was only polling via snmp and snmp on a system was stopped, the system would go critical???
Mike,
I may have to resort to that, but I'd rather not add additional jobs on my servers to check snmp/agent service state, when it seems to me HP SIM should give us some indication when doing snmp polling (like IM7 does)that something is wrong.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-11-2004 10:18 AM
03-11-2004 10:18 AM
Re: Incorrect Hardware Status
My only guess is that when IM tries the SNMP connection it must get something back from the Server that indicates it is still alive.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-11-2004 10:34 AM
03-11-2004 10:34 AM
Re: Incorrect Hardware Status
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-17-2004 11:56 AM
03-17-2004 11:56 AM
Re: Incorrect Hardware Status
Found out that you better not uncheck ping from your status polling. If you do and a server goes down, the HW status of the server remains normal (does not go critical until you poll it using ping or run a device ID). But hey with the server turned off at least the Agents and SNMP finally went into an unknown state.
So it appears that HW critical status is entirely driven by ping. I wonder what would happen if we have to block ICMP (ping)traffic on our network. All servers critical?