- Community Home
- >
- Servers and Operating Systems
- >
- HPE ProLiant
- >
- Server Management - Systems Insight Manager
- >
- Predicitve Failure Notification
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
12-19-2008 10:34 AM
12-19-2008 10:34 AM
Predicitve Failure Notification
I recently found that one of my servers (ML370 G4) has a predictive failure on one of the hard drives. I discovered this accidentally while in the server room (orange light flashing on the drive - which is behind the door of the server in a tower configuration). The problem is, SIM does not alert on this condition, as it is considered a "Minor Event". I have alerting turned on for Major and Critical Events, but I have not included Minor Events because of the level of noise produced. Does anyone have a creative way to catch these predictive failures, while still keeping the noise level down? For example, filtering the Minor Event capturing to specific machines, or possibly (but less preferable) filtering it to specific events? Of course the second option means you have to know what you specifically want to capture, which usually happens after the first crisis you encounter :>)
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
12-19-2008 11:49 AM
12-19-2008 11:49 AM
Re: Predicitve Failure Notification
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
12-19-2008 12:29 PM
12-19-2008 12:29 PM
Re: Predicitve Failure Notification
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
12-19-2008 12:40 PM
12-19-2008 12:40 PM
Re: Predicitve Failure Notification
We also have ML370 G4s, so hopefully this will be the correct trap for you. If not, your system logs will tell you which MIB and which trap you want to change and you can use the same procedure.
In SIMS, go to Options, Events, SNMP Trap Settings.
Click the drop down box by MIB name and select cpqida.mib
CLick the drop down box by Trap name and select (SNMP) Physical Drive Status Change (3046)
In the Severity box, select critical.
This is what I have done to get alerts.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
12-19-2008 02:49 PM
12-19-2008 02:49 PM
Re: Predicitve Failure Notification
I have followed your directions, but when I get to the Trap Name drop down, I don't have any entries that start with SNMP. What I have is numerous cpqDA(x) entries, where (x) is either 2-7, or there is a number of entries with cpqDA with no number after. So there are a number of "PhyDrvStatusChange" entries, but all of these already appear to be set to "Critical".
What you indicated (Physical Drive Status Change (3046)) shows up in the field below Trap Name titled "Event Type". The number (3046) is different for each of the cpqDA entries.
I have double checked the basic alerting functionality by sending a test trap, and it does appear to be working. I have the SNMP configured, and I do occasionally receive other alerts.
Thanks again for the help.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-05-2009 02:22 PM
01-05-2009 02:22 PM
Re: Predicitve Failure Notification
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-12-2009 02:54 PM
03-12-2009 02:54 PM
Re: Predicitve Failure Notification
Thanks.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-16-2009 08:51 AM
03-16-2009 08:51 AM
Re: Predicitve Failure Notification
did you check the logs as rancher suggested to identify the mib / trap involved ?