Montecito Cache Errors
08-20-2007 10:07 PM
Hi All
I have a number of rx7660s, some with a single cell board and others with multiple cell boards, as well as a number of rx2620s and rx4640s. All are multi-CPU servers.
We seem to be getting intermittent cache errors on the CPUs in all these units. The incidence is not high: just one a month, each time on a random CPU.
I don't think it's cause for concern, but I need to confirm this. EMS logs this as a MAJOR_WARNING, but the local HP techs seem to think EMS is being overzealous.
The problem was noticed when an HP rep called at 1am on a Monday wanting to replace a CPU that ISEE had reported as faulty.
Regards
Andrew Y
Si hoc legere scis, nimis eruditionis habes
08-21-2007 12:22 PM
Re: Montecito Cache Errors
I am glad you posted another thread with details to get more insight into the issue.
I hope my reply in your other post, http://forums1.itrc.hp.com/service/forums/questionanswer.do?threadId=1154618, helps.
08-22-2007 07:55 AM
Solution
The cache errors logged are not an issue. With the larger cache sizes in today's CPUs, a relatively higher number of cache errors is expected.
These cache errors are so-called CMCs (correctable errors). They will not cause an issue at the OS level, since they are already corrected at the hardware level. If more than the allowed number of cache errors (per the CPU specifications) occur within a certain timeframe, the prefailure monitor (the term used in the Windows OS management agents world; I'm not sure what it is called under Unix/EMS) will log a warning or error to let you know that the CPU has crossed the cache error threshold. It will advise you to replace the CPU proactively, to prevent an OS issue in case the CPU produces more errors over time that could affect the performance and stability of the system.
As long as the error threshold is not crossed, there is nothing to worry about.
From reading the thread(s) on this subject, it looks like ISEE isn't filtering out those errors as it should.
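To make the threshold idea concrete, here is a minimal sketch of a sliding-window counter for corrected cache errors. It is only an illustration of the logic described above, not how EMS, ISEE, or the Windows agents actually implement it. The class name `CorrectableErrorMonitor` and its `threshold` and `window` parameters are hypothetical; the 10-errors-per-24-hours values in the usage lines are just the figure mentioned later in this thread.

```python
from collections import deque
from datetime import datetime, timedelta


class CorrectableErrorMonitor:
    """Hypothetical sliding-window counter for corrected cache errors on one CPU."""

    def __init__(self, threshold: int, window: timedelta):
        self.threshold = threshold  # allowed number of errors inside the window
        self.window = window        # length of the timeframe being watched
        self._events = deque()      # timestamps of recent errors, oldest first

    def record(self, when: datetime) -> bool:
        """Record one corrected error; return True once the threshold is crossed."""
        self._events.append(when)
        # Drop errors that have aged out of the window.
        cutoff = when - self.window
        while self._events and self._events[0] <= cutoff:
            self._events.popleft()
        return len(self._events) > self.threshold


start = datetime(2007, 8, 1)

# Roughly one corrected error a month: the window never holds more than one event.
quiet = CorrectableErrorMonitor(threshold=10, window=timedelta(hours=24))
monthly_alerts = [quiet.record(start + timedelta(days=30 * i)) for i in range(6)]

# A burst of 11 errors, five minutes apart: only the 11th crosses the threshold.
noisy = CorrectableErrorMonitor(threshold=10, window=timedelta(hours=24))
burst_alerts = [noisy.record(start + timedelta(minutes=5 * i)) for i in range(11)]
```

With these inputs, the once-a-month pattern from the original post never raises an alert, while the burst does on its last error. A monitor that alerts on every single corrected error, rather than on a count within a window, would behave like the overzealous reporting described in this thread.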
08-22-2007 09:06 AM
Re: Montecito Cache Errors
Hi All
I have been dealing with the local HP reps and it seems that the original ES threshold of 10 errors per 24 hours was set too low. Newer versions of EMS and ISEE fix this. We are in the process of updating EMS and ISEE on all these servers.
Thanks for your assistance.
Andrew Y
Si hoc legere scis, nimis eruditionis habes