- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - OpenVMS
- >
- CPU failure
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Discussions
Forums
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО07-13-2004 06:23 PM
тАО07-13-2004 06:23 PM
CPU failure
I would like to know what chance I have that after this the cpu will never fail again.
I had some cpu failures before and certain cpu's never gave the failure again. But maybe I was just very lucky.
Wim
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО07-14-2004 01:16 AM
тАО07-14-2004 01:16 AM
Re: CPU failure
This is NOT an official answer by any strech, but froma practial point of view we find you can indeed often get away with a spurious CPU failure.
Of course I just work with lab/test equipment, not on production systems so I can just try and try again.
Sometimes we just reboot, and if we have the chance will power down, re-seat modules, power up. In doing so the problem has often gone away for ever (or long enough we no longer remmeber an earlier failure).
Of course we also have a 'three strikes and you are out rule'. One failure... bad luck. reboot. Two failures... hmm let's try the power-cycle + jiggle routine. Three failures... in the thrash pile.
Grins,
Hein.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО07-14-2004 01:19 AM
тАО07-14-2004 01:19 AM
Re: CPU failure
I was thinking of doing a replacement after a 2nd failure. So you agree, allthough not officially.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО07-14-2004 04:38 AM
тАО07-14-2004 04:38 AM
Re: CPU failure
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО07-14-2004 03:36 PM
тАО07-14-2004 03:36 PM
Re: CPU failure
This is more a question of
a.) How important is keeping that system up, b.) How much the downtime costs you.
C.) Do you have a support contract that covers the replacement, or does it come out of your pocket?
If it is a test/development machine that you really do not care about... So what if it craps out on you once a month or so. A power cycle will often clear a problem on a CPU that was looping bugchecks.
If it is a 24x7x365 system... can you take that chance?
It really is an unknown.
I'm changing out a CPU on one of our production boxes later tonight that caused a crash last Saturday. I have Platinum support on production system, and our management pushes for the replacement even though HP recommends waiting for a second outage. To us, it is not worth the second outage. We have it replaced from the onsite spares!
Mike Naime
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО07-14-2004 06:25 PM
тАО07-14-2004 06:25 PM
Re: CPU failure
Webes :
Bcache tag parity error reported by CPU1, CPU Slot1 of SoftQbb0 (HardQbb0)
Simular case found in which they say that it might be an overheated cpu. So, a 1 time event ?
http://groups.google.com/groups?num=100&hl=nl&lr=&ie=UTF-8&q=%
22analyzing+hw+error+on+21164LX%22
In any case, replacing the cpu is also a risk. The new one may have problems too. In 2 years, we have replaced 2 out of 4 cpu's in the GS160. So a new cpu can be a bigger risk than keeping the old one.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО07-14-2004 08:28 PM
тАО07-14-2004 08:28 PM
Re: CPU failure
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО07-15-2004 12:14 AM
тАО07-15-2004 12:14 AM
Re: CPU failure
This way you will have done 2 things at the same time:
Re-seated the boards
Pointed to the exact faulty module
Mohamed
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО07-15-2004 12:18 AM
тАО07-15-2004 12:18 AM
Re: CPU failure
The system is up since 2 days now without any problems.
Wim
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО08-18-2004 09:59 AM
тАО08-18-2004 09:59 AM