- Community Home
- >
- Servers and Operating Systems
- >
- Legacy
- >
- HPE 9000 and HPE e3000 Servers
- >
- Re: system on line with a cpu failure
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Forums
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-24-2003 09:32 AM
тАО10-24-2003 09:32 AM
Let's say I'm running an rp7000 series server with HPUX 11.11 on it. Let's say it has 4 cpus. If anyone one of the cpus fails will the server continue to run without crashing??
I know the "old school" T520s will crash and de-configure the bad CPU. I wasn't sure if the newer generation servers worked differently.
Also, why does the server need to crash in the event of a cpu failure? Why can't it continue to work on the remaining cpus. Is it because applications are multi-threaded?
Is there any OS/Hardware that can continue to run if a CPU fails.
Thanks
Kirk
Solved! Go to Solution.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-24-2003 09:53 AM
тАО10-24-2003 09:53 AM
Re: system on line with a cpu failure
If you run top, you will see one less cpu. Same deal with glance.
EMS will not detect the problem and email you. This is the kind of thing you have to spot as a sysadmin.
SEP
Owner of ISN Corporation
http://isnamerica.com
http://hpuxconsulting.com
Sponsor: http://hpux.ws
Twitter: http://twitter.com/hpuxlinux
Founder http://newdatacloud.com
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-27-2003 01:14 AM
тАО10-27-2003 01:14 AM
Re: system on line with a cpu failure
Most, if not all, of the time the system will panic. This is deliberate because the system may not be able to determine the resources that the failed CPU had allocated & will now never release. In fact this is what directly causes the panic. The timer pops on another CPU because it times out waiting for the resource that the failed CPU is *never* going to release.
I suppose that if a completely idle CPU failed, then there would be no need for a panic - but I suspect this happens rarely. If it's doing nothing, why would it fail?
Plus the reboot give the system the opportunity to deallocate the CPU.
My 2 cents,
Jeff
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-27-2003 02:30 AM
тАО10-27-2003 02:30 AM
Re: system on line with a cpu failure
On HP servers with the processors PA8500 onwards, dynamic processor deallocation is possible. All processor except CPU0 (known as monarc processor) can be deallocated online with the product HP EMS and Support Tool Manager installed.
Once the defective processor is deallocated online, it will be deconfigured completely upon next system reboot.
There is a white paper on this at http://docs.hp.com/hpux/onlinedocs/diag/dynamic.pdf
Hope this help.
Pramod
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-28-2003 01:09 PM
тАО10-28-2003 01:09 PM
SolutionTo answer your question, there are some unique hardware/OS products that are extremely expensive which can survive the majority of processor/memory/backplane failures but for most data centers, it is far more economical to run failover clusters.
Bill Hassell, sysadmin