- Community Home
- >
- Servers and Operating Systems
- >
- Legacy
- >
- Operating System - Tru64 Unix
- >
- Server crash
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-29-2004 04:21 AM
01-29-2004 04:21 AM
Halted CPU 1
CPU 0 is not halted
halt code = 7
machine check while in PAL mode
PC = 1D0C0
warning too many processor corrected errors detected on cpu 0. Reporting suspended.
On cold boot the server managed to boot but worried might crash again. Pls advise
Solved! Go to Solution.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-29-2004 05:55 AM
01-29-2004 05:55 AM
SolutionCheck the /var/adm/messages file for more information about the error.
Most of the times this happens, you will need to replace the CPU.
Call HP support and let them know that you have a problem with the CPU crashing. Hop you have service agreement with them.
If you do not have a service agreement, you can troubleshoot the problem by swapping the 2 CPU's that you have (since the problems are reported from CPU 0) and disabling the faulty CPU (which is now CPU 1)from the SRM console commands.
P000> show cpu_enabled
most probably it will be ff.
change it to 1 to just enable one CPU and disable others
P000> set cpu_enabled 1
and boot the system and monitor it.
HTH
Mohamed
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-29-2004 07:52 PM
01-29-2004 07:52 PM
Re: Server crash
you may look with decevent into binary errorlog and see, what the cpu is complaining about. If so, can you post it?
Otherwise I concur with Mohamed, replace it.
Michael
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-29-2004 08:42 PM
01-29-2004 08:42 PM
Re: Server crash
memory, cpu, cache
so please open a call within an HP support center and provice binary.errlog for further investigation. This is a hardware issue.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-29-2004 09:25 PM
01-29-2004 09:25 PM
Re: Server crash
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-01-2004 07:56 PM
02-01-2004 07:56 PM
Re: Server crash
Are you sure the binary.errlog was properly analyzed?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-08-2004 12:00 PM
02-08-2004 12:00 PM
Re: Server crash
Binary.errlog was analyzed by HP engineer and he couldn't find anything in it. After crash, the binary.errlog contained a lot of corrupted data(1010...)
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-08-2004 04:34 PM
02-08-2004 04:34 PM
Re: Server crash
Thus, when any other CPU crashes, binary.errlog typically does have errors logged. If CPU0 crashes - it's often not.
Exactly we've seen.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-08-2004 09:21 PM
02-08-2004 09:21 PM
Re: Server crash
A "too many processor corrected errors" indicates, that the cpu tries to correct the problem so logging must be done within binary.errlog. Maybe the second sentence "reporting suspended" gives us a clue that reporting stopped due to the errors.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-03-2004 06:38 PM
03-03-2004 06:38 PM
Re: Server crash
I am also facing the same problem in one of our Alpha servers.
At the time of boot, the console shows the following error:
"Too many processor corrected errors detected on cpu (8, 16 & 24 -- the m/c has 4 cpus).Reporting suspended"
Does this indicate that all the 3 cpu's are gone !!!!!!!!!
What is the best course of action for me ??
Thanks & Regards,
Ramesh.K.R.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-03-2004 06:56 PM
03-03-2004 06:56 PM
Re: Server crash
It seems a cpu/memory/cache problem!
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-03-2004 06:56 PM
03-03-2004 06:56 PM
Re: Server crash
I bet you would have decevent on your Alphas. Why don't you try and run it to look for precise errors.
In any case the error "Too many processor corrected errors on CPU0" suggest that you may need to have the CPU0 replaced.
Are your Alphas on h/w support from HP. If they are, please log a call with them and have them replace CPU0. If they don't want to replace the CPU0, i would suggest that you create a crash dump by force (from the boot prompt issue Ctrl+P) and pass that on to HP for review.
Hope this helps
Keep us updated.
regards
Mobeen
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-03-2004 07:08 PM
03-03-2004 07:08 PM
Re: Server crash
Many thanks for the quick response. I will definitly book a call with HP support. What i wanted to know in the mean time was, this error is repeated for cpu no's 8, 16 & 24. So, doeas it mean all 3 cpu's are having problem?? or only the cpu "0" ??
Regards,
Ramesh.K.R.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-03-2004 07:39 PM
03-03-2004 07:39 PM
Re: Server crash
time will tell ;-). the binary.errlog is the first step to analyze the problem but it is useless to do that without the programs and register information HP staff have.
But this problematic was written several times here in the forum, so please read first and follow on the HP support center way, there is nothing we can do here at this point!
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-03-2004 08:24 PM
03-03-2004 08:24 PM
Re: Server crash
I will update this forum, once i have any furthur info on this.
Regards,
Ramesh.K.R.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-14-2004 10:31 PM
03-14-2004 10:31 PM
Re: Server crash
http://forums1.itrc.hp.com/service/forums/questionanswer.do?threadId=507209
-Karthik S S
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-15-2004 04:09 PM
03-15-2004 04:09 PM
Re: Server crash
Were your CPUs replaced as we guessed, let us know the outcome of your call with HP
rgds
Mobeen