- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - Linux
- >
- RHES3U4 system was auto rebooted twice whin 11 hou...
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Forums
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО12-13-2007 08:37 AM
тАО12-13-2007 08:37 AM
I have a Redhat Linux physical server, HP Blade, RH ES 3 update 4, Installed oracle 10g RAC. I have no idea why this server was auto rebooted twice from Dec, 12 21:57 to Dec, 13 07:30
#last reboot
reboot system boot 2.4.21-27.ELsmp Thu Dec 13 07:30 (04:04)
reboot system boot 2.4.21-27.ELsmp Wed Dec 12 21:57 (13:37)
I didn't find out any usaful information from system log file and dmesg.
How to check this kind of reason?
Thank you very much any answers will be very appreciate.
-Gary
Solved! Go to Solution.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО12-13-2007 09:54 AM
тАО12-13-2007 09:54 AM
SolutionIf this is the cause, then start troubleshotting the nodes interconnect.
Another possible option is if you have a environment problem, for example, a failed fan, so the server will reboot if the temperature goes high.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО12-13-2007 10:20 AM
тАО12-13-2007 10:20 AM
Re: RHES3U4 system was auto rebooted twice whin 11 hours
Thank you very much for your fast reply.
Questions for you:
1. Except the Fan reason, whatelse could cause the system reboot? Disk(s), NIC(s) etc.? As you know, after the twice system auto rebooted, so far the server running as normal.
2. How to check the health status of hardwares through server's console or GUI Desktop?
thank a lot
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО12-13-2007 10:33 AM
тАО12-13-2007 10:33 AM
Re: RHES3U4 system was auto rebooted twice whin 11 hours
So far, I just saw server reboots like yours (controlled reboots) caused by fan so power supply failure issued by APCI.
2. How to check the health status of hardwares through server's console or GUI Desktop?
This depends of the hardware model, on Itanium based machines, you have a console where you can check hardware logs, on proliant servers, you should rely on "Proliant Support Pack" and email notifications.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО12-13-2007 10:37 AM
тАО12-13-2007 10:37 AM
Re: RHES3U4 system was auto rebooted twice whin 11 hours
My physcial server is Proliant box.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО12-13-2007 11:06 AM
тАО12-13-2007 11:06 AM
Re: RHES3U4 system was auto rebooted twice whin 11 hours
the Processors informations have some different with others
This server: (3.2G Dual CPU installed on ProLiant BL20p G3)
Proc 1: 3200 MHz
Processor 1 Internal L1 Cache: 16 KB
Processor 1 Internal L2 Cache: 1024 KB
Proc 2: unavailable
Others: ( the same configurations with above)
Proc 1: 3200 MHz
Processor 1 Internal L1 Cache: 16 KB
Processor 1 Internal L2 Cache: 1024 KB
Proc 2: 3200 MHz
Processor 2 Internal L1 Cache: 16 KB
Processor 2 Internal L2 Cache: 1024 KB
Whether there is a CPU failed caused system reboot?
How to make sure it?
Thanks
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО12-13-2007 11:13 AM
тАО12-13-2007 11:13 AM
Re: RHES3U4 system was auto rebooted twice whin 11 hours
CPU states: cpu user nice system irq softirq iowait idle
total 0.8% 0.0% 0.0% 0.0% 0.0% 1.1% 97.9%
cpu00 1.5% 0.0% 0.0% 0.0% 0.0% 1.1% 97.2%
cpu01 0.1% 0.0% 0.0% 0.0% 0.0% 1.1% 98.6%
what's going on?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО12-13-2007 12:08 PM
тАО12-13-2007 12:08 PM
Re: RHES3U4 system was auto rebooted twice whin 11 hours
>>> How to make sure it?
Are you sure this server had 2 physical CPUS? Or always had 1 physical COU. CPU failures normally cause the server to PANIC!.
>>> But through check the system via command "top", it looks not failed,
If you have only one cpu, and it's dual core or hyperthreading enabled, you will see 2 CPU (or more) from the O.S. view for each physical cpu.
Have you already verified oracle CRS logs?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО12-13-2007 12:53 PM
тАО12-13-2007 12:53 PM
Re: RHES3U4 system was auto rebooted twice whin 11 hours
I'm not quite sure the physcial CPU number, because this server located in another city, I will go there for checking tomorrow. Maybe it's one physcial CPU that with Dual core.
I have been checking the oracle log file with oracle team.
thanks a lot.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО12-13-2007 04:25 PM
тАО12-13-2007 04:25 PM
Re: RHES3U4 system was auto rebooted twice whin 11 hours
If it's a HW problem, u can check IML in iLO.
But this sounds something like your OS or your RAC.
Try to search out your /var/log/messages to check events before the reboot line.