- Community Home
- >
- Servers and Operating Systems
- >
- HPE ProLiant
- >
- ProLiant Servers (ML,DL,SL)
- >
- Re: ML370 W2K3 Random Reboots
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Forums
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-15-2006 04:51 AM
тАО02-15-2006 04:51 AM
ML370 W2K3 Random Reboots
I've been having the strangest issue for the past few weeks, and was hoping someone could provide some insight on the issue.
Our environment is about 200 users on 6 terminal servers (standard MS-RDP, not Citrix). The servers are a mix of ML370 G3s, G4s, and ML350 G4s. Our DCs and file servers are also ML350s.
Due to various issues, we've rebuilt all of the servers over the last three months. Everything was running fine for about a month, when one of the ML370 G3 started randomly rebooting, sometimes twice a day. We took that server down, and then it began happening to the servers (all models) as well. Now it is like a whack-a-mole game, any of the servers may go down at any time during operational hours.
When I say "randomly reboot", I mean that the system just goes straight down, and boots back up; no STOP error, no power down cycle. The system's event log has no entries for the few minutes preceding the reboot, and none of the Insight Agents report anything.
All the servers have the latest drivers, and latest firmware. The reboots primarily tend to happen during standard peak hours (beginning of day, lunchtime, and end of day), but I've run performance monitoring, and seen nothing that should crash a server. I'm now going through and trying to diagnose the various software installed on the machines, but I figure if that was the case, Windows would log something, or get a STOP error.
I'm really all out of ideas. The servers are connected to UPSes, and on a separate circuit from everything else. Some also have redundant PSUs, so that's not the case. Any suggestions you have would be MUCH appreciated.
Thanks in advance.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-15-2006 06:30 AM
тАО02-15-2006 06:30 AM
Re: ML370 W2K3 Random Reboots
In such cases since you are not getting dump.
Two thing we can do.
1) check for IML logs any error.
2) isable ASR on server. Since this the feature of proliant server if server stop responding then ASR is trigered on server. that leads to reboot so inthat case no dump is saved.
For RCA we need to think more on such kind of things.
Regards,
Prashant S.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-15-2006 07:31 AM
тАО02-15-2006 07:31 AM
Re: ML370 W2K3 Random Reboots
I appreciate your advice. The IML log shows nothing at all. I did disable ASR on one of the servers, so now I'll wait and see what happens. Thanks.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-22-2006 02:58 AM
тАО02-22-2006 02:58 AM
Re: ML370 W2K3 Random Reboots
Even with ASR disabled, the systems still reboot. The only change we've made recently is printer drivers, so I'm going to try to roll those back.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-22-2006 03:46 AM
тАО02-22-2006 03:46 AM
Re: ML370 W2K3 Random Reboots
thanks.
--Andy
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-22-2006 03:59 AM
тАО02-22-2006 03:59 AM
Re: ML370 W2K3 Random Reboots
We keep the room at about 70 degrees, and the machines don't report overheating issues. Humidity I don't think is a factor. We've also tested the UPSes that they are connected to, and they seem to be fine. I'm fairly confident that it's not a hardware issue, although that was my initial guess, too.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-22-2006 05:11 AM
тАО02-22-2006 05:11 AM
Re: ML370 W2K3 Random Reboots
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-22-2006 05:18 AM
тАО02-22-2006 05:18 AM
Re: ML370 W2K3 Random Reboots
Yes, all the machines have Symantec Enterprise resident on them at all times. I have performed a full scan, and run three different malware removal utilities. The terminal servers are completely locked down (no write access to most of the hard drive), so I don't think it's that, either.
Thanks, and thanks to everyone who has replied.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-22-2006 05:58 AM
тАО02-22-2006 05:58 AM
Re: ML370 W2K3 Random Reboots
thanks.
--Andy
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-22-2006 06:06 AM
тАО02-22-2006 06:06 AM
Re: ML370 W2K3 Random Reboots
As for your most recent suggestion, we don't have BackupExec on the Terminal Servers. There's no critical data on them, so there's no need. But thanks.