03-13-2007 03:47 AM
Server reboots roughly every three hours
I am running Debian/etch on a ProLiant ML370 G3 server. I have also managed to install the hpasm tools.
The install went fine, but the server occasionally reboots spontaneously, usually after about three hours. There are no error messages in /var/log/kern.log or /var/log/messages, and "hplog -v" shows nothing about a reboot, only the POST warning "Array Accelerator Battery Charge Low".
Has anyone experienced this before? Any suggested fixes, or even a way to find out why the reboot occurs (besides staring at the console for three hours straight)?
Thanks in advance for any help.
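One way to check the "three hours or so" figure without watching the console is to measure the time between boots from the log itself. This is only a sketch under assumptions: that the kernel's `Linux version ...` boot banner reaches /var/log/kern.log, and that syslog prefixes each line with a `Mon DD HH:MM:SS host kernel:` header. The function names `boot_times` and `uptimes` are made up for illustration.

```python
import re
from datetime import datetime

# Assumed syslog line shape: "Mar 13 03:47:01 host kernel: Linux version ..."
BOOT_RE = re.compile(
    r"^(\w{3}\s+\d+\s+\d\d:\d\d:\d\d)\s+\S+\s+kernel:.*Linux version"
)

def boot_times(lines, year=2007):
    """Return a datetime for every kernel boot banner found in the log lines."""
    times = []
    for line in lines:
        m = BOOT_RE.match(line)
        if m:
            # Traditional syslog timestamps omit the year, so it is supplied.
            stamp = " ".join(m.group(1).split())
            times.append(
                datetime.strptime(f"{year} {stamp}", "%Y %b %d %H:%M:%S")
            )
    return times

def uptimes(times):
    """Time between consecutive boots (an upper bound on each uptime)."""
    return [later - earlier for earlier, later in zip(times, times[1:])]
```

Feeding it the open log file, for example `uptimes(boot_times(open("/var/log/kern.log")))`, would list the gaps between boots. Regular gaps would point at a timer or watchdog; irregular ones would point at load.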
03-13-2007 07:43 AM
Re: Server reboots roughly every three hours
03-13-2007 08:00 AM
Re: Server reboots roughly every three hours
I happened to view one of the crashes in action by pure chance. The system froze for a couple of seconds, then a large amount of stuff was dumped to the console and then it rebooted. The stuff went by so fast that I didn't have a chance to see what it was, but it wasn't random binary garbage. It did say something. And apparently it's not saved anywhere.
Is there a way to log all console output so I can see what it was?
In the meantime I'm going to reboot, disable ASR and wait for the next crash, in the hope that the console output stays visible.
03-13-2007 08:08 AM
Re: Server reboots roughly every three hours
03-13-2007 02:33 PM
Re: Server reboots roughly every three hours
it might be helpful to gather a crash dump here. or you could always set the console to a serial port and hook up something there, so you can record the error as it passes by.
florian
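The capturing end of a serial console can be very small. This is a minimal sketch, assuming the crashing server has already been configured to send its console to a serial port, and that the serial device on the capturing machine is set to the matching speed beforehand. The function name `log_console` and the paths in the usage note are hypothetical.

```python
import os
import time

def log_console(src, dst, clock=time.time):
    """Copy lines from src to dst, prefixing each with a wall-clock timestamp.

    Every line is flushed and fsync'ed immediately, so the last lines
    printed before a crash are already on disk on the capturing machine.
    Returns the number of lines copied.
    """
    count = 0
    for raw in src:
        # Console output is normally ASCII; replace anything that is not.
        text = raw.decode("ascii", errors="replace").rstrip("\r\n")
        stamp = time.strftime("%Y-%m-%d %H:%M:%S", time.localtime(clock()))
        dst.write(f"{stamp} {text}\n")
        dst.flush()
        os.fsync(dst.fileno())
        count += 1
    return count
```

It could be called along the lines of `log_console(open("/dev/ttyS0", "rb", buffering=0), open("console.log", "a"))`, where the device name depends on the capturing machine. The timestamps make it easy to line the crash up with whatever was running at that moment.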
03-13-2007 06:50 PM
Re: Server reboots roughly every three hours
The server hasn't run since the end of 2004. Have there been firmware updates since then that fix ASR-related crashes? Or maybe ASR doesn't work properly on a Xen kernel? I am using the linux-image-2.6.18-4-xen-686 kernel. hpasm is installed in Xen domain0.
I am going to turn ASR back on and see if it starts crashing again.
> or you could always set the console to a serial port and hook up something there, so you can record the error as it passes by.
Good idea but I don't have anything to hook up. Maybe I can borrow something.
03-13-2007 07:09 PM
Re: Server reboots roughly every three hours
(Next thing I asked was if adapting the support in a hotfix kernel patch would be covered in software support :)
03-14-2007 03:42 AM
Re: Server reboots roughly every three hours
One of the Xen guest systems is used as an NFS server. When I upload a couple of GB from my desktop to the NFS share, the whole system goes down. So it looks like a Xen kernel / NFS issue. I've submitted a bug to Debian's BTS.
After a reboot the system logs show no trace of the crash, but that could be because NFS keeps the drives busy up to the point of the crash.
Anyway, thanks for the help so far. I guess I'm in the market for a different file server protocol that does behave well under Xen :-)
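For a bug report it helps to trigger the crash on demand rather than through a desktop upload. The following is a rough sketch of a write-load generator, assuming the share is mounted on the client at some path such as a hypothetical `/mnt/nfs`. `write_load` is an illustrative name and the sizes are arbitrary.

```python
import os
import time

def write_load(path, total_bytes, chunk_bytes=1024 * 1024):
    """Write total_bytes of zeros to path in chunks; return elapsed seconds."""
    chunk = b"\0" * chunk_bytes
    written = 0
    start = time.monotonic()
    with open(path, "wb") as out:
        while written < total_bytes:
            # The final chunk may be shorter than chunk_bytes.
            step = min(chunk_bytes, total_bytes - written)
            out.write(chunk[:step])
            written += step
        out.flush()
        os.fsync(out.fileno())  # push the data out of the client's cache
    return time.monotonic() - start
```

Something like `write_load("/mnt/nfs/load.bin", 2 * 1024**3)` would approximate the "couple of GB" upload described above. Running it with increasing sizes would show roughly how much incoming data it takes before the server falls over.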
03-16-2007 03:07 AM
Re: Server reboots roughly every three hours
I used to run various domUs based on NFS:
a) Linux NFS code is _not stable_, no matter what people say.
b) I currently have a Linux domU that serves as a fileserver, and I often push 10-80 GB in or out without stability issues.
c) I remember NFS-related crashes taking the system down, but back then I ran NFS in dom0 (stupid idea). The cause was that I NFS-exported loopback-mounted filesystem images that were corrupt. The filesystem corruption error only went to the kernel console, and it took days before I finally saw it.
If, as you write, your NFS server is in a domU but the dom0 crashes, then this is not an NFS issue but a load issue (it still points to the Xen kernel, though ;)
03-16-2007 07:31 PM
Re: Server reboots roughly every three hours
> b) I currently have a Linux domU that serves as a fileserver, and I often push 10-80 GB in or out without stability issues.
Pushing data out isn't the problem. It's taking data in over NFS that triggers the crashes.
> c) I remember NFS-related crashes taking the system down [...] The cause was that I NFS-exported loopback-mounted filesystem images that were corrupt.
I am exporting filesystems on LVM volumes. I don't use loopback filesystems, so that can't be it.
> If, as you write, your NFS server is in a domU but the dom0 crashes, then this is not an NFS issue but a load issue (it still points to the Xen kernel, though ;)
Yup :-) I have managed to find a workaround, though. I replaced the nfs-kernel-server package with unfs3, a userspace NFSv3 server. It's a lot more stable now. The only downside is that it doesn't support file locking, but that's not really an issue for me. I use it in a SOHO setting with only a few computers using the fileserver (and mostly for reading at that).
Thanks!