ProLiant Servers (ML,DL,SL)
cancel
Showing results for 
Search instead for 
Did you mean: 

RedHat NMI -reboot

David J Andersson
Occasional Visitor

RedHat NMI -reboot

Hello,

I have a DL380 server running Red Hat Enterprise Linux ES release 4 (Nahant Update 5). It runs just fine untill I installed HP ProLiant Support Pack - hphealth.

When the hphealth service is running, my server reboots several times per day. If I shutdown the hphealth service my server runs with any problems.

Is it possible to configure hphealth to not reboot my server, when a NMI is recived?

When I check my logs the /var/log/messages says:

kernel: Uhhuh. NMI received. Dazed and confused, but trying to continue
kernel: You probably have a hardware problem with your RAM chips
hpasmd[9574]: CRITICAL: hpasmd: An Unrecoverable System Error (NMI) has occured
shutdown: shutting down for system reboot
init: Switching to runlevel: 6

And Yes I have run diagnostics on my hardware and it did not find anything suspicious.
1 REPLY
David J Andersson
Occasional Visitor

Re: RedHat NMI -reboot

I also get this:

kernel: irq 169: nobody cared! (screaming interrupt?)
kernel: irq 169: Please try booting with acpi=off and report a bug
kernel: [] __report_bad_irq+0x3a/0x77
kernel: [] note_interrupt+0xea/0x115
kernel: [] do_IRQ+0x143/0x1ae
kernel: [] common_interrupt+0x18/0x20
kernel: [] mwait_idle+0x33/0x42
kernel: [] cpu_idle+0x26/0x3b
kernel: handlers:
kernel: [] (usb_hcd_irq+0x0/0x4b)
kernel: [] (usb_hcd_irq+0x0/0x4b)
kernel: Disabling IRQ #169