ProLiant Servers (ML,DL,SL)
1825770 Members
2056 Online
109687 Solutions
New Discussion

Re: bl460c gen9 bios/hardware health failed

 
AZalkat
Occasional Visitor

bl460c gen9 bios/hardware health failed

hi

i have this probleme in server bl460c gen 9

the error in IML:

"ID","Severity","Class","Last Update","Initial Update","Count","Description",
"22","Critical","System Error","09/22/2021 07:26","09/22/2021 07:26","1","An Unrecoverable System Error (NMI) has occurred (Service Information: 0x00000000, 0xF000E2C3)",

I hope someone can help tnks

4 REPLIES 4
Ihaqueit
Trusted Contributor

Re: bl460c gen9 bios/hardware health failed

This NMI seems to be know issue with RHEL 

NMI An Unrecoverable System Error (NMI) has occurred (iLO application watchdog timeout NMI, Service Information: 0x0000002B, 0x00000000)

please see the below adviosry from REDHAT

https://access.redhat.com/solutions/1309033

 

IML log has the following entry:


An Unrecoverable System Error (NMI) has occurred (System error code 0x0000002B, 0x00000000)
Resolution

By default systemd starts a watchdog timer on shutdown. Disable ShutdownWatchdogSec to resolve this issue. To disable it, please open /etc/systemd/system.conf file and find following line:


#ShutdownWatchdogSec=10min
Change them to:


ShutdownWatchdogSec=0
Save the file and after that run:


# systemctl daemon-reexec
to allow systemd to know about the updated configuration or reboot the system.

NOTE: You may also wish to look at RuntimeWatchdogSec in the same file, it is disabled by default, please do not enable -it without specific reasons for doing so.

--------------------------------------------------------------------------------------------------------------------------------------
If still issue persist we recommand log a case with REDHAT. 

If you need futher troubleshooting from Hardware side kindly log a case with HPE and share all the logs (AHS and SOS report)

I Haq
AZalkat
Occasional Visitor

Re: bl460c gen9 bios/hardware health failed

tks for your help but the code  error it's not the same

in my case the core error is

"An Unrecoverable System Error (NMI) has occurred (Service Information: 0x00000000, 0xF000E2C3)",

not

NMI An Unrecoverable System Error (NMI) has occurred (iLO application watchdog timeout NMI, Service Information: 0x0000002B, 0x00000000)

 

 

bande18
Occasional Advisor

Re: bl460c gen9 bios/hardware health failed

@AZalkat Did you ever find a root cause for this problem? The exact same issue (same message and service information) happened to me seemingly out of nowhere on my BL460c gen9 server yesterday.

Thanks,

Levi

Sunitha_Mod
Honored Contributor

Re: bl460c gen9 bios/hardware health failed

@bande18 

Hello Levi, 

Thank you for writing to us! 

You might want to consider creating a new topic by utilizing the "New Discussion" button, as this will not only enhance visibility compared to the old topic but also boost your chances of receiving responses from experts.