- Community Home
- >
- Servers and Operating Systems
- >
- ProLiant
- >
- ProLiant Servers (ML,DL,SL)
- >
- Proliant dl360p gen8An Unrecoverable SystemError (...
-
-
Forums
- Products
- Servers and Operating Systems
- Storage
- Software
- Services
- HPE GreenLake
- Company
- Events
- Webinars
- Partner Solutions and Certifications
- Local Language
- China - 简体中文
- Japan - 日本語
- Korea - 한국어
- Taiwan - 繁體中文
-
- Advancing Life & Work
- Advantage EX
- Alliances
- Around the Storage Block
- HPE Ezmeral: Uncut
- OEM Solutions
- Servers & Systems: The Right Compute
- Tech Insights
- The Cloud Experience Everywhere
- HPE Blog, Austria, Germany & Switzerland
- Blog HPE, France
- HPE Blog, Italy
- HPE Blog, Japan
- HPE Blog, Middle East
- HPE Blog, Latin America
- HPE Blog, Russia
- HPE Blog, Saudi Arabia
- HPE Blog, South Africa
- HPE Blog, UK & Ireland
- HPE Blog, Poland
-
Blogs
- Advancing Life & Work
- Advantage EX
- Alliances
- Around the Storage Block
- HPE Blog, Latin America
- HPE Blog, Middle East
- HPE Blog, Saudi Arabia
- HPE Blog, South Africa
- HPE Blog, UK & Ireland
- HPE Ezmeral: Uncut
- OEM Solutions
- Servers & Systems: The Right Compute
- Tech Insights
- The Cloud Experience Everywhere
-
Information
- Community
- Welcome
- Getting Started
- FAQ
- Ranking Overview
- Rules of Participation
- Tips and Tricks
- Resources
- Announcements
- Email us
- Feedback
- Information Libraries
- Integrated Systems
- Networking
- Servers
- Storage
- Other HPE Sites
- Support Center
- Aruba Airheads Community
- Enterprise.nxt
- HPE Dev Community
- Cloud28+ Community
- Marketplace
-
Forums
-
Forums
-
Blogs
-
Information
-
English
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content
04-26-2019 01:36 AM
04-26-2019 01:36 AM
An Unrecoverable System Error (NMI) has occurred (iLO application watchdog timeout NMI, Service Information: 0x0000002B, 0x00000000)
Hi All ,
We have one DL380p gen 8 server runnign Red Hat Enterprise Linux Server Release 6.2, it is second time that server crashed and reboot with the following IML logs :
ASR 04/24/2019 22:00 04/24/2019 22:00 1 ASR Detected by System ROM
System Error 04/24/2019 21:58 04/24/2019 21:58 1 An Unrecoverable System Error (NMI) has occurred (iLO application watchdog timeout NMI, Service Information: 0x0000002B, 0x00000000)
sys logs :
========================================
Apr 24 22:01:54 prs3-tir hp-ams[2834]: hpHelper Started . .
Apr 24 22:02:08 prs3-tir hpasmlited[2895]: hpDeferSPDThread: Starting thread to collect DIMM SPD Data.
Apr 24 22:02:08 prs3-tir hpasmlited[2895]: Initialize data structures succesful
Apr 24 22:02:13 prs3-tir hp-ams[2834]: CRITICAL: An Unrecoverable System Error (NMI) has occurred (iLO application watchdog timeout NMI, Service Information: 0x0000002B, 0x00000000)
Apr 24 22:02:14 prs3-tir hp-ams[2834]: CRITICAL: ASR Detected by System ROM
Apr 24 22:02:15 prs3-tir hpasrd[2922]: Starting with poll 1 and timeout 600
Apr 24 22:02:15 prs3-tir hpasrd[2922]: Setting the watchdog timer.
Apr 24 22:02:15 prs3-tir hpasrd[2922]: Found iLO memory at 0xf7df0000.
Apr 24 22:02:15 prs3-tir hpasrd[2922]: Successfully mapped device.
Apr 24 22:02:15 prs3-tir cmanicd: Entering iml_log_link_up(slot: 0, port: 1)
Apr 24 22:02:15 prs3-tir cmanicd: Entering get_event_id(slot: 0, port: 1
Apr 24 22:02:47 prs3-tir hpasmlited[2895]: hpDeferSPDThread: End of Collecting DIMM SPD data.
Apr 24 22:02:48 prs3-tir cmanicd: Existing event id(4) found for the slot and port.
Apr 24 22:02:48 prs3-tir cmanicd: Entering repair_iml_event(slot: 0, port: 1, event: 4)
Apr 24 22:02:48 prs3-tir cmanicd: Entering read_iml_event(slot: 0, port: 1, eventid: 4)
Apr 24 22:02:48 prs3-tir cmanicd: Calling ioctl() to read event id: 4)
Apr 24 22:02:48 prs3-tir cmanicd: Successfully read the event id: 4)
Apr 24 22:02:48 prs3-tir cmanicd: Trying to repair the existing IML Event.
Apr 24 22:02:48 prs3-tir cmanicd: Successfully repaired the IML Event.
Apr 24 22:02:48 prs3-tir cmanicd: Returning from repair_iml_event().
=======================================================
Any one has faced this issue and know how to resolve?
Thanks in Advacnce
Florant
Solved! Go to Solution.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Email to a Friend
- Report Inappropriate Content
05-02-2019 06:41 AM
05-02-2019 06:41 AM
SolutionHi Florant,
This NMI seems to be know issue with RHEL
NMI An Unrecoverable System Error (NMI) has occurred (iLO application watchdog timeout NMI, Service Information: 0x0000002B, 0x00000000)
please see the below adviosry from REDHAT
https://access.redhat.com/solutions/1309033
IML log has the following entry:
An Unrecoverable System Error (NMI) has occurred (System error code 0x0000002B, 0x00000000)
Resolution
By default systemd starts a watchdog timer on shutdown. Disable ShutdownWatchdogSec to resolve this issue. To disable it, please open /etc/systemd/system.conf file and find following line:
#ShutdownWatchdogSec=10min
Change them to:
ShutdownWatchdogSec=0
Save the file and after that run:
# systemctl daemon-reexec
to allow systemd to know about the updated configuration or reboot the system.
NOTE: You may also wish to look at RuntimeWatchdogSec in the same file, it is disabled by default, please do not enable -it without specific reasons for doing so.
--------------------------------------------------------------------------------------------------------------------------------------
If still issue persist we recommand log a case with REDHAT.
If you need futher troubleshooting from Hardware side kindly log a case with HPE and share all the logs (AHS and SOS report)
Regards,
Sangam.
Hewlett Packard Enterprise International
- Communities
- HPE Blogs and Forum
© Copyright 2022 Hewlett Packard Enterprise Development LP