11-26-2006 09:27 PM
I used to work with HP-UX PA-RISC systems, and when there was an air-conditioning problem I would see a clear 'OVERTEMP_CRIT WARNING' message in the syslog.
Now I mostly deal with HP-UX Itanium systems, and in the same scenario the syslog shows only this message:
EMS [4471]: EMS Event Notification
Value: "MAJORWARNING (3)" for Resource: "/system/events/ia64_corehw/core_hw" (Threshold: >= " 3")
event details: /opt/resmon/bin/resdata -R 293011458 -r /system/events/ia64_corehw/core_hw -n 293011457 -a
I now know that I can retrieve more detail from the ELM message:
>-- Event Monitoring Service Event Notification --<
/system/events/ia64_corehw/core_hw is >= 3.
Its current value is MAJORWARNING(3).
Event data from monitor:
Event Time..........: Thu Nov 23 18:34:19 2006
Severity............: MAJORWARNING
Monitor.............: ia64_corehw
Event #.............: 101011
System..............: myHost.mydomain
Summary:
System Temperature is at non-recoverable level.
Is there a way to get those clear, 'old-style' messages back into the syslog on an HP-UX Itanium system?
Regards
Franky
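One possible workaround, offered only as a rough sketch and not as a supported HP mechanism: a small script could pull the readable fields out of the EMS notification text quoted above and condense them into a single line suitable for syslog. The sketch assumes Python is available on the host and that the notification text has already been captured somehow; the function names `parse_ems_event` and `to_syslog_message` are hypothetical, and only the field labels are taken from the message above.

```python
import re

# Sample text copied from the EMS notification quoted in the post.
SAMPLE_EVENT = """
>-- Event Monitoring Service Event Notification --<
/system/events/ia64_corehw/core_hw is >= 3.
Its current value is MAJORWARNING(3).
Event data from monitor:
Event Time..........: Thu Nov 23 18:34:19 2006
Severity............: MAJORWARNING
Monitor.............: ia64_corehw
Event #.............: 101011
System..............: myHost.mydomain
Summary:
System Temperature is at non-recoverable level.
"""

# Matches lines of the form "Label......: value" (label, dot padding, colon).
FIELD_RE = re.compile(r"^(?P<key>[A-Za-z# ]+?)\.+:\s*(?P<value>.*)$")


def parse_ems_event(text):
    """Return the dotted 'Label....: value' fields plus the Summary line."""
    fields = {}
    lines = [line.strip() for line in text.splitlines()]
    for i, line in enumerate(lines):
        if line == "Summary:":
            # The summary is the first non-empty line after the label.
            for following in lines[i + 1:]:
                if following:
                    fields["Summary"] = following
                    break
            continue
        match = FIELD_RE.match(line)
        if match:
            fields[match.group("key").strip()] = match.group("value").strip()
    return fields


def to_syslog_message(fields):
    """Condense the parsed fields into one human-readable line."""
    return "{} event {} {}: {}".format(
        fields.get("Monitor", "unknown-monitor"),
        fields.get("Event #", "?"),
        fields.get("Severity", "UNKNOWN"),
        fields.get("Summary", "no summary"),
    )


if __name__ == "__main__":
    print(to_syslog_message(parse_ems_event(SAMPLE_EVENT)))
```

The resulting line could then be handed to whatever writes to syslog on the host; how the notification text is delivered to the script is left open here.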
11-26-2006 10:11 PM
Re: critical temperature warning difference ia64 and PA-RISC ?
If the system temperature reaches a certain threshold, the server will go into a halt state.
11-28-2006 02:34 AM
On PA systems, you get EMS event 33 from dm_core_hw when the first temperature threshold (Low) is reached, and envd writes the OVERTEMP_CRIT message to syslog at around the same time.
You then get EMS event 34 when the next threshold (Mid) is reached, and envd should initiate a shutdown (according to the configuration in /etc/envd.conf).
If the last level (High) is reached because the system is still running for some reason, it is powered off.
On IPF (Itanium) systems, things have changed. At the Low threshold, you should still see OVERTEMP_CRIT in syslog. You also get an EMS event from either fpl_em or ia64_corehw, depending on whether the system is cell-based. At the Mid threshold, the system is shut down by the firmware, not by envd, and no events or messages are logged. This is a 'soft' shutdown. If the High threshold is reached, it is again a hard power-off.
What system type do you have, and which version of the OnlineDiags (STM)? There were some problems where envd was not notified of the Low threshold (OVERTEMP_CRIT) on non-cellular systems, but that should be fixed in current versions of the OnlineDiags.
Andrew
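For quick reference, the behaviour described in this reply can be written down as a small lookup table. This is only a transcription of the post into Python, not anything read from envd or the firmware; the names `OVERTEMP_BEHAVIOUR` and `describe` are made up for illustration, and `None` marks anything the post does not state.

```python
# (platform, threshold) -> what the reply says happens.
# "logged" is a list of what gets logged, [] when the reply says nothing
# is logged, and None when the reply does not say either way.
OVERTEMP_BEHAVIOUR = {
    ("PA", "low"): {
        "action": "warning only",
        "logged": ["EMS event 33 from dm_core_hw",
                   "OVERTEMP_CRIT from envd in syslog"],
    },
    ("PA", "mid"): {
        "action": "shutdown initiated by envd (per /etc/envd.conf)",
        "logged": ["EMS event 34"],
    },
    ("PA", "high"): {
        "action": "powered off",
        "logged": None,
    },
    ("IPF", "low"): {
        "action": "warning only",
        "logged": ["OVERTEMP_CRIT in syslog",
                   "EMS event from fpl_em or ia64_corehw"],
    },
    ("IPF", "mid"): {
        "action": "soft shutdown by firmware, not envd",
        "logged": [],
    },
    ("IPF", "high"): {
        "action": "hard power off",
        "logged": None,
    },
}


def describe(platform, threshold):
    """Return a one-line summary for a platform/threshold pair."""
    entry = OVERTEMP_BEHAVIOUR[(platform, threshold)]
    if entry["logged"] is None:
        logged = "logging not stated"
    elif entry["logged"]:
        logged = "logged: " + "; ".join(entry["logged"])
    else:
        logged = "nothing logged"
    return "{}/{}: {} ({})".format(platform, threshold, entry["action"], logged)


if __name__ == "__main__":
    for key in OVERTEMP_BEHAVIOUR:
        print(describe(*key))
```

The entry that matters for the original question is IPF/mid: by the time the firmware shuts the system down, nothing further is written, so the Low-threshold message is the only one to rely on.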
11-28-2006 03:20 AM
Re: critical temperature warning difference ia64 and PA-RISC ?
Even better, always have at least N + 1 cooling capacity so that you can tolerate the failure of any one unit without problems. The problem with relying on a warning scheme is getting someone to actually shut down the equipment in a timely manner. The computer may shut itself down, but what about other devices, such as disk and tape drives, which will continue to run even if the temperature excursion is extreme?
... and better still is to have a main breaker equipped with an auxiliary trip coil, connected to a thermal switch that disconnects all power should a preset value be exceeded. This keeps you out of trouble should more than one of your HVAC units fail.
11-28-2006 03:39 AM
Re: critical temperature warning difference ia64 and PA-RISC ?
The question was about the apparent differences in warnings for overtemp conditions. It was NOT about different temperatures at which the warnings appear.
Andrew
11-28-2006 03:45 AM
Re: critical temperature warning difference ia64 and PA-RISC ?
Thanks for your answers, all. Clay's too, even though that was not the question.
Franky
11-28-2006 03:50 AM
Re: critical temperature warning difference ia64 and PA-RISC ?
11-28-2006 03:53 AM
Re: critical temperature warning difference ia64 and PA-RISC ?
See you..
11-29-2006 08:57 PM
Re: critical temperature warning difference ia64 and PA-RISC ?
Thanks for the input.
I agree that in most cases logging temperature errors makes no sense, because there is too little time to intervene.
However, these systems are located on the other side of the planet, and the warnings help us work out what happened.
Franky
11-29-2006 09:04 PM
Re: critical temperature warning difference ia64 and PA-RISC ?
The STM version we run is:
Support Tools Manager, Version C.46.05, Product Number B4708AA
Franky