<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Disabling node TOC? in Operating System - HP-UX</title>
    <link>https://community.hpe.com/t5/operating-system-hp-ux/disabling-node-toc/m-p/4389409#M538014</link>
    <description>It is indeed a rare event, but when it happens (e.g. during a maintenance event) it can be catastrophic.  It seems like in the situation where it would normally reboot (all HB links down and no cluster lock) it could just shut *itself* down, but that doesn't seem to be an option...</description>
    <pubDate>Fri, 27 Mar 2009 13:56:37 GMT</pubDate>
    <dc:creator>Sarah Nordstrom</dc:creator>
    <dc:date>2009-03-27T13:56:37Z</dc:date>
    <item>
      <title>Disabling node TOC?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/disabling-node-toc/m-p/4389407#M538012</link>
      <description>Is there any (supported) way to completely disable Serviceguard rebooting a node entirely?  We have a number of machines that run more than one service (some not using MC/SG), and when SG reboots a node, all the other services on the node are also killed.</description>
      <pubDate>Fri, 27 Mar 2009 13:12:24 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/disabling-node-toc/m-p/4389407#M538012</guid>
      <dc:creator>Sarah Nordstrom</dc:creator>
      <dc:date>2009-03-27T13:12:24Z</dc:date>
    </item>
    <item>
      <title>Re: Disabling node TOC?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/disabling-node-toc/m-p/4389408#M538013</link>
      <description>If your packages have the "service_fail_fast_enabled" flag set, this forces the node to reboot whenever a failure is detected. That would be a case of "you asked for it, you got it."&lt;BR /&gt;&lt;BR /&gt;If you don't use the "service_fail_fast_enabled" setting, the only reason for SG to trigger a reboot is if all the heartbeat connections are lost and the node failed to get the cluster lock. If this happens often, you should work on improving the reliability and/or redundancy of your heartbeat connections. &lt;BR /&gt;&lt;BR /&gt;In a robust cluster, a SG-triggered reboot should be a very rare event - and when it happens, the rebooting node will usually be isolated by network faults so it could not provide any services to anyone at that time anyway.&lt;BR /&gt;&lt;BR /&gt;Avoiding the reboots completely would require developing a new strategy for avoiding the split-brain situations. This strategy must be absolutely bulletproof: a wrong decision here means either not providing a service when you could, or corrupting your package disks.&lt;BR /&gt;&lt;BR /&gt;MK</description>
      <pubDate>Fri, 27 Mar 2009 13:52:12 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/disabling-node-toc/m-p/4389408#M538013</guid>
      <dc:creator>Matti_Kurkela</dc:creator>
      <dc:date>2009-03-27T13:52:12Z</dc:date>
    </item>
    <item>
      <title>Re: Disabling node TOC?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/disabling-node-toc/m-p/4389409#M538014</link>
      <description>It is indeed a rare event, but when it happens (e.g. during a maintenance event) it can be catastrophic.  It seems like in the situation where it would normally reboot (all HB links down and no cluster lock) it could just shut *itself* down, but that doesn't seem to be an option...</description>
      <pubDate>Fri, 27 Mar 2009 13:56:37 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/disabling-node-toc/m-p/4389409#M538014</guid>
      <dc:creator>Sarah Nordstrom</dc:creator>
      <dc:date>2009-03-27T13:56:37Z</dc:date>
    </item>
    <item>
      <title>Re: Disabling node TOC?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/disabling-node-toc/m-p/4389410#M538015</link>
      <description>The solution in that case is to tell ServiceGuard in advance that a node is going to go away, using the "cmhaltnode" command. &lt;BR /&gt;&lt;BR /&gt;It causes all running packages on that node to shut down and failover to other nodes (if enabled and possible), *and* it prepares the rest of the cluster to accept that the halting node may go off-line after the halt operation is completed. &lt;BR /&gt;&lt;BR /&gt;The "cmhaltnode" command is *not* in any way similar to the "shutdown" command: the OS of the node and any non-packaged services on the node will keep running, only ServiceGuard operations on the node are shut down.&lt;BR /&gt;&lt;BR /&gt;If you have a two-node cluster and you use cmhaltnode to stop one node, check the syslog of the other node: you will see a message about shutting down the "safety time protection" (= the "deadman" panic reboot module) as the cluster transitions into single-node operation. After that, it's safe to do whatever you want with the halted node.&lt;BR /&gt;&lt;BR /&gt;If this is not applicable to your situation, please explain in more detail. If we can find out exactly why the reboot is triggered in your situation, perhaps we can find a way to avoid it.&lt;BR /&gt;&lt;BR /&gt;MK</description>
      <pubDate>Sun, 29 Mar 2009 16:42:46 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/disabling-node-toc/m-p/4389410#M538015</guid>
      <dc:creator>Matti_Kurkela</dc:creator>
      <dc:date>2009-03-29T16:42:46Z</dc:date>
    </item>
  </channel>
</rss>

