<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Fence Hang when detach power supply, Urgent in Operating System - Linux</title>
    <link>https://community.hpe.com/t5/operating-system-linux/fence-hang-when-detach-power-supply-urgent/m-p/4381646#M35646</link>
    <description>maxi,&lt;BR /&gt;&lt;BR /&gt;Yes, that is what I expect. If you're removing multiple power sources, you're simulating a larger failure than it can handle. Most clusters prevent against a single point of failure. You're introducing at least 2 points of failure (double power failure or supply failure.)&lt;BR /&gt;&lt;BR /&gt;You're powering off the server as well as the iLO, which it needs to fence.&lt;BR /&gt;&lt;BR /&gt;If you want to simulate something more likely, like a kernel panic, bringing the node down, I'd suggest actually crashing the kernel, such as using the Alt-SysRq trigger. Then the node will be properly fenced and activity should continue if it's configured correctly.</description>
    <pubDate>Thu, 19 Mar 2009 19:51:02 GMT</pubDate>
    <dc:creator>macosta</dc:creator>
    <dc:date>2009-03-19T19:51:02Z</dc:date>
    <item>
      <title>Fence Hang when detach power supply, Urgent</title>
      <link>https://community.hpe.com/t5/operating-system-linux/fence-hang-when-detach-power-supply-urgent/m-p/4381641#M35641</link>
      <description>Hi boys!&lt;BR /&gt;&lt;BR /&gt;I've a big problem.&lt;BR /&gt;I've installed RH enterprise 5.2 in cluster modality on 2 hp dl380 server, and I've decide to use the fence ilo to fence server.&lt;BR /&gt;&lt;BR /&gt;This solution work very well if I detach ethernet cable, or manual "switch" (clusvcadm) services from a server to another and so on.&lt;BR /&gt;&lt;BR /&gt;But if I detach the 2 cable from the power supply from one server (that go down :-)), the up server start to try to fencing the server that is down and don't understand that the other is down!!!&lt;BR /&gt;&lt;BR /&gt;So the fencing does not work (is normal the other on hasn't no electricity cable on), and so the up server wait wait and wait... and the services that was on the down server don't "switch" because the cluster wait for fencing.&lt;BR /&gt;&lt;BR /&gt;On the up server log I get:&lt;BR /&gt;&lt;BR /&gt;mhs2 fenced[4137]: fence "mhs1.local" failed&lt;BR /&gt;mhs2 fenced[4137]: fencing node "mhs1.local"&lt;BR /&gt;&lt;BR /&gt;and other line that I've attached on "error.txt" attach.&lt;BR /&gt;&lt;BR /&gt;Only when I restart the down server, the cluster restart to works correctly.&lt;BR /&gt;&lt;BR /&gt;Please, HELP ME!!!!!!&lt;BR /&gt;&lt;BR /&gt;Bye&lt;BR /&gt;mb from Italy&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 17 Mar 2009 20:45:44 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-linux/fence-hang-when-detach-power-supply-urgent/m-p/4381641#M35641</guid>
      <dc:creator>maxi_1</dc:creator>
      <dc:date>2009-03-17T20:45:44Z</dc:date>
    </item>
    <item>
      <title>Re: Fence Hang when detach power supply, Urgent</title>
      <link>https://community.hpe.com/t5/operating-system-linux/fence-hang-when-detach-power-supply-urgent/m-p/4381642#M35642</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;I don't know much about RH cluster, but reading your post I'd suggest you open a bug report at Redhat, as the behaviour you describe looks like a buggy feature.&lt;BR /&gt;&lt;BR /&gt;Is there any "out of band" heartbeat in RH cluster that you may be lacking which may cause the fencing to hang ?&lt;BR /&gt;</description>
      <pubDate>Tue, 17 Mar 2009 21:37:32 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-linux/fence-hang-when-detach-power-supply-urgent/m-p/4381642#M35642</guid>
      <dc:creator>Brem Belguebli</dc:creator>
      <dc:date>2009-03-17T21:37:32Z</dc:date>
    </item>
    <item>
      <title>Re: Fence Hang when detach power supply, Urgent</title>
      <link>https://community.hpe.com/t5/operating-system-linux/fence-hang-when-detach-power-supply-urgent/m-p/4381643#M35643</link>
      <description>If you remove all server power (unplug both power supplies, assuming you have redundant power,) you are confronting the cluster with a multiple-failure scenario. &lt;BR /&gt;&lt;BR /&gt;If the server chassis has no power, the iLO also has no power. The remaining cluster nodes have no way of knowing that the server is powered off, since communication has been severed. The cluster only knows that it cannot reach the node via the network.&lt;BR /&gt;&lt;BR /&gt;If you shut the node down cleanly, the server should leave the cluster in a clean state and not need to be fenced.&lt;BR /&gt;&lt;BR /&gt;I am not as familiar with RH Cluster Suite, but Polyserve works the same way.</description>
      <pubDate>Tue, 17 Mar 2009 22:21:32 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-linux/fence-hang-when-detach-power-supply-urgent/m-p/4381643#M35643</guid>
      <dc:creator>macosta</dc:creator>
      <dc:date>2009-03-17T22:21:32Z</dc:date>
    </item>
    <item>
      <title>Re: Fence Hang when detach power supply, Urgent</title>
      <link>https://community.hpe.com/t5/operating-system-linux/fence-hang-when-detach-power-supply-urgent/m-p/4381644#M35644</link>
      <description>But are you sure that is a normal functionality of a cluster?&lt;BR /&gt;&lt;BR /&gt;So if a Server go down in a 2 component cluster, the other one can't operate a "switch" of the services of the down server?&lt;BR /&gt;&lt;BR /&gt;Is no good for us....&lt;BR /&gt;&lt;BR /&gt;I attach my cluster.conf</description>
      <pubDate>Wed, 18 Mar 2009 06:59:33 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-linux/fence-hang-when-detach-power-supply-urgent/m-p/4381644#M35644</guid>
      <dc:creator>maxi_1</dc:creator>
      <dc:date>2009-03-18T06:59:33Z</dc:date>
    </item>
    <item>
      <title>Re: Fence Hang when detach power supply, Urgent</title>
      <link>https://community.hpe.com/t5/operating-system-linux/fence-hang-when-detach-power-supply-urgent/m-p/4381645#M35645</link>
      <description>Another thing you should have in a 2 nodes cluster is a tie-breaker.&lt;BR /&gt;&lt;BR /&gt;I think RHCS uses a lock lun which is used as  a quorum in case one of the node fails.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 19 Mar 2009 19:42:43 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-linux/fence-hang-when-detach-power-supply-urgent/m-p/4381645#M35645</guid>
      <dc:creator>Brem Belguebli</dc:creator>
      <dc:date>2009-03-19T19:42:43Z</dc:date>
    </item>
    <item>
      <title>Re: Fence Hang when detach power supply, Urgent</title>
      <link>https://community.hpe.com/t5/operating-system-linux/fence-hang-when-detach-power-supply-urgent/m-p/4381646#M35646</link>
      <description>maxi,&lt;BR /&gt;&lt;BR /&gt;Yes, that is what I expect. If you're removing multiple power sources, you're simulating a larger failure than it can handle. Most clusters prevent against a single point of failure. You're introducing at least 2 points of failure (double power failure or supply failure.)&lt;BR /&gt;&lt;BR /&gt;You're powering off the server as well as the iLO, which it needs to fence.&lt;BR /&gt;&lt;BR /&gt;If you want to simulate something more likely, like a kernel panic, bringing the node down, I'd suggest actually crashing the kernel, such as using the Alt-SysRq trigger. Then the node will be properly fenced and activity should continue if it's configured correctly.</description>
      <pubDate>Thu, 19 Mar 2009 19:51:02 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-linux/fence-hang-when-detach-power-supply-urgent/m-p/4381646#M35646</guid>
      <dc:creator>macosta</dc:creator>
      <dc:date>2009-03-19T19:51:02Z</dc:date>
    </item>
    <item>
      <title>Re: Fence Hang when detach power supply, Urgent</title>
      <link>https://community.hpe.com/t5/operating-system-linux/fence-hang-when-detach-power-supply-urgent/m-p/4381647#M35647</link>
      <description>&lt;BR /&gt;Agree with previous posters.  &lt;BR /&gt;&lt;BR /&gt;Lets look at this.  Polyserve fences a node if there is a network outage where the nodes in the cluster cannot communicate to one of the other nodes.  Until it fences the node that is not talking the rest of the cluster does not want to generate i/o to the shared cluster file system since they do not know that the node that is not responding is down and not writing to the disk too which will corrupt data.&lt;BR /&gt;&lt;BR /&gt;So if a node stops talking to the cluster we fence them to make sure they are down and not accessing the disks.&lt;BR /&gt;&lt;BR /&gt;When you killed the power from the cabinet you stopped the node from commuicating with the rest of the cluster plus you prevented the rest of the cluster from fencing the node.&lt;BR /&gt;&lt;BR /&gt;Server fencing is to shut down the node if the OS fails but the ilo works.&lt;BR /&gt;&lt;BR /&gt;There is a command you can execute &lt;BR /&gt;&lt;BR /&gt;mx server markdown &lt;HOSTNAME&gt; &lt;BR /&gt;&lt;BR /&gt;on one of the nodes to say the node was fenced but this should only be done if the node really is down for another reason.&lt;BR /&gt;&lt;BR /&gt;You should make sure the cabinet has multiple sources of power and does not loose all power.&lt;BR /&gt;&lt;BR /&gt;&lt;/HOSTNAME&gt;</description>
      <pubDate>Thu, 02 Apr 2009 12:50:58 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-linux/fence-hang-when-detach-power-supply-urgent/m-p/4381647#M35647</guid>
      <dc:creator>Emil Velez</dc:creator>
      <dc:date>2009-04-02T12:50:58Z</dc:date>
    </item>
    <item>
      <title>Re: Fence Hang when detach power supply, Urgent</title>
      <link>https://community.hpe.com/t5/operating-system-linux/fence-hang-when-detach-power-supply-urgent/m-p/4381648#M35648</link>
      <description>Shalom,&lt;BR /&gt;&lt;BR /&gt;I think this may be a flaw in the fence software.&lt;BR /&gt;&lt;BR /&gt;You may wish to test this with fence from the recently released RHEL 5.3. Other rpms may also be required.&lt;BR /&gt;&lt;BR /&gt;SEP</description>
      <pubDate>Thu, 02 Apr 2009 13:59:22 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-linux/fence-hang-when-detach-power-supply-urgent/m-p/4381648#M35648</guid>
      <dc:creator>Steven E. Protter</dc:creator>
      <dc:date>2009-04-02T13:59:22Z</dc:date>
    </item>
  </channel>
</rss>

