<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Node failure in Operating System - HP-UX</title>
    <link>https://community.hpe.com/t5/operating-system-hp-ux/node-failure/m-p/3021207#M707735</link>
    <description>Hi,&lt;BR /&gt;&lt;BR /&gt;If you are running short of NICs better you configure the heartbeat on RS232. Refer to the SG documentation on how to set this up. Also a quick requirements for heartbeat could be found at,&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://www.netsysco.com/pdf/Manuals/Sg/HeartbeatReq.pdf" target="_blank"&gt;http://www.netsysco.com/pdf/Manuals/Sg/HeartbeatReq.pdf&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;karthik S S</description>
    <pubDate>Fri, 11 Jul 2003 11:11:35 GMT</pubDate>
    <dc:creator>Karthik S S</dc:creator>
    <dc:date>2003-07-11T11:11:35Z</dc:date>
    <item>
      <title>Node failure</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/node-failure/m-p/3021203#M707731</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;I installed a 2 nodes cluster with SG A.11.14 with one shared SCSI external drive (configured as lock device) and only one lan on each node (which is also the heartbeat lan).&lt;BR /&gt;&lt;BR /&gt;I try to test a failover but the things do not go as I expect.&lt;BR /&gt;&lt;BR /&gt;I have 2 cases:&lt;BR /&gt;&lt;BR /&gt;1) I get out the network cable of node 1&lt;BR /&gt;2) I power off node 1&lt;BR /&gt;&lt;BR /&gt;In both cases the second node reboots.&lt;BR /&gt;I expected that it will host all the resources previously on the node1.&lt;BR /&gt;&lt;BR /&gt;After reboot node2 can not even form the cluster, complaining that it can not get the OS version of node1. Shouldn't it go on running the cluster?&lt;BR /&gt;&lt;BR /&gt;Do I have to do a special configuration? &lt;BR /&gt;What could be not well configured?&lt;BR /&gt;&lt;BR /&gt;Thanks,&lt;BR /&gt;Cristi</description>
      <pubDate>Fri, 11 Jul 2003 09:42:11 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/node-failure/m-p/3021203#M707731</guid>
      <dc:creator>Cristi BODNARIUC</dc:creator>
      <dc:date>2003-07-11T09:42:11Z</dc:date>
    </item>
    <item>
      <title>Re: Node failure</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/node-failure/m-p/3021204#M707732</link>
      <description>Do you not have a standby lan for the heartbeat? if so, hen you may very well see the incorrect node TOC.&lt;BR /&gt;How are the nodes connected via lans, and how is the scsi connected? what are the disc and controller scsi addresses? what do the syslogs and OLDsyslogs show on each node?&lt;BR /&gt;Read the manuals at http:/docs.hp.com/hpux/ha for an idea on how to configure the cluster</description>
      <pubDate>Fri, 11 Jul 2003 09:46:16 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/node-failure/m-p/3021204#M707732</guid>
      <dc:creator>melvyn burnard</dc:creator>
      <dc:date>2003-07-11T09:46:16Z</dc:date>
    </item>
    <item>
      <title>Re: Node failure</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/node-failure/m-p/3021205#M707733</link>
      <description>Hello,&lt;BR /&gt;&lt;BR /&gt;that is why you should have a phyiscally separate HeartBeat-LAN.&lt;BR /&gt;&lt;BR /&gt;If you have no LAN communication between the nodes at all, they both run for the lock disk to decide which one has to TOC. The other one will carry on as a one node cluster. So chances are 50% the node you expected to TOC will TOC.....&lt;BR /&gt;&lt;BR /&gt;This is called arbitration. There is a lot of information about it in the manuals at docs.hp.com&lt;BR /&gt;&lt;BR /&gt;Regards&lt;BR /&gt;Bernhard</description>
      <pubDate>Fri, 11 Jul 2003 09:53:57 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/node-failure/m-p/3021205#M707733</guid>
      <dc:creator>Bernhard Mueller</dc:creator>
      <dc:date>2003-07-11T09:53:57Z</dc:date>
    </item>
    <item>
      <title>Re: Node failure</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/node-failure/m-p/3021206#M707734</link>
      <description>Hi, &lt;BR /&gt;&lt;BR /&gt;I thought that for the first case (when taking out the network cable from node1) the problem could come from the fact that there is no dedicated heartbeat way.&lt;BR /&gt;&lt;BR /&gt;But when switching the power off on node1 I think it is not a heartbeat problem anymore and the second node should not TOC.&lt;BR /&gt;&lt;BR /&gt;Maybe I am stil missing something. I will keep reading the manuals :)&lt;BR /&gt;&lt;BR /&gt;The 2 nodes are connected to the company network (both are conected to a switch).&lt;BR /&gt;The external disk has 2 ends, one connected to node1 and the other to node2. It is powered separately.&lt;BR /&gt;&lt;BR /&gt;After the reboot of node2 I can start the cluster with cmruncl -n node2 and the cluster runs well.&lt;BR /&gt;</description>
      <pubDate>Fri, 11 Jul 2003 10:51:02 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/node-failure/m-p/3021206#M707734</guid>
      <dc:creator>Cristi BODNARIUC</dc:creator>
      <dc:date>2003-07-11T10:51:02Z</dc:date>
    </item>
    <item>
      <title>Re: Node failure</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/node-failure/m-p/3021207#M707735</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;If you are running short of NICs better you configure the heartbeat on RS232. Refer to the SG documentation on how to set this up. Also a quick requirements for heartbeat could be found at,&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://www.netsysco.com/pdf/Manuals/Sg/HeartbeatReq.pdf" target="_blank"&gt;http://www.netsysco.com/pdf/Manuals/Sg/HeartbeatReq.pdf&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;karthik S S</description>
      <pubDate>Fri, 11 Jul 2003 11:11:35 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/node-failure/m-p/3021207#M707735</guid>
      <dc:creator>Karthik S S</dc:creator>
      <dc:date>2003-07-11T11:11:35Z</dc:date>
    </item>
    <item>
      <title>Re: Node failure</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/node-failure/m-p/3021208#M707736</link>
      <description>Again, what do your syslog and OLDsyslog files tell you.&lt;BR /&gt;Is your cluster lock disc actually working? what type of disc is it?&lt;BR /&gt;And I would not recommend a serial heartbeat unless you cann really not afford at least another lan card.&lt;BR /&gt;</description>
      <pubDate>Fri, 11 Jul 2003 11:25:33 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/node-failure/m-p/3021208#M707736</guid>
      <dc:creator>melvyn burnard</dc:creator>
      <dc:date>2003-07-11T11:25:33Z</dc:date>
    </item>
    <item>
      <title>Re: Node failure</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/node-failure/m-p/3021209#M707737</link>
      <description>Cristi,&lt;BR /&gt;&lt;BR /&gt;I believe there could be a problem with the binary cmclconfig file, since your assumptions for case 2 are correct. So delete them and do another chcheckconf / cmapplyconf.&lt;BR /&gt;&lt;BR /&gt;One other thing to check is your .rhosts or cmclnodelist to include BOTH nodes on BOTH nodes. That could be the problem why node2 cannot form the cluster but a cmruncl -n node2 will work.&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;Bernhard</description>
      <pubDate>Fri, 11 Jul 2003 11:48:45 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/node-failure/m-p/3021209#M707737</guid>
      <dc:creator>Bernhard Mueller</dc:creator>
      <dc:date>2003-07-11T11:48:45Z</dc:date>
    </item>
    <item>
      <title>Re: Node failure</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/node-failure/m-p/3021210#M707738</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;Thank you all for your help.&lt;BR /&gt;&lt;BR /&gt;It seems that he problem was in the binary cluster config file which I did not compile/redistribute after I have changed the SCSI disk (with one with different SCSI ID).&lt;BR /&gt;&lt;BR /&gt;Now if I shutdown a node the other takes over all the packages.&lt;BR /&gt;&lt;BR /&gt;Thanks,&lt;BR /&gt;Cristi</description>
      <pubDate>Tue, 15 Jul 2003 15:54:01 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/node-failure/m-p/3021210#M707738</guid>
      <dc:creator>Cristi BODNARIUC</dc:creator>
      <dc:date>2003-07-15T15:54:01Z</dc:date>
    </item>
  </channel>
</rss>

