<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: lan failure simulation in Operating System - HP-UX</title>
    <link>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651941#M712092</link>
    <description>Christian&lt;BR /&gt;&lt;BR /&gt;I do not run service guard but it looks like a heartbeat setting or a patch related to heartbeat.&lt;BR /&gt;&lt;BR /&gt;Paula</description>
    <pubDate>Thu, 24 Jan 2002 11:59:43 GMT</pubDate>
    <dc:creator>Paula J Frazer-Campbell</dc:creator>
    <dc:date>2002-01-24T11:59:43Z</dc:date>
    <item>
      <title>lan failure simulation</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651939#M712090</link>
      <description>Hi guys,&lt;BR /&gt;&lt;BR /&gt;I'm doing lan link tests and I got an unexpect result. When I disconected all cables from one&lt;BR /&gt;machine, the other, that has all cables conected rebooted and the one that has no cables got the lock and still active. the&lt;BR /&gt;command cmviewcl show that lan interfaces wasnt&lt;BR /&gt;down.&lt;BR /&gt;&lt;BR /&gt;Have someone pass throught this problem ? &lt;BR /&gt;Now, I'm just using core-io lan, HP-UX 11 and&lt;BR /&gt;MCSG 11.07&lt;BR /&gt;&lt;BR /&gt;thanks in advace!</description>
      <pubDate>Thu, 24 Jan 2002 11:44:27 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651939#M712090</guid>
      <dc:creator>christian_2</dc:creator>
      <dc:date>2002-01-24T11:44:27Z</dc:date>
    </item>
    <item>
      <title>Re: lan failure simulation</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651940#M712091</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;Have you checked syslog.log for error messages from cmcld ?&lt;BR /&gt;&lt;BR /&gt;Hilary&lt;BR /&gt;</description>
      <pubDate>Thu, 24 Jan 2002 11:57:27 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651940#M712091</guid>
      <dc:creator>BFA6</dc:creator>
      <dc:date>2002-01-24T11:57:27Z</dc:date>
    </item>
    <item>
      <title>Re: lan failure simulation</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651941#M712092</link>
      <description>Christian&lt;BR /&gt;&lt;BR /&gt;I do not run service guard but it looks like a heartbeat setting or a patch related to heartbeat.&lt;BR /&gt;&lt;BR /&gt;Paula</description>
      <pubDate>Thu, 24 Jan 2002 11:59:43 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651941#M712092</guid>
      <dc:creator>Paula J Frazer-Campbell</dc:creator>
      <dc:date>2002-01-24T11:59:43Z</dc:date>
    </item>
    <item>
      <title>Re: lan failure simulation</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651942#M712093</link>
      <description>Christian,&lt;BR /&gt;&lt;BR /&gt;Another thought.&lt;BR /&gt;If node A is running package &amp;amp; has cluster lock disk, and you pull lan cables from A (both data &amp;amp; heartbeat), both machines will think the other has died &amp;amp; race for the cluster lock disk.  A already has it, B won't get the lock disk, so will TOC.&lt;BR /&gt;&lt;BR /&gt;Hilary&lt;BR /&gt;</description>
      <pubDate>Thu, 24 Jan 2002 12:09:40 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651942#M712093</guid>
      <dc:creator>BFA6</dc:creator>
      <dc:date>2002-01-24T12:09:40Z</dc:date>
    </item>
    <item>
      <title>Re: lan failure simulation</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651943#M712094</link>
      <description>Hi, this sounds like a patch problem. Be sure you have installed last patches for SG and LAN, etc..&lt;BR /&gt;&lt;BR /&gt;David.</description>
      <pubDate>Thu, 24 Jan 2002 12:13:10 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651943#M712094</guid>
      <dc:creator>David Navarro</dc:creator>
      <dc:date>2002-01-24T12:13:10Z</dc:date>
    </item>
    <item>
      <title>Re: lan failure simulation</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651944#M712095</link>
      <description>Hi, again, I have read Hilary answer,and I agree with their opinion. If you disconnect all cables, communications betwen nodes are lossed, then cluster must be reformed, then first machine that get cluster lock. will form the new cluster.&lt;BR /&gt;&lt;BR /&gt;A good test, can be disconnect just one network cable, look if traffic and addresses are conmuted to the other one. Then disconnect this other, I think package will be transferred to alternate node in this situation.&lt;BR /&gt;&lt;BR /&gt;David.</description>
      <pubDate>Thu, 24 Jan 2002 12:20:50 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651944#M712095</guid>
      <dc:creator>David Navarro</dc:creator>
      <dc:date>2002-01-24T12:20:50Z</dc:date>
    </item>
    <item>
      <title>Re: lan failure simulation</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651945#M712096</link>
      <description>thats normal.. service guard check this..&lt;BR /&gt;&lt;BR /&gt;rather than ifconfig lan down &lt;BR /&gt;use lanadmin to reset the lan.&lt;BR /&gt;That should simulate it.&lt;BR /&gt;&lt;BR /&gt;Or pull cable or wet the card!&lt;BR /&gt;I prefer lanadmin!&lt;BR /&gt;&lt;BR /&gt;Later,&lt;BR /&gt;Bill</description>
      <pubDate>Thu, 24 Jan 2002 12:38:07 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651945#M712096</guid>
      <dc:creator>Bill McNAMARA_1</dc:creator>
      <dc:date>2002-01-24T12:38:07Z</dc:date>
    </item>
    <item>
      <title>Re: lan failure simulation</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651946#M712098</link>
      <description>What you are seeing is normal behaviour, based on the fact that you have Multiple Points of failure.&lt;BR /&gt;As previously said, if the node that stayed up already had access to the cluster lock  disc, because ALL communiactions were lost, that node managed to grab the cluster lock diosc BEFORE the other node, and hence stayed up, forcing the other node to TOC.&lt;BR /&gt;&lt;BR /&gt;You should definitely check you ar patched to the lates possible level, bearing in mind that SG patches are NOT included in the Patch Bundle CD's!</description>
      <pubDate>Thu, 24 Jan 2002 12:56:43 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651946#M712098</guid>
      <dc:creator>melvyn burnard</dc:creator>
      <dc:date>2002-01-24T12:56:43Z</dc:date>
    </item>
    <item>
      <title>Re: lan failure simulation</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651947#M712101</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;Take a look at this thread from the SG FAQ, which tries to explain the scenario you are having,&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://docs.hp.com/hpux/onlinedocs/ha/haFAQindex2.html#All%20Networks%20fail,%20which%20node%20wins?" target="_blank"&gt;http://docs.hp.com/hpux/onlinedocs/ha/haFAQindex2.html#All%20Networks%20fail,%20which%20node%20wins?&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Here is the FAQ,&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://docs.hp.com/hpux/onlinedocs/ha/haFAQindex2.html" target="_blank"&gt;http://docs.hp.com/hpux/onlinedocs/ha/haFAQindex2.html&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Hope this helps.&lt;BR /&gt;&lt;BR /&gt;Regds&lt;BR /&gt;</description>
      <pubDate>Thu, 24 Jan 2002 13:55:02 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651947#M712101</guid>
      <dc:creator>Sanjay_6</dc:creator>
      <dc:date>2002-01-24T13:55:02Z</dc:date>
    </item>
    <item>
      <title>Re: lan failure simulation</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651948#M712103</link>
      <description>thank you guys, but this&lt;BR /&gt;is not an correct behave.&lt;BR /&gt;there was a GOOD node on&lt;BR /&gt;cluster and this should&lt;BR /&gt;be up, get the lock and&lt;BR /&gt;take the ownership of&lt;BR /&gt;all packages running on node&lt;BR /&gt;that has all cables disconected. this one, that&lt;BR /&gt;have cables disconect should not get the lock because his&lt;BR /&gt;networks were down. if someone&lt;BR /&gt;have tested this on both nodes, please notify me&lt;BR /&gt;&lt;BR /&gt;thanks again</description>
      <pubDate>Sat, 26 Jan 2002 14:02:31 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651948#M712103</guid>
      <dc:creator>christian_2</dc:creator>
      <dc:date>2002-01-26T14:02:31Z</dc:date>
    </item>
    <item>
      <title>Re: lan failure simulation</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651949#M712105</link>
      <description>No this *is* good behaviour when you consider what ServiceGuard is designed to do:&lt;BR /&gt;&lt;BR /&gt;- Protect against single points of failure&lt;BR /&gt;- Under no circumstances corrupt your data&lt;BR /&gt;&lt;BR /&gt;What you have simulated is a situation with multiple points of failure, consider what could be going on in this scenario:&lt;BR /&gt;&lt;BR /&gt;Both nodes cannot talk to each other except via cluster lock - what is very clearly a network failure on one machine to you does not necessarily appear to be from the other machine - consider that from the point of view of the other machine it can still see the network but doesn't know what state the other node is - it could be that the multiple point of failures is networking equipment between the two nodes - now both nodes still have active networks (and presumably some clients can connect to each) but they can't talk to each other. If the nodes behaved as you have suggested they should then they would BOTH beleive they are the good node, and both try to run your package - not good for the integrity of your data.&lt;BR /&gt;&lt;BR /&gt;This is an immutable rule of ServiceGuard in a two node cluster - when either node doesn't know what the other is doing, it's cluster lock time, and it won't always be the genuinely good node that wins.&lt;BR /&gt;&lt;BR /&gt;HTH&lt;BR /&gt;&lt;BR /&gt;Duncan&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Sat, 26 Jan 2002 22:05:52 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651949#M712105</guid>
      <dc:creator>Duncan Edmonstone</dc:creator>
      <dc:date>2002-01-26T22:05:52Z</dc:date>
    </item>
    <item>
      <title>Re: lan failure simulation</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651950#M712107</link>
      <description>thanks duncan, but&lt;BR /&gt;I still belive that is not&lt;BR /&gt;correct. If the node has&lt;BR /&gt;all networks down, it should&lt;BR /&gt;not stay running and the &lt;BR /&gt;other that has all cables ok&lt;BR /&gt;call the TOC. Imagine this&lt;BR /&gt;situation in a production &lt;BR /&gt;environmet. An node, that&lt;BR /&gt;is all right with it, call&lt;BR /&gt;the TOC unexpect. I agree&lt;BR /&gt;with this situation if BOTH&lt;BR /&gt;have at least one network up.&lt;BR /&gt;I will open an case in HP call&lt;BR /&gt;center. Thank you all</description>
      <pubDate>Mon, 28 Jan 2002 12:18:25 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651950#M712107</guid>
      <dc:creator>christian_2</dc:creator>
      <dc:date>2002-01-28T12:18:25Z</dc:date>
    </item>
    <item>
      <title>Re: lan failure simulation</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651951#M712109</link>
      <description>You are incorrect, in that this is normal SG behaviour. The fact is you are talking about having MULTIPLE points of failure, not a Single one.&lt;BR /&gt;&lt;BR /&gt;If you wish to try to prevent this, then you could consider setting up a serial heartbeat link, as this is designed to do exactly what you are looking for, i.e. ensure the node with the good network connections stays up.&lt;BR /&gt;There are some not good things about having a serial heartbeat as well.&lt;BR /&gt;The bottom line is that you are pullling ALL network connections, and if your network redundancy design is such that this could happen, I would redesign the networking area.&lt;BR /&gt;&lt;BR /&gt;Opening a call with HP should give you the same answer.&lt;BR /&gt;</description>
      <pubDate>Mon, 28 Jan 2002 12:52:36 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/lan-failure-simulation/m-p/2651951#M712109</guid>
      <dc:creator>melvyn burnard</dc:creator>
      <dc:date>2002-01-28T12:52:36Z</dc:date>
    </item>
  </channel>
</rss>

