<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Problem with Cluster in Operating System - OpenVMS</title>
    <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881647#M79488</link>
    <description>Vladimir,&lt;BR /&gt;&lt;BR /&gt;Locks, pool, and various quotas come to mind.&lt;BR /&gt;&lt;BR /&gt;Getting a fairly comprehensive T4 output would be helpful.&lt;BR /&gt;&lt;BR /&gt;I would also consider if the problem gives warnings in the hour or so before the freeze actually happens. I would also check if somebody is doing some automated process at or about the time of the freeze. I would also hook up one or more network sniffers to the applicabale network connections to monitor traffic to/from the node (Wireshark, the successor to Ethereal, is available as a free download, so having multiple monitors should not be a problem)_.&lt;BR /&gt;&lt;BR /&gt;- Bob Gezelter, &lt;A href="http://www.rlgsc.com" target="_blank"&gt;http://www.rlgsc.com&lt;/A&gt;</description>
    <pubDate>Tue, 17 Oct 2006 10:58:47 GMT</pubDate>
    <dc:creator>Robert Gezelter</dc:creator>
    <dc:date>2006-10-17T10:58:47Z</dc:date>
    <item>
      <title>Problem with Cluster</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881643#M79484</link>
      <description>Hello&lt;BR /&gt;Configuration: 2 node cluster (ES47, ES45), OS version is 8.2, 2X storage MA8000, gigabit eth cluster interconnect&lt;BR /&gt;First node (ES47: 4 CPU 24 GB RAM) is running 13 Oracle instances.&lt;BR /&gt;Problem is that this machine hangs about 9:00 AM whem workload reaches top. Second node is currently diong nothing (some databases were not created yet) and is working fine.&lt;BR /&gt;When this node is rebooted, it is working fine until next day 9:00 AM (just four instances are active 24 hours a day).&lt;BR /&gt;Before creating cluster this machine worked fine as standalone. It even worked OK one day as single node cluster (before adding second node to cluster). As single node, it worked with 10 HSG disks, now is working with 28 HSG disks&lt;BR /&gt;No crash dump file, nothing in operator.log.&lt;BR /&gt;From where to start dubuging?&lt;BR /&gt;What system parameters should checked?</description>
      <pubDate>Tue, 17 Oct 2006 10:19:46 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881643#M79484</guid>
      <dc:creator>Vladimir Fabecic</dc:creator>
      <dc:date>2006-10-17T10:19:46Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Cluster</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881644#M79485</link>
      <description>Vladimir,&lt;BR /&gt;&lt;BR /&gt;My first question is: A total freeze, or Oracle and the applications freeze (and you retain access from terminal windows)?&lt;BR /&gt;&lt;BR /&gt;I would recommend running T4 (see the OpenVMS www site) and collecting and analyzing the resulting data. You could be running out of something, but there are many possibilities.&lt;BR /&gt;&lt;BR /&gt;Also, I would consider if I can force a crash dump manually. Analysis of the dump file should show what is hung on what (presuming that it is an OpenVMS problem and not a problem within the application or Oracle).&lt;BR /&gt;&lt;BR /&gt;- Bob Gezelter, &lt;A href="http://www.rlgsc.com" target="_blank"&gt;http://www.rlgsc.com&lt;/A&gt;</description>
      <pubDate>Tue, 17 Oct 2006 10:24:49 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881644#M79485</guid>
      <dc:creator>Robert Gezelter</dc:creator>
      <dc:date>2006-10-17T10:24:49Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Cluster</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881645#M79486</link>
      <description>Hello Bob&lt;BR /&gt;Thants for fast reply. It is not total freeze. It even opens new terminal in X but does not give $ prompt. Looks like it is running out of something. Since it is production envirement, I must react fast. I will do some monitoring tomorow. I know there are many possibilities, but what would be your first guess?</description>
      <pubDate>Tue, 17 Oct 2006 10:51:59 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881645#M79486</guid>
      <dc:creator>Vladimir Fabecic</dc:creator>
      <dc:date>2006-10-17T10:51:59Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Cluster</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881646#M79487</link>
      <description>Vladimir,&lt;BR /&gt;&lt;BR /&gt;same questions as Bob: what is 'hanging' ?&lt;BR /&gt;&lt;BR /&gt;- can you still do a PING node ?&lt;BR /&gt;- can you login via TELNET, LAT, DECnet ?&lt;BR /&gt;- how do you 'reboot' that node (just hitting restart-switch) ?&lt;BR /&gt;- what does a SHO SYS/NODE=xxx show if issued from the other node when the first one is 'hung' ? Any processes in RW* state ?&lt;BR /&gt;&lt;BR /&gt;Volker.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 17 Oct 2006 10:56:20 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881646#M79487</guid>
      <dc:creator>Volker Halle</dc:creator>
      <dc:date>2006-10-17T10:56:20Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Cluster</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881647#M79488</link>
      <description>Vladimir,&lt;BR /&gt;&lt;BR /&gt;Locks, pool, and various quotas come to mind.&lt;BR /&gt;&lt;BR /&gt;Getting a fairly comprehensive T4 output would be helpful.&lt;BR /&gt;&lt;BR /&gt;I would also consider if the problem gives warnings in the hour or so before the freeze actually happens. I would also check if somebody is doing some automated process at or about the time of the freeze. I would also hook up one or more network sniffers to the applicabale network connections to monitor traffic to/from the node (Wireshark, the successor to Ethereal, is available as a free download, so having multiple monitors should not be a problem)_.&lt;BR /&gt;&lt;BR /&gt;- Bob Gezelter, &lt;A href="http://www.rlgsc.com" target="_blank"&gt;http://www.rlgsc.com&lt;/A&gt;</description>
      <pubDate>Tue, 17 Oct 2006 10:58:47 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881647#M79488</guid>
      <dc:creator>Robert Gezelter</dc:creator>
      <dc:date>2006-10-17T10:58:47Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Cluster</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881648#M79489</link>
      <description>Vladimir,&lt;BR /&gt;&lt;BR /&gt;if there is nothing in OPERATOR.LOG, please also watch the console terminal for any messages (Mount-verification ?)&lt;BR /&gt;&lt;BR /&gt;If you don't get a $ prompt, you're likely to hit the RESTART button to reboot your system. Try HALT button and &amp;gt;&amp;gt;&amp;gt; CRASH instead. It will take some time to write the dump, but that will probably be the only way to find out what's wrong.&lt;BR /&gt;&lt;BR /&gt;Try logging in using Username: xxx/NOCOMMAND to skip your login-procedures, they may hang due to some problem.&lt;BR /&gt;&lt;BR /&gt;Try to keep a terminal logged in before 09:00 AM to be able to look around once the problem hits.&lt;BR /&gt;&lt;BR /&gt;Volker.</description>
      <pubDate>Tue, 17 Oct 2006 11:01:56 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881648#M79489</guid>
      <dc:creator>Volker Halle</dc:creator>
      <dc:date>2006-10-17T11:01:56Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Cluster</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881649#M79490</link>
      <description>&lt;BR /&gt;Are there any console messages displayed?  Assuming a quorum disk, is there any production I/O on the quorum disk?  &lt;BR /&gt;&lt;BR /&gt;You state gigabit ethernet cluster interconnect, is this dedicated to cluster traffic or does it share application traffic and cluster traffic?  &lt;BR /&gt;&lt;BR /&gt;There was a similiar behavior with 7.3-2 and  TCPIP 5.4 corrected in ECO4 if I recall correctly.  Are you current with TCPIP ECOs?  This may or may not be an issue in TCPIP 5.5 included with VMS 8.2.  &lt;BR /&gt;&lt;BR /&gt;Andy</description>
      <pubDate>Tue, 17 Oct 2006 11:11:58 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881649#M79490</guid>
      <dc:creator>Andy Bustamante</dc:creator>
      <dc:date>2006-10-17T11:11:58Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Cluster</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881650#M79491</link>
      <description>Vladimir,&lt;BR /&gt;&lt;BR /&gt;Also consider opening a SYSMAN session on the other cluster node, with a SET ENVIRONMENT to the node that is failing.&lt;BR /&gt;&lt;BR /&gt;I have seen situations where terminal sessions were useless, but the SYSMAN session remained usable.&lt;BR /&gt;&lt;BR /&gt;- Bob Gezelter, &lt;A href="http://www.rlgsc.com" target="_blank"&gt;http://www.rlgsc.com&lt;/A&gt;</description>
      <pubDate>Tue, 17 Oct 2006 11:12:41 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881650#M79491</guid>
      <dc:creator>Robert Gezelter</dc:creator>
      <dc:date>2006-10-17T11:12:41Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Cluster</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881651#M79492</link>
      <description>PING can be done. Telnet gives timeout. 'Reboot' is done with restart switch. Console terminal was not turned on so no messages.&lt;BR /&gt;I got these informations from customer.&lt;BR /&gt;&lt;BR /&gt;Gigabit ethernet cluster interconnect is dedicated to cluster traffic. There is no production I/O on the quorum disk. All newest patches are installed including TCPIP 5.5 ECO1.&lt;BR /&gt;Tomorow I will do some monitoring as suggested by Valker and Bob. &lt;BR /&gt;If no other way I will force crash the day after.&lt;BR /&gt;Terminal will be connected to Reflection session so everithing will be logged.&lt;BR /&gt;I do not think it is Oracle problem because nothing has been changed in Oracle software.&lt;BR /&gt;Looks like a parameter (or quota) problem to me, but I will have much more informations tomorow.&lt;BR /&gt;Guys, thanks a lot for helping me.</description>
      <pubDate>Tue, 17 Oct 2006 11:45:25 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881651#M79492</guid>
      <dc:creator>Vladimir Fabecic</dc:creator>
      <dc:date>2006-10-17T11:45:25Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Cluster</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881652#M79493</link>
      <description>From the node still working, do a &lt;BR /&gt;sh sys/node=other, to see if you have many process in "interesting" states (rwxxx, mutex...).&lt;BR /&gt;&lt;BR /&gt;As said before, try a &lt;BR /&gt;mc sysman set env/node=other&lt;BR /&gt;do any command&lt;BR /&gt;&lt;BR /&gt;If a login fails after the username, this can mean pagedyn is too low.&lt;BR /&gt;&lt;BR /&gt;Take a crash, you will have something to analyse&lt;BR /&gt;&lt;BR /&gt;The best advice: install Amds or Availability Manager, you will have all the good data available to know what is going wrong.</description>
      <pubDate>Tue, 17 Oct 2006 11:52:03 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881652#M79493</guid>
      <dc:creator>labadie_1</dc:creator>
      <dc:date>2006-10-17T11:52:03Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Cluster</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881653#M79494</link>
      <description>Vladimir,&lt;BR /&gt;&lt;BR /&gt;if PING works, but TELNET gives a timeout, could it be a process creation/scheduling problem ? A high PRIO looping job preventing any other processes to receive any CPU time ?&lt;BR /&gt;&lt;BR /&gt;Volker.</description>
      <pubDate>Tue, 17 Oct 2006 12:28:27 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881653#M79494</guid>
      <dc:creator>Volker Halle</dc:creator>
      <dc:date>2006-10-17T12:28:27Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Cluster</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881654#M79495</link>
      <description>after the reboot, check in the (previous) operator.log messages such as&lt;BR /&gt;pagefrag&lt;BR /&gt;pagecrit&lt;BR /&gt;noslot&lt;BR /&gt;no pcb available&lt;BR /&gt;&lt;BR /&gt;Good hunt&lt;BR /&gt;</description>
      <pubDate>Tue, 17 Oct 2006 14:26:20 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881654#M79495</guid>
      <dc:creator>labadie_1</dc:creator>
      <dc:date>2006-10-17T14:26:20Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Cluster</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881655#M79496</link>
      <description>Hi Vladimir,&lt;BR /&gt;did you consider a lock tree remastering?&lt;BR /&gt;&lt;BR /&gt;What are the values for the SYSGEN parameters&lt;BR /&gt;LOCKDIRWT and PE1 on both nodes?&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;Albert</description>
      <pubDate>Tue, 17 Oct 2006 14:46:09 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881655#M79496</guid>
      <dc:creator>Albert Öttl</dc:creator>
      <dc:date>2006-10-17T14:46:09Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Cluster</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881656#M79497</link>
      <description>&lt;BR /&gt;Had a similar problem.  Check your locks.  HP changed some memory locking stuff in 8.2.  If your locking rate has become excessive, install SYS500 and UPDATE400.  There is a patch that reverts the behavior back to 7.3-2 for locking pages.&lt;BR /&gt;&lt;BR /&gt;Hope that helps!&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 17 Oct 2006 15:56:17 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881656#M79497</guid>
      <dc:creator>EdgarZamora_1</dc:creator>
      <dc:date>2006-10-17T15:56:17Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Cluster</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881657#M79498</link>
      <description>In addition to all the monitoring stuff, did You do a simple AUTOGEN with feedback since the upgrade to more than double the number of disks/HSGs ?&lt;BR /&gt;It may show some of the parameters to adjust. &lt;BR /&gt;</description>
      <pubDate>Wed, 18 Oct 2006 02:28:27 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881657#M79498</guid>
      <dc:creator>Joseph Huber_1</dc:creator>
      <dc:date>2006-10-18T02:28:27Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Cluster</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881658#M79499</link>
      <description>Vladimir,&lt;BR /&gt;&lt;BR /&gt;what is the 'typical' behavior?&lt;BR /&gt;Gradual slowdown till things stop, or all going normal until 'sudden death'?&lt;BR /&gt;&lt;BR /&gt;So many questios asked already, I guess the right one is there, but you need facts to decide which one.&lt;BR /&gt;I like Joseph's AUTOGEN idea. It could give a lot of info, even before you run stuck again.&lt;BR /&gt;&lt;BR /&gt;Good hunting!&lt;BR /&gt;&lt;BR /&gt;Proost.&lt;BR /&gt;&lt;BR /&gt;Have one on me.&lt;BR /&gt;&lt;BR /&gt;jpe</description>
      <pubDate>Wed, 18 Oct 2006 03:00:26 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881658#M79499</guid>
      <dc:creator>Jan van den Ende</dc:creator>
      <dc:date>2006-10-18T03:00:26Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Cluster</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881659#M79500</link>
      <description>Hello guys&lt;BR /&gt;I think I found the reason of problem. Yesterday I doubled CHANNELCNT parameter and everything is working fine so far.&lt;BR /&gt;&lt;BR /&gt;Some answers to question:&lt;BR /&gt;what is the 'typical' behavior?&lt;BR /&gt;Behavior was all going normal until 'sudden death'.&lt;BR /&gt;did you do a simple AUTOGEN with feedback ?&lt;BR /&gt;I did. There was nothing about CHANNELCNT.&lt;BR /&gt;LOCKDIRWT and PE1 are set to 0 on both nodes&lt;BR /&gt;&lt;BR /&gt;I will try to schedule some downtime to encrease NPAGEDYN and NPAGEVIR because of new database instance.&lt;BR /&gt;&lt;BR /&gt;Again, thanks a lot for your help and time.</description>
      <pubDate>Wed, 18 Oct 2006 05:44:48 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881659#M79500</guid>
      <dc:creator>Vladimir Fabecic</dc:creator>
      <dc:date>2006-10-18T05:44:48Z</dc:date>
    </item>
    <item>
      <title>Re: Problem with Cluster</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881660#M79501</link>
      <description>Vladimir,&lt;BR /&gt;&lt;BR /&gt;At first glance, that would certainly appear to be able to produce the symptoms that you described.&lt;BR /&gt;&lt;BR /&gt;I would also recommend checking other paramters which may be close to a problem area. It is hard to come up with a solid rule, but I would take a look at everything that is over 50-60% (since presumably, this is to become a two node cluster).&lt;BR /&gt;&lt;BR /&gt;- Bob Gezelter, &lt;A href="http://www.rlgsc.com" target="_blank"&gt;http://www.rlgsc.com&lt;/A&gt;</description>
      <pubDate>Wed, 18 Oct 2006 06:07:32 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/problem-with-cluster/m-p/3881660#M79501</guid>
      <dc:creator>Robert Gezelter</dc:creator>
      <dc:date>2006-10-18T06:07:32Z</dc:date>
    </item>
  </channel>
</rss>

