<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Quorum disk lost connection every two hours in Operating System - OpenVMS</title>
    <link>https://community.hpe.com/t5/operating-system-openvms/quorum-disk-lost-connection-every-two-hours/m-p/5114608#M90275</link>
    <description>For Hoff's comments:&lt;BR /&gt;&lt;BR /&gt;Parameter Name            Current    Default    Minimum    Maximum Unit  Dynamic&lt;BR /&gt;--------------            -------    -------    -------    ------- ----  -------&lt;BR /&gt;VAXCLUSTER                      2          1          0          2 Coded-value&lt;BR /&gt;EXPECTED_VOTES                  2          1          1        127 Votes&lt;BR /&gt;VOTES                           1          1          0        127 Votes&lt;BR /&gt;DISK_QUORUM     "$1$DUA182       "    "    "     "    "     "ZZZZ" Ascii&lt;BR /&gt;QDSKVOTES                       1          1          0        127 Votes&lt;BR /&gt;QDSKINTERVAL                    3          3          1      32767 Seconds&lt;BR /&gt;ALLOCLASS                       1          0          0        255 Pure-number&lt;BR /&gt;LOCKDIRWT                       1          0          0        255 Pure-number&lt;BR /&gt;CLUSTER_CREDITS                32         32         10        128 Credits&lt;BR /&gt;NISCS_CONV_BOOT                 0          0          0          1 Boolean&lt;BR /&gt;NISCS_LOAD_PEA0                 1          0          0          1 Boolean&lt;BR /&gt;MSCP_LOAD                       1          0          0      16384 Coded-value&lt;BR /&gt;TMSCP_LOAD                      0          0          0          3 Coded-value&lt;BR /&gt;MSCP_SERVE_ALL                  1          4          0         -1 Bit-Encoded&lt;BR /&gt;TMSCP_SERVE_ALL                 0          0          0         -1 Bit-Encoded&lt;BR /&gt;MSCP_BUFFER                  1024       1024        256         -1 Coded-value&lt;BR /&gt;MSCP_CREDITS                   32         32          2       1024 Coded-value&lt;BR /&gt;TAPE_ALLOCLASS                  1          0          0        255 Pure-number&lt;BR /&gt;NISCS_MAX_PKTSZ              8192       8192        576       9180 Bytes&lt;BR /&gt;CWCREPRC_ENABLE                 1          1          0          1 Bitmask     D&lt;BR /&gt;RECNXINTERVAL                  20         20          1      32767 Seconds     D&lt;BR /&gt;NISCS_PORT_SERV                 0          0          0        256 Bitmask     D&lt;BR /&gt;MSCP_CMD_TMO                    0          0          0 2147483647 Seconds     D&lt;BR /&gt;LOCKRMWT                        5          5          0         10 Pure-number D&lt;BR /&gt;&lt;BR /&gt;Disk $1$DUA182: (HSJ004), device type MSCP served SCSI disk array, is online,&lt;BR /&gt;    mounted, file-oriented device, shareable, served to cluster via MSCP Server,&lt;BR /&gt;    error logging is enabled.&lt;BR /&gt;&lt;BR /&gt;    Error count                    0    Operations completed           12140682&lt;BR /&gt;    Owner process                 ""    Owner UIC                      [SYSTEM]&lt;BR /&gt;    Owner process ID        00000000    Dev Prot            S:RWPL,O:RWPL,G:R,W&lt;BR /&gt;    Reference count             1722    Default buffer size                 512&lt;BR /&gt;    Current preferred CPU Id       0    Fastpath                              1&lt;BR /&gt;    Total blocks            17763835    Sectors per track                    64&lt;BR /&gt;    Total cylinders             6939    Tracks per cylinder                  40&lt;BR /&gt;    Logical Volume Size     17763835    Expansion Size Limit           18505728&lt;BR /&gt;    Host name               "HSJ004"    Host type, avail              HSJ5, yes&lt;BR /&gt;    Alternate host name     "HSJ005"    Alt. type, avail              HSJ5, yes&lt;BR /&gt;    Allocation class               1&lt;BR /&gt;&lt;BR /&gt;    Volume label      "CL1_RD09_182"    Relative volume number                0&lt;BR /&gt;    Cluster size                  18    Transaction count                   896&lt;BR /&gt;    Free blocks              5740218    Maximum files allowed            467469&lt;BR /&gt;    Extend quantity                5    Mount count                           1&lt;BR /&gt;    Mount status              System    Cache name        "_$1$DUA182:XQPCACHE"&lt;BR /&gt;    Extent cache size             64    Maximum blocks in extent cache   574021&lt;BR /&gt;    File ID cache size            64    Blocks in extent cache           573444&lt;BR /&gt;    Quota cache size               0    Maximum buffers in FCP cache       4240&lt;BR /&gt;    Volume owner UIC           [1,1]    Vol Prot    S:RWCD,O:RWCD,G:RWCD,W:RWCD&lt;BR /&gt;&lt;BR /&gt;  Volume Status:  ODS-2, subject to mount verification, protected subsystems&lt;BR /&gt;      enabled, write-through caching enabled.&lt;BR /&gt;&lt;BR /&gt;No activity on the HSJ50 consoles.  No unusual network activity.&lt;BR /&gt;&lt;BR /&gt;This appears to have started around the time that we upgraded from V7.3-2 to V8.3.&lt;BR /&gt;&lt;BR /&gt;The machine is scheduled for a reboot tomorrow evening to remove the quorum disk, and for other changes, so the matter will be, as Spock would say, rendered academic.</description>
    <pubDate>Wed, 18 Jun 2008 19:11:45 GMT</pubDate>
    <dc:creator>pcseunix</dc:creator>
    <dc:date>2008-06-18T19:11:45Z</dc:date>
    <item>
      <title>Quorum disk lost connection every two hours</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/quorum-disk-lost-connection-every-two-hours/m-p/5114605#M90272</link>
      <description>I have a system that is currently a one node cluster, with a quorum disk.  Yes, I know that the quorum disk is not necessary -- it's a holdover from earlier days when there were two nodes.  We're planning on getting rid of the quorum disk on our next scheduled reboot.  But, this is an interesting problem.&lt;BR /&gt;&lt;BR /&gt;Anyway, on to the problem.&lt;BR /&gt;&lt;BR /&gt;From time to time, the system reports "Lost connection to quorum disk", followed a few seconds later by "Quorum regained...".  The interesting this is that this occurs on two hour intervals, but not on all two hour intervals:&lt;BR /&gt;&lt;BR /&gt;06/17/08 00:07:45: %CNXMAN,  Lost "connection" to quorum disk&lt;BR /&gt;06/17/08 00:07:48: %CNXMAN,  Quorum regained, resuming activity&lt;BR /&gt;06/17/08 02:07:45: %CNXMAN,  Lost "connection" to quorum disk&lt;BR /&gt;06/17/08 02:08:15: %CNXMAN,  Quorum regained, resuming activity&lt;BR /&gt;06/17/08 04:07:45: %CNXMAN,  Lost "connection" to quorum disk&lt;BR /&gt;06/17/08 04:07:52: %CNXMAN,  Quorum regained, resuming activity&lt;BR /&gt;06/17/08 08:07:45: %CNXMAN,  Lost "connection" to quorum disk&lt;BR /&gt;06/17/08 08:08:15: %CNXMAN,  Quorum regained, resuming activity&lt;BR /&gt;06/17/08 10:07:45: %CNXMAN,  Lost "connection" to quorum disk&lt;BR /&gt;06/17/08 10:07:53: %CNXMAN,  Quorum regained, resuming activity&lt;BR /&gt;06/17/08 14:07:45: %CNXMAN,  Lost "connection" to quorum disk&lt;BR /&gt;06/17/08 14:08:15: %CNXMAN,  Quorum regained, resuming activity&lt;BR /&gt;06/17/08 16:07:41: %CNXMAN,  Lost "connection" to quorum disk&lt;BR /&gt;06/17/08 16:08:15: %CNXMAN,  Quorum regained, resuming activity&lt;BR /&gt;06/17/08 22:07:45: %CNXMAN,  Lost "connection" to quorum disk&lt;BR /&gt;06/17/08 22:08:15: %CNXMAN,  Quorum regained, resuming activity&lt;BR /&gt;&lt;BR /&gt;No disk errors reported, the system is not busy at the times indicated -- actually not very busy at all.  &lt;BR /&gt;&lt;BR /&gt;System is ES40, 4 cpus, 4GB memory, CIPCA connected to HSZ50, all disks are RAID5.  Has VMS83A_UPDATE V5.0 installed (yes, I see that there is a V6.0).&lt;BR /&gt;&lt;BR /&gt;Ideas, suggestions?</description>
      <pubDate>Wed, 18 Jun 2008 13:09:27 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/quorum-disk-lost-connection-every-two-hours/m-p/5114605#M90272</guid>
      <dc:creator>pcseunix</dc:creator>
      <dc:date>2008-06-18T13:09:27Z</dc:date>
    </item>
    <item>
      <title>Re: Quorum disk lost connection every two hours</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/quorum-disk-lost-connection-every-two-hours/m-p/5114606#M90273</link>
      <description>The usual triggers tend to be I/O errors, periodic I/O floods, or other such.  Here, I'd look at the CI, too, as cable faults and termination problems can cause communications issues.  Periodic, though, is weird.&lt;BR /&gt;&lt;BR /&gt;Please post the cluster system parameters.&lt;BR /&gt;&lt;BR /&gt;SYSMAN&amp;gt; param show /cluster &lt;BR /&gt;&lt;BR /&gt;Please also post the SHOW DEVICE /FULL from the quorum disk.  This disk is typically MOUNT /SYSTEM.&lt;BR /&gt;&lt;BR /&gt;Please do check for errors or restarts or such out at the HSZ, too -- for any disk- or CI-related errors or faults or such that might be logged out on the controller, or elsewhere in the configuration.&lt;BR /&gt;&lt;BR /&gt;Also check the network and other cluster communications controllers that might be present.&lt;BR /&gt;&lt;BR /&gt;FWIW, RAID5 has an enormous I/O load during rebuilds, too.  IMHO with modern disk prices, RAID10 is often a better choice.  And when you get rid of the quorum disk, I'd take a look at the whole of the CI storage connection, too, as that's old kit.  Direct-attached SCSI might be a better choice for a one-node configuration, with a PCI RAID controller.&lt;BR /&gt;&lt;BR /&gt;And yes, do get rid of the quorum disk.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 18 Jun 2008 15:00:07 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/quorum-disk-lost-connection-every-two-hours/m-p/5114606#M90273</guid>
      <dc:creator>Hoff</dc:creator>
      <dc:date>2008-06-18T15:00:07Z</dc:date>
    </item>
    <item>
      <title>Re: Quorum disk lost connection every two hours</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/quorum-disk-lost-connection-every-two-hours/m-p/5114607#M90274</link>
      <description>This would be possibly a nice T4 excercise.&lt;BR /&gt;If you have T4 running, zoom in to the 7'th minute.&lt;BR /&gt;Notably I would check the minute for 6/17 06:07 and 12:07 because it might show something happening without the lost quorum noise.&lt;BR /&gt;&lt;BR /&gt;I would also run a SHOW SYSTEM just at 6 minutes past the hour, and again at 8 minutes and 'subtract' them for a process activity insight for those minutes.&lt;BR /&gt;Of course this is not unlikely to influence the problem ... it might even make it go away :-).&lt;BR /&gt;&lt;BR /&gt;Finally, has it been behaving like this 'for ever'? When did it start? What had changed around that time?&lt;BR /&gt;&lt;BR /&gt;Hein.&lt;BR /&gt;</description>
      <pubDate>Wed, 18 Jun 2008 16:09:52 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/quorum-disk-lost-connection-every-two-hours/m-p/5114607#M90274</guid>
      <dc:creator>Hein van den Heuvel</dc:creator>
      <dc:date>2008-06-18T16:09:52Z</dc:date>
    </item>
    <item>
      <title>Re: Quorum disk lost connection every two hours</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/quorum-disk-lost-connection-every-two-hours/m-p/5114608#M90275</link>
      <description>For Hoff's comments:&lt;BR /&gt;&lt;BR /&gt;Parameter Name            Current    Default    Minimum    Maximum Unit  Dynamic&lt;BR /&gt;--------------            -------    -------    -------    ------- ----  -------&lt;BR /&gt;VAXCLUSTER                      2          1          0          2 Coded-value&lt;BR /&gt;EXPECTED_VOTES                  2          1          1        127 Votes&lt;BR /&gt;VOTES                           1          1          0        127 Votes&lt;BR /&gt;DISK_QUORUM     "$1$DUA182       "    "    "     "    "     "ZZZZ" Ascii&lt;BR /&gt;QDSKVOTES                       1          1          0        127 Votes&lt;BR /&gt;QDSKINTERVAL                    3          3          1      32767 Seconds&lt;BR /&gt;ALLOCLASS                       1          0          0        255 Pure-number&lt;BR /&gt;LOCKDIRWT                       1          0          0        255 Pure-number&lt;BR /&gt;CLUSTER_CREDITS                32         32         10        128 Credits&lt;BR /&gt;NISCS_CONV_BOOT                 0          0          0          1 Boolean&lt;BR /&gt;NISCS_LOAD_PEA0                 1          0          0          1 Boolean&lt;BR /&gt;MSCP_LOAD                       1          0          0      16384 Coded-value&lt;BR /&gt;TMSCP_LOAD                      0          0          0          3 Coded-value&lt;BR /&gt;MSCP_SERVE_ALL                  1          4          0         -1 Bit-Encoded&lt;BR /&gt;TMSCP_SERVE_ALL                 0          0          0         -1 Bit-Encoded&lt;BR /&gt;MSCP_BUFFER                  1024       1024        256         -1 Coded-value&lt;BR /&gt;MSCP_CREDITS                   32         32          2       1024 Coded-value&lt;BR /&gt;TAPE_ALLOCLASS                  1          0          0        255 Pure-number&lt;BR /&gt;NISCS_MAX_PKTSZ              8192       8192        576       9180 Bytes&lt;BR /&gt;CWCREPRC_ENABLE                 1          1          0          1 Bitmask     D&lt;BR /&gt;RECNXINTERVAL                  20         20          1      32767 Seconds     D&lt;BR /&gt;NISCS_PORT_SERV                 0          0          0        256 Bitmask     D&lt;BR /&gt;MSCP_CMD_TMO                    0          0          0 2147483647 Seconds     D&lt;BR /&gt;LOCKRMWT                        5          5          0         10 Pure-number D&lt;BR /&gt;&lt;BR /&gt;Disk $1$DUA182: (HSJ004), device type MSCP served SCSI disk array, is online,&lt;BR /&gt;    mounted, file-oriented device, shareable, served to cluster via MSCP Server,&lt;BR /&gt;    error logging is enabled.&lt;BR /&gt;&lt;BR /&gt;    Error count                    0    Operations completed           12140682&lt;BR /&gt;    Owner process                 ""    Owner UIC                      [SYSTEM]&lt;BR /&gt;    Owner process ID        00000000    Dev Prot            S:RWPL,O:RWPL,G:R,W&lt;BR /&gt;    Reference count             1722    Default buffer size                 512&lt;BR /&gt;    Current preferred CPU Id       0    Fastpath                              1&lt;BR /&gt;    Total blocks            17763835    Sectors per track                    64&lt;BR /&gt;    Total cylinders             6939    Tracks per cylinder                  40&lt;BR /&gt;    Logical Volume Size     17763835    Expansion Size Limit           18505728&lt;BR /&gt;    Host name               "HSJ004"    Host type, avail              HSJ5, yes&lt;BR /&gt;    Alternate host name     "HSJ005"    Alt. type, avail              HSJ5, yes&lt;BR /&gt;    Allocation class               1&lt;BR /&gt;&lt;BR /&gt;    Volume label      "CL1_RD09_182"    Relative volume number                0&lt;BR /&gt;    Cluster size                  18    Transaction count                   896&lt;BR /&gt;    Free blocks              5740218    Maximum files allowed            467469&lt;BR /&gt;    Extend quantity                5    Mount count                           1&lt;BR /&gt;    Mount status              System    Cache name        "_$1$DUA182:XQPCACHE"&lt;BR /&gt;    Extent cache size             64    Maximum blocks in extent cache   574021&lt;BR /&gt;    File ID cache size            64    Blocks in extent cache           573444&lt;BR /&gt;    Quota cache size               0    Maximum buffers in FCP cache       4240&lt;BR /&gt;    Volume owner UIC           [1,1]    Vol Prot    S:RWCD,O:RWCD,G:RWCD,W:RWCD&lt;BR /&gt;&lt;BR /&gt;  Volume Status:  ODS-2, subject to mount verification, protected subsystems&lt;BR /&gt;      enabled, write-through caching enabled.&lt;BR /&gt;&lt;BR /&gt;No activity on the HSJ50 consoles.  No unusual network activity.&lt;BR /&gt;&lt;BR /&gt;This appears to have started around the time that we upgraded from V7.3-2 to V8.3.&lt;BR /&gt;&lt;BR /&gt;The machine is scheduled for a reboot tomorrow evening to remove the quorum disk, and for other changes, so the matter will be, as Spock would say, rendered academic.</description>
      <pubDate>Wed, 18 Jun 2008 19:11:45 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/quorum-disk-lost-connection-every-two-hours/m-p/5114608#M90275</guid>
      <dc:creator>pcseunix</dc:creator>
      <dc:date>2008-06-18T19:11:45Z</dc:date>
    </item>
    <item>
      <title>Re: Quorum disk lost connection every two hours</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/quorum-disk-lost-connection-every-two-hours/m-p/5114609#M90276</link>
      <description>Ok.  You have an HSJ, and not an HSZ.&lt;BR /&gt;&lt;BR /&gt;I don't see anything obvious in the settings.  &lt;BR /&gt;&lt;BR /&gt;Usual shot-gun for weirdnesses: Check the HSJ firmware, the SRM firmware, and the OpenVMS ECOs.&lt;BR /&gt;&lt;BR /&gt;But then if you're removing the quorum disk, set your votes and expected votes and disk quorum values appropriately, and be done with it.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 18 Jun 2008 19:49:16 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/quorum-disk-lost-connection-every-two-hours/m-p/5114609#M90276</guid>
      <dc:creator>Hoff</dc:creator>
      <dc:date>2008-06-18T19:49:16Z</dc:date>
    </item>
    <item>
      <title>Re: Quorum disk lost connection every two hours</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/quorum-disk-lost-connection-every-two-hours/m-p/5114610#M90277</link>
      <description>PCSEUniks&lt;BR /&gt;&lt;BR /&gt;How many nodes is the cluster ?&lt;BR /&gt;All nodes are/have the same vms version ?&lt;BR /&gt;&lt;BR /&gt;AvR</description>
      <pubDate>Wed, 25 Jun 2008 07:52:38 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/quorum-disk-lost-connection-every-two-hours/m-p/5114610#M90277</guid>
      <dc:creator>Anton van Ruitenbeek</dc:creator>
      <dc:date>2008-06-25T07:52:38Z</dc:date>
    </item>
    <item>
      <title>Re: Quorum disk lost connection every two hours</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/quorum-disk-lost-connection-every-two-hours/m-p/5114611#M90278</link>
      <description>We have removed the quorum disk on our 1-node cluster, and how the CNXMAN messages have gone away.</description>
      <pubDate>Wed, 25 Jun 2008 12:43:16 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/quorum-disk-lost-connection-every-two-hours/m-p/5114611#M90278</guid>
      <dc:creator>pcseunix</dc:creator>
      <dc:date>2008-06-25T12:43:16Z</dc:date>
    </item>
  </channel>
</rss>

