<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic cmcld: Halting node to preserve data integrity in Operating System - HP-UX</title>
    <link>https://community.hpe.com/t5/operating-system-hp-ux/cmcld-halting-node-to-preserve-data-integrity/m-p/3051821#M708166</link>
    <description>Hi,&lt;BR /&gt;&lt;BR /&gt;I have just installed test cluster system (HP-UX 11i (June 2003) on 2 servers with 2 shared disk arrays), and configured Apache as test HA service.&lt;BR /&gt;&lt;BR /&gt;Problem. When I start package on any node always goes to reboot.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Aug 19 16:21:05 db2 cmcld: Request from node db2 to start package pkg1 on node db2.&lt;BR /&gt;Aug 19 16:21:05 db2 cmcld: Executing '/etc/cmcluster/pkg1/pkg1.sh  start' for package pkg1, as service PKG*40961.&lt;BR /&gt;Aug 19 16:21:06 db2 LVM[5364]: vgchange -a y vgdb &lt;BR /&gt;Aug 19 16:21:07 db2 CM-pkg1[5396]: cmmodnet -a -i 10.20.2.90 10.20.2.0 &lt;BR /&gt;Aug 19 16:21:07 db2 CM-pkg1[5406]: cmrunserv www &amp;gt;&amp;gt; /etc/cmcluster/pkg1/pkg1.sh.log 2&amp;gt;&amp;amp;1 /etc/cmcluster/pkg1/www monitor &lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Service www terminated due to an exit(127).&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Service PKG*40961 terminated due to an exit(0).&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Started package pkg1 on node db2.&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Service www in package pkg1 has gone down.&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Service fail fast is set. Node will be failed.&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Failed node in response to failure of package pkg1.&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Halting db2 to preserve data integrity&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Reason: A crucial package failed&lt;BR /&gt;Aug 19 16:21:07 db2 cmlvmd: Could not read messages from /usr/lbin/cmcld: Software caused connection abort&lt;BR /&gt;Aug 19 16:21:07 db2 cmlvmd: CLVMD exiting&lt;BR /&gt;Aug 19 16:21:07 db2 cmsrvassistd[5042]: The cluster daemon aborted our connection.&lt;BR /&gt;Aug 19 16:21:07 db2 cmsrvassistd[5042]: Lost connection with ServiceGuard cluster daemon (cmcld): Software caused connection abort&lt;BR /&gt;Aug 19 16:21:07 db2 cmtaped[5045]: The cluster daemon aborted our connection.&lt;BR /&gt;Aug 19 16:21:07 db2 cmtaped[5045]: cmtaped terminating. (ATS 1.14)&lt;BR /&gt;Aug 19 16:21:07 db2 cmclconfd[5355]: The cluster daemon aborted our connection.&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix: SCSI: Reset detected -- lbolt: 7927106, bus: 4&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:             lbp-&amp;gt;state: 4060&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:             lbp-&amp;gt;offset: ffffffff&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:             lbp-&amp;gt;uPhysScript: f9fef000&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:     From most recent interrupt:&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:             ISTAT: 02, SIST0: 02, SIST1: 00, DSTAT: 80, DSPS: f9fef028&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:     lsp: 0000000000000000&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:     lbp-&amp;gt;owner: 0000000000000000&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:     scratch_lsp: 0000000000000000&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:     Pre-DSP script dump [fffffffff9fef0e0]:&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:             e0340004 00000000 e0100004 00000000&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:             48000000 00000000 78350000 00000000&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:     Script dump [fffffffff9fef100]:&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:             50000000 f9fef028 80000000 0000000b&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:             0f000001 f9fef5c0 60000040 00000000&lt;BR /&gt;&lt;BR /&gt;I suspect something is not OK with vg sharing, some patches may help. BTW, how where to find a list of recommended patches for MC/Servce Guard?&lt;BR /&gt;&lt;BR /&gt;Many thanks and point for your comments!&lt;BR /&gt;&lt;BR /&gt;BR,&lt;BR /&gt;Mihails&lt;BR /&gt;&lt;BR /&gt;</description>
    <pubDate>Tue, 19 Aug 2003 13:18:09 GMT</pubDate>
    <dc:creator>Mihails Nikitins</dc:creator>
    <dc:date>2003-08-19T13:18:09Z</dc:date>
    <item>
      <title>cmcld: Halting node to preserve data integrity</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/cmcld-halting-node-to-preserve-data-integrity/m-p/3051821#M708166</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;I have just installed test cluster system (HP-UX 11i (June 2003) on 2 servers with 2 shared disk arrays), and configured Apache as test HA service.&lt;BR /&gt;&lt;BR /&gt;Problem. When I start package on any node always goes to reboot.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Aug 19 16:21:05 db2 cmcld: Request from node db2 to start package pkg1 on node db2.&lt;BR /&gt;Aug 19 16:21:05 db2 cmcld: Executing '/etc/cmcluster/pkg1/pkg1.sh  start' for package pkg1, as service PKG*40961.&lt;BR /&gt;Aug 19 16:21:06 db2 LVM[5364]: vgchange -a y vgdb &lt;BR /&gt;Aug 19 16:21:07 db2 CM-pkg1[5396]: cmmodnet -a -i 10.20.2.90 10.20.2.0 &lt;BR /&gt;Aug 19 16:21:07 db2 CM-pkg1[5406]: cmrunserv www &amp;gt;&amp;gt; /etc/cmcluster/pkg1/pkg1.sh.log 2&amp;gt;&amp;amp;1 /etc/cmcluster/pkg1/www monitor &lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Service www terminated due to an exit(127).&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Service PKG*40961 terminated due to an exit(0).&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Started package pkg1 on node db2.&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Service www in package pkg1 has gone down.&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Service fail fast is set. Node will be failed.&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Failed node in response to failure of package pkg1.&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Halting db2 to preserve data integrity&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Reason: A crucial package failed&lt;BR /&gt;Aug 19 16:21:07 db2 cmlvmd: Could not read messages from /usr/lbin/cmcld: Software caused connection abort&lt;BR /&gt;Aug 19 16:21:07 db2 cmlvmd: CLVMD exiting&lt;BR /&gt;Aug 19 16:21:07 db2 cmsrvassistd[5042]: The cluster daemon aborted our connection.&lt;BR /&gt;Aug 19 16:21:07 db2 cmsrvassistd[5042]: Lost connection with ServiceGuard cluster daemon (cmcld): Software caused connection abort&lt;BR /&gt;Aug 19 16:21:07 db2 cmtaped[5045]: The cluster daemon aborted our connection.&lt;BR /&gt;Aug 19 16:21:07 db2 cmtaped[5045]: cmtaped terminating. (ATS 1.14)&lt;BR /&gt;Aug 19 16:21:07 db2 cmclconfd[5355]: The cluster daemon aborted our connection.&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix: SCSI: Reset detected -- lbolt: 7927106, bus: 4&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:             lbp-&amp;gt;state: 4060&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:             lbp-&amp;gt;offset: ffffffff&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:             lbp-&amp;gt;uPhysScript: f9fef000&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:     From most recent interrupt:&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:             ISTAT: 02, SIST0: 02, SIST1: 00, DSTAT: 80, DSPS: f9fef028&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:     lsp: 0000000000000000&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:     lbp-&amp;gt;owner: 0000000000000000&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:     scratch_lsp: 0000000000000000&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:     Pre-DSP script dump [fffffffff9fef0e0]:&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:             e0340004 00000000 e0100004 00000000&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:             48000000 00000000 78350000 00000000&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:     Script dump [fffffffff9fef100]:&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:             50000000 f9fef028 80000000 0000000b&lt;BR /&gt;Aug 19 16:21:12 db2 vmunix:             0f000001 f9fef5c0 60000040 00000000&lt;BR /&gt;&lt;BR /&gt;I suspect something is not OK with vg sharing, some patches may help. BTW, how where to find a list of recommended patches for MC/Servce Guard?&lt;BR /&gt;&lt;BR /&gt;Many thanks and point for your comments!&lt;BR /&gt;&lt;BR /&gt;BR,&lt;BR /&gt;Mihails&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 19 Aug 2003 13:18:09 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/cmcld-halting-node-to-preserve-data-integrity/m-p/3051821#M708166</guid>
      <dc:creator>Mihails Nikitins</dc:creator>
      <dc:date>2003-08-19T13:18:09Z</dc:date>
    </item>
    <item>
      <title>Re: cmcld: Halting node to preserve data integrity</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/cmcld-halting-node-to-preserve-data-integrity/m-p/3051822#M708167</link>
      <description>you have configured a package to have a service and on startup this service is failing.&lt;BR /&gt;&lt;BR /&gt;Aug 19 16:21:07 db2 CM-pkg1[5406]: cmrunserv www &amp;gt;&amp;gt; /etc/cmcluster/pkg1/pkg1.sh.log 2&amp;gt;&amp;amp;1 /etc/cmcluster/pkg1/www monitor&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Service www terminated due to an exit(127).&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Service PKG*40961 terminated due to an exit(0).&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Started package pkg1 on node db2.&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Service www in package pkg1 has gone down.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;You then have th e package configured with SERVICE_FAIL_FAST=YES&lt;BR /&gt;This causes the system to TOC on hte failure of a service.&lt;BR /&gt;&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Service fail fast is set. Node will be failed.&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Failed node in response to failure of package pkg1.&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Halting db2 to preserve data integrity&lt;BR /&gt;Aug 19 16:21:07 db2 cmcld: Reason: A crucial package failed &lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;reconfigure the package to have SERVICE_FAIL_FAST=NO, and then try again.&lt;BR /&gt;also look at the package log to see if you can track down why your service is failing</description>
      <pubDate>Tue, 19 Aug 2003 13:37:04 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/cmcld-halting-node-to-preserve-data-integrity/m-p/3051822#M708167</guid>
      <dc:creator>melvyn burnard</dc:creator>
      <dc:date>2003-08-19T13:37:04Z</dc:date>
    </item>
    <item>
      <title>Re: cmcld: Halting node to preserve data integrity</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/cmcld-halting-node-to-preserve-data-integrity/m-p/3051823#M708168</link>
      <description>Regarding "... BTW, how where to find a list of recommended patches for MC/Servce Guard?..."&lt;BR /&gt;&lt;BR /&gt;maintenance and support for hp products &amp;gt; individual patches &amp;gt; hp-ux &amp;gt; enter version in data field &amp;gt; enter serviceguard in 'search by keyword' data field &amp;gt; 47 patches are returned&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://www1.itrc.hp.com/service/patch/search.do" target="_blank"&gt;http://www1.itrc.hp.com/service/patch/search.do&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;#############################################&lt;BR /&gt;&lt;BR /&gt;This link provides endless ServiceGuard advice:&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://docs.hp.com/hpux/ha/" target="_blank"&gt;http://docs.hp.com/hpux/ha/&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;#############################################&lt;BR /&gt;&lt;BR /&gt;Regarding "...I suspect something is not OK with vg sharing..."&lt;BR /&gt;&lt;BR /&gt;From either node you should be able to activate vg.'s without bringing up the cluster.  But here is the procedure:&lt;BR /&gt;&lt;BR /&gt;vgchange -c y /dev/vgdata&lt;BR /&gt;#vgexport -p -s -m /tmp/vgoracle.map /dev/vgoracle &lt;BR /&gt;&lt;BR /&gt;# rcp /tmp/vgoracle.map nodeB:/tmp/vgoracle.map &lt;BR /&gt;&lt;BR /&gt;On nodeB &lt;BR /&gt;&lt;BR /&gt;# mkdir /dev/vgoracle &lt;BR /&gt;#mknod /dev/vgoracle/group c 64 0x0x0000 &lt;BR /&gt;#vgimport -s -m /tmp/vgoracle.map /dev/vgoracle &lt;BR /&gt;vgchange -c y /dev/vgdata&lt;BR /&gt;#vgchange -a y /dev/vgdata &lt;BR /&gt;&lt;BR /&gt;#############################################&lt;BR /&gt;&lt;BR /&gt;However, I feel the problem is within your package.conf or package.cntl files.  These files are linked together by the exact same service name, so check for this with :&lt;BR /&gt;&lt;BR /&gt;cmcheckconf -P package.conf&lt;BR /&gt;&lt;BR /&gt;The exact same service name</description>
      <pubDate>Tue, 19 Aug 2003 13:55:06 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/cmcld-halting-node-to-preserve-data-integrity/m-p/3051823#M708168</guid>
      <dc:creator>Michael Steele_2</dc:creator>
      <dc:date>2003-08-19T13:55:06Z</dc:date>
    </item>
    <item>
      <title>Re: cmcld: Halting node to preserve data integrity</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/cmcld-halting-node-to-preserve-data-integrity/m-p/3051824#M708169</link>
      <description>Note that "vgchange -c y &lt;VGNAME&gt;" can only be performed if the node the command is executed on is currently running the ServiceGuard daemons.  cmlvmd must be running to authorize the "clusterizing" of VGs.&lt;BR /&gt;&lt;BR /&gt;-s.&lt;/VGNAME&gt;</description>
      <pubDate>Wed, 20 Aug 2003 11:27:12 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/cmcld-halting-node-to-preserve-data-integrity/m-p/3051824#M708169</guid>
      <dc:creator>Stephen Doud</dc:creator>
      <dc:date>2003-08-20T11:27:12Z</dc:date>
    </item>
    <item>
      <title>Re: cmcld: Halting node to preserve data integrity</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/cmcld-halting-node-to-preserve-data-integrity/m-p/3051825#M708170</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;Thank tou, the error was bad value of SERVICE_FAIL_FAST parameter.&lt;BR /&gt;&lt;BR /&gt;BR,&lt;BR /&gt;Mihails</description>
      <pubDate>Thu, 21 Aug 2003 15:20:22 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/cmcld-halting-node-to-preserve-data-integrity/m-p/3051825#M708170</guid>
      <dc:creator>Mihails Nikitins</dc:creator>
      <dc:date>2003-08-21T15:20:22Z</dc:date>
    </item>
  </channel>
</rss>

