<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Package doesn't start in Operating System - Linux</title>
    <link>https://community.hpe.com/t5/operating-system-linux/package-doesn-t-start/m-p/4417213#M56325</link>
    <description>Hello,&lt;BR /&gt;&lt;BR /&gt;from the log files you posted, this is what I can see:&lt;BR /&gt;&lt;BR /&gt;- you are trying to run and monitor a service that consists of a single script&lt;BR /&gt;&lt;BR /&gt;- the script is run by Serviceguard during package startup, performs its commands and exits normally (exit 0)&lt;BR /&gt;&lt;BR /&gt;- Serviceguard interprets this exit as a service failure and tries to restart the service. The script is run again and again exits with return code 0&lt;BR /&gt;&lt;BR /&gt;- after restarting the service twice (the limit set by SERVICE_RESTART[0]="-r 2"), Serviceguard assumes that the service is unable to run on the current node and halts the package due to a service failure&lt;BR /&gt;&lt;BR /&gt;I believe this works as expected. In my opinion, the service to be monitored should be a constantly running process.&lt;BR /&gt;&lt;BR /&gt;J.</description>
    <pubDate>Mon, 11 May 2009 11:17:18 GMT</pubDate>
    <dc:creator>Jozef_Novak</dc:creator>
    <dc:date>2009-05-11T11:17:18Z</dc:date>
    <item>
      <title>Package doesn't start</title>
      <link>https://community.hpe.com/t5/operating-system-linux/package-doesn-t-start/m-p/4417212#M56324</link>
      <description>I've been trying to configure SG, but I've run into a problem that I can't solve.&lt;BR /&gt;When the parameters SERVICE_NAME[0], SERVICE_CMD[0] and SERVICE_RESTART[0] are present in the control script, the package fails to start. If these parameters are commented out, everything works fine.&lt;BR /&gt;The service name is identical in the control script and the package configuration file.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;package configuration file (pkg1.config):&lt;BR /&gt;PACKAGE_NAME                    pkg1&lt;BR /&gt;PACKAGE_TYPE                    FAILOVER&lt;BR /&gt;NODE_NAME                       sg1&lt;BR /&gt;NODE_NAME                       sg2&lt;BR /&gt;AUTO_RUN                        YES&lt;BR /&gt;NODE_FAIL_FAST_ENABLED                  NO&lt;BR /&gt;RUN_SCRIPT                      /usr/local/cmcluster/conf/pkg1/pkg1.sh&lt;BR /&gt;HALT_SCRIPT                     /usr/local/cmcluster/conf/pkg1/pkg1.sh&lt;BR /&gt;RUN_SCRIPT_TIMEOUT                      NO_TIMEOUT&lt;BR /&gt;HALT_SCRIPT_TIMEOUT                     NO_TIMEOUT&lt;BR /&gt;SUCCESSOR_HALT_TIMEOUT                  NO_TIMEOUT&lt;BR /&gt;FAILOVER_POLICY                 CONFIGURED_NODE&lt;BR /&gt;FAILBACK_POLICY                 MANUAL&lt;BR /&gt;PRIORITY                        NO_PRIORITY&lt;BR /&gt;MONITORED_SUBNET                10.1.40.0&lt;BR /&gt;SERVICE_NAME                    pkg1.monitor&lt;BR /&gt;SERVICE_FAIL_FAST_ENABLED       no&lt;BR /&gt;SERVICE_HALT_TIMEOUT            300&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;control script (pkg1.sh):&lt;BR /&gt;sglinux[0]=1 &amp;gt;/dev/null 2&amp;gt;&amp;amp;1&lt;BR /&gt;if [ $? -gt 0 ]; then&lt;BR /&gt;    exec /bin/bash2 -c "$0 $*"&lt;BR /&gt;    exit 1&lt;BR /&gt;fi&lt;BR /&gt;. 
${SGCONFFILE:=/etc/cmcluster.conf}&lt;BR /&gt;PATH=$SGSBIN:/bin:/sbin:/usr/bin:/usr/sbin&lt;BR /&gt;GFS="NO"&lt;BR /&gt;DATA_REP="none"&lt;BR /&gt;VG[0]=vg01&lt;BR /&gt;LV[0]=/dev/vg01/lvol00; FS[0]=/srv/iscsi1; FS_TYPE[0]="ext3"; FS_MOUNT_OPT[0]="-o rw"&lt;BR /&gt;FS_UMOUNT_OPT[0]=""; FS_FSCK_OPT[0]=""&lt;BR /&gt;FS_UMOUNT_COUNT=1&lt;BR /&gt;FS_MOUNT_RETRY_COUNT=0&lt;BR /&gt;CONCURRENT_FSCK_OPERATIONS=1&lt;BR /&gt;CONCURRENT_MOUNT_AND_UMOUNT_OPERATIONS=1&lt;BR /&gt;IP[0]=10.1.40.223&lt;BR /&gt;SUBNET[0]=10.1.40.0&lt;BR /&gt;PR_TYPE_WERO="--prout-type=5"&lt;BR /&gt;ABORT_KEY="--param-sark"&lt;BR /&gt;SG_PERSIST_RDKEYS="sg_persist -k"&lt;BR /&gt;SG_PERSIST_RDRESV="sg_persist -r"&lt;BR /&gt;SG_PERSIST_REG="sg_persist --out -G --param-sark"&lt;BR /&gt;SG_PERSIST_REG_IGN="sg_persist --out -I --param-sark"&lt;BR /&gt;SG_PERSIST_UNREG="sg_persist --out -G --param-rk"&lt;BR /&gt;SG_PERSIST_RESV="sg_persist --out -R --param-rk"&lt;BR /&gt;SG_PERSIST_PREEMPT="sg_persist --out -A --param-rk"&lt;BR /&gt;SG_PERSIST_CLEAR="sg_persist --out -C --param-rk"&lt;BR /&gt;SERVICE_NAME[0]=pkg1.monitor&lt;BR /&gt;SERVICE_CMD[0]="/usr/local/cmcluster/conf/pkg1/pkg1.mon"&lt;BR /&gt;SERVICE_RESTART[0]="-r 2"&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;pkg1.mon: (content is silly, it's not important at the moment, probably. The script has to work and it works. 
A new line is appended to the czas.log file during cmrunpkg) &lt;BR /&gt;#!/bin/sh&lt;BR /&gt;/bin/echo `/bin/date` &amp;gt;&amp;gt; /tmp/czas.log&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;The following lines appear in the log files when I run cmrunpkg with the SERVICE_xxx[0] lines in pkg1.sh uncommented:&lt;BR /&gt;&lt;BR /&gt;/var/log/messages&lt;BR /&gt;&lt;BR /&gt;May 11 12:55:51 sg1 xinetd[3016]: EXIT: hacl-cfgudp status=0 pid=7192 duration=17(sec)&lt;BR /&gt;May 11 12:55:55 sg1 xinetd[3016]: START: hacl-cfgudp pid=7212 from=127.0.0.1&lt;BR /&gt;May 11 12:55:55 sg1 cmrunpkg: cmrunpkg -v pkg1 &lt;BR /&gt;May 11 12:55:55 sg1 xinetd[3016]: START: hacl-cfg pid=7217 from=127.0.0.1&lt;BR /&gt;May 11 12:55:55 sg1 xinetd[7217]: USERID: hacl-cfg UNIX :root&lt;BR /&gt;May 11 12:55:55 sg1 cmrunpkg: Request from root on node sg1 to start package pkg1&lt;BR /&gt;May 11 12:55:55 sg1 cmcld[17505]: Request from root on node sg1 to start package pkg1 &lt;BR /&gt;May 11 12:55:55 sg1 cmcld[17505]: Request from node sg1 to start package pkg1 on node sg1. &lt;BR /&gt;May 11 12:55:55 sg1 cmcld[17505]: Executing '/usr/local/cmcluster/conf/pkg1/pkg1.sh  start' for package pkg1, as service PKG*44033. &lt;BR /&gt;May 11 12:55:55 sg1 cmserviced[17514]: Request to perform run service PKG*44033 &lt;BR /&gt;May 11 12:55:55 sg1 xinetd[3016]: START: hacl-cfg pid=7235 from=127.0.0.1&lt;BR /&gt;May 11 12:55:55 sg1 xinetd[7235]: USERID: hacl-cfg UNIX :root&lt;BR /&gt;May 11 12:55:55 sg1 xinetd[3016]: EXIT: hacl-cfg status=0 pid=7235 duration=0(sec)&lt;BR /&gt;May 11 12:55:55 sg1 xinetd[3016]: START: hacl-cfg pid=7250 from=127.0.0.1&lt;BR /&gt;May 11 12:55:55 sg1 xinetd[7250]: USERID: hacl-cfg UNIX :root&lt;BR /&gt;May 11 12:55:55 sg1 xinetd[3016]: EXIT: hacl-cfg status=0 pid=7250 duration=0(sec)&lt;BR /&gt;May 11 12:55:56 sg1 kernel: kjournald starting.  
Commit interval 5 seconds&lt;BR /&gt;May 11 12:55:56 sg1 kernel: EXT3 FS on dm-2, internal journal&lt;BR /&gt;May 11 12:55:56 sg1 kernel: EXT3-fs: mounted filesystem with ordered data mode.&lt;BR /&gt;May 11 12:55:56 sg1 cmmodnet: cmmodnet -a -i 10.1.40.223 10.1.40.0 &lt;BR /&gt;May 11 12:55:56 sg1 avahi-daemon[2835]: Registering new address record for 10.1.40.223 on eth0.&lt;BR /&gt;May 11 12:55:56 sg1 avahi-daemon[2835]: Withdrawing address record for 10.1.40.223 on eth0.&lt;BR /&gt;May 11 12:55:56 sg1 avahi-daemon[2835]: Registering new address record for 10.1.40.223 on eth0.&lt;BR /&gt;May 11 12:55:56 sg1 cmrunserv: cmrunserv -r 2 pkg1.monitor &amp;gt;&amp;gt; /usr/local/cmcluster/conf/pkg1/pkg1.sh.log 2&amp;gt;&amp;amp;1 /usr/local/cmcluster/conf/pkg1/pkg1.mon &lt;BR /&gt;May 11 12:55:56 sg1 cmserviced[17514]: Request to perform run service pkg1.monitor &lt;BR /&gt;May 11 12:55:56 sg1 cmserviced[17514]: Service PKG*44033 terminated due to an exit(0). &lt;BR /&gt;May 11 12:55:56 sg1 cmcld[17505]: Started package pkg1 on node sg1. &lt;BR /&gt;May 11 12:55:56 sg1 xinetd[3016]: EXIT: hacl-cfg status=0 pid=7217 duration=1(sec)&lt;BR /&gt;May 11 12:55:56 sg1 cmserviced[17514]: Service pkg1.monitor terminated due to an exit(0). &lt;BR /&gt;May 11 12:55:56 sg1 cmserviced[17514]: Automatically restarted service pkg1.monitor for the 1st time after failure. &lt;BR /&gt;May 11 12:55:56 sg1 cmserviced[17514]: Service pkg1.monitor terminated due to an exit(0). &lt;BR /&gt;May 11 12:55:56 sg1 cmserviced[17514]: Automatically restarted service pkg1.monitor for the 2nd time after failure. &lt;BR /&gt;May 11 12:55:56 sg1 cmserviced[17514]: Service pkg1.monitor terminated due to an exit(0). &lt;BR /&gt;May 11 12:55:56 sg1 cmserviced[17514]: Request to perform run service PKG*44033 &lt;BR /&gt;May 11 12:55:56 sg1 cmcld[17505]: Service pkg1.monitor in package pkg1 has gone down. &lt;BR /&gt;May 11 12:55:56 sg1 cmcld[17505]: Disabled node sg1 from running package pkg1. 
&lt;BR /&gt;May 11 12:55:56 sg1 cmcld[17505]: Failing package pkg1 on node sg1 due to service failure. &lt;BR /&gt;May 11 12:55:56 sg1 cmcld[17505]: Request from node sg1 to fail package pkg1 on node sg1. &lt;BR /&gt;May 11 12:55:56 sg1 cmcld[17505]: Executing '/usr/local/cmcluster/conf/pkg1/pkg1.sh  stop' for package pkg1, as service PKG*44033. &lt;BR /&gt;May 11 12:55:56 sg1 cmhaltserv: cmhaltserv pkg1.monitor &lt;BR /&gt;May 11 12:55:56 sg1 cmserviced[17514]: Request to perform halt service pkg1.monitor &lt;BR /&gt;May 11 12:55:56 sg1 cmmodnet: cmmodnet -r -i 10.1.40.223 10.1.40.0 &lt;BR /&gt;May 11 12:55:56 sg1 avahi-daemon[2835]: Withdrawing address record for 10.1.40.223 on eth0.&lt;BR /&gt;May 11 12:55:56 sg1 xinetd[3016]: START: hacl-cfg pid=7391 from=127.0.0.1&lt;BR /&gt;May 11 12:55:56 sg1 xinetd[7391]: USERID: hacl-cfg UNIX :root&lt;BR /&gt;May 11 12:55:56 sg1 xinetd[3016]: EXIT: hacl-cfg status=0 pid=7391 duration=0(sec)&lt;BR /&gt;May 11 12:55:56 sg1 xinetd[3016]: START: hacl-cfg pid=7401 from=127.0.0.1&lt;BR /&gt;May 11 12:55:56 sg1 xinetd[7401]: USERID: hacl-cfg UNIX :root&lt;BR /&gt;May 11 12:55:56 sg1 xinetd[3016]: EXIT: hacl-cfg status=0 pid=7401 duration=0(sec)&lt;BR /&gt;May 11 12:55:56 sg1 cmserviced[17514]: Service PKG*44033 terminated due to an exit(0). &lt;BR /&gt;May 11 12:55:56 sg1 cmcld[17505]: Halted package pkg1 on node sg1. 
&lt;BR /&gt;May 11 12:56:11 sg1 xinetd[3016]: EXIT: hacl-cfgudp status=0 pid=7212 duration=16(sec)&lt;BR /&gt;May 11 12:56:18 sg1 xinetd[3016]: START: hacl-cfgudp pid=7425 from=127.0.0.1&lt;BR /&gt;May 11 12:56:18 sg1 xinetd[3016]: START: hacl-cfg pid=7430 from=127.0.0.1&lt;BR /&gt;May 11 12:56:18 sg1 xinetd[7430]: USERID: hacl-cfg UNIX :root&lt;BR /&gt;May 11 12:56:18 sg1 xinetd[3016]: EXIT: hacl-cfg status=0 pid=7430 duration=0(sec)&lt;BR /&gt;May 11 12:56:33 sg1 xinetd[3016]: EXIT: hacl-cfgudp status=0 pid=7425 duration=15(sec)&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;pkg1.sh.log&lt;BR /&gt;&lt;BR /&gt;###### Node "sg1.localdomain": Starting package at Mon May 11 12:55:55 CEST 2009 ######&lt;BR /&gt;WARNING: /dev/sdb1 does not support Persistent Reservations&lt;BR /&gt;WARNING: Persistent Reservations disabled for package&lt;BR /&gt;WARNING: In some configurations this may result in data&lt;BR /&gt;WARNING: corruption under certain conditions.  Please&lt;BR /&gt;WARNING: check the documentation for more details.&lt;BR /&gt;PR in: command not supported&lt;BR /&gt;Clearing existing PR key&lt;BR /&gt;Attempting to addtag to vg vg01...&lt;BR /&gt;addtag was successful on vg vg01.&lt;BR /&gt;May 11 12:55:56 - Node "sg1.localdomain": Activating volume group vg01 .&lt;BR /&gt;May 11 12:55:56 - Node "sg1.localdomain": Checking filesystems:&lt;BR /&gt;   /dev/vg01/lvol00&lt;BR /&gt;e2fsck 1.39 (29-May-2006)&lt;BR /&gt;/dev/vg01/lvol00: clean, 11/128000 files, 8444/256000 blocks&lt;BR /&gt;May 11 12:55:56 - Node "sg1.localdomain": Mounting /dev/vg01/lvol00 at /srv/iscsi1&lt;BR /&gt;May 11 12:55:56 - Node "sg1.localdomain": Adding IP address 10.1.40.223 to subnet 10.1.40.0&lt;BR /&gt;May 11 12:55:56 - Node "sg1.localdomain": Starting service pkg1.monitor using&lt;BR /&gt;         "/usr/local/cmcluster/conf/pkg1/pkg1.mon"&lt;BR /&gt;###### Node "sg1.localdomain": Package start completed at Mon May 11 12:55:56 CEST 2009 ######&lt;BR /&gt;&lt;BR /&gt;####### Node "sg1.localdomain": 
Halting package at Mon May 11 12:55:56 CEST 2009 #######&lt;BR /&gt;May 11 12:55:56 - Node "sg1.localdomain": Halting service pkg1.monitor&lt;BR /&gt;cmhaltserv: Service name pkg1.monitor is not running.&lt;BR /&gt;May 11 12:55:56 - Node "sg1.localdomain": Remove IP address 10.1.40.223 from subnet 10.1.40.0&lt;BR /&gt;May 11 12:55:56 - Node "sg1.localdomain": Unmounting filesystem on /srv/iscsi1&lt;BR /&gt;May 11 12:55:56 - Node "sg1.localdomain": Deactivating volume group vg01&lt;BR /&gt;Attempting to deltag to vg vg01...&lt;BR /&gt;deltag was successful on vg vg01.&lt;BR /&gt;PR in: command not supported&lt;BR /&gt;###### Node "sg1.localdomain": Package halt completed at Mon May 11 12:55:56 CEST 2009 ######&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;My configuration:&lt;BR /&gt;Red Hat Enterprise Linux 5.3, SG A.11.19.00 (demo version)&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;What am I doing wrong? Does anybody have any suggestions?&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Regards</description>
      <pubDate>Mon, 11 May 2009 10:26:25 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-linux/package-doesn-t-start/m-p/4417212#M56324</guid>
      <dc:creator>pfred</dc:creator>
      <dc:date>2009-05-11T10:26:25Z</dc:date>
    </item>
    <item>
      <title>Re: Package doesn't start</title>
      <link>https://community.hpe.com/t5/operating-system-linux/package-doesn-t-start/m-p/4417213#M56325</link>
      <description>Hello,&lt;BR /&gt;&lt;BR /&gt;from the log files you posted, this is what I can see:&lt;BR /&gt;&lt;BR /&gt;- you are trying to run and monitor a service that consists of a single script&lt;BR /&gt;&lt;BR /&gt;- the script is run by Serviceguard during package startup, performs its commands and exits normally (exit 0)&lt;BR /&gt;&lt;BR /&gt;- Serviceguard interprets this exit as a service failure and tries to restart the service. The script is run again and again exits with return code 0&lt;BR /&gt;&lt;BR /&gt;- after restarting the service twice (the limit set by SERVICE_RESTART[0]="-r 2"), Serviceguard assumes that the service is unable to run on the current node and halts the package due to a service failure&lt;BR /&gt;&lt;BR /&gt;I believe this works as expected. In my opinion, the service to be monitored should be a constantly running process.&lt;BR /&gt;&lt;BR /&gt;J.</description>
      <pubDate>Mon, 11 May 2009 11:17:18 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-linux/package-doesn-t-start/m-p/4417213#M56325</guid>
      <dc:creator>Jozef_Novak</dc:creator>
      <dc:date>2009-05-11T11:17:18Z</dc:date>
    </item>
    <item>
      <title>Re: Package doesn't start</title>
      <link>https://community.hpe.com/t5/operating-system-linux/package-doesn-t-start/m-p/4417214#M56326</link>
      <description>Your "service" script (/usr/local/cmcluster/conf/pkg1/pkg1.mon) is exiting. It should keep running forever when everything is OK, and exit only if something is wrong. When the service script exits, Serviceguard assumes something may be wrong. As configured, it attempts to restart the service script exactly 2 times.&lt;BR /&gt;&lt;BR /&gt;If the purpose of the service script is to monitor the state of the real application, the service script should be essentially an infinite loop running the necessary tests and then sleeping for a while before re-testing. &lt;BR /&gt;&lt;BR /&gt;If the actual application is slow to start up, you might want to add an extra sleep in the service script just before entering the testing loop, so that the application has plenty of time to start up before the testing loop begins.&lt;BR /&gt;&lt;BR /&gt;The "AUTO_RUN YES" setting is effective only when starting the entire cluster (usually with the cmruncl command). If you are starting the package with cmrunpkg, you must then "arm" the fail-over mechanism by running "cmmodpkg -e &amp;lt;package_name&amp;gt;". When you halt the package manually using the cmhaltpkg command, the AUTO_RUN setting is automatically disabled, disarming the fail-over.&lt;BR /&gt;&lt;BR /&gt;MK</description>
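The looping service script described in the reply above could look roughly like the following minimal sketch. It is not from the thread: the function names monitor and app_is_alive, the /var/run/myapp.pid path, and the 30 and 60 second values are all hypothetical placeholders, and the real health check depends on the application being protected.

```shell
#!/bin/sh
# Hypothetical replacement for pkg1.mon: keeps running while the
# application looks healthy and returns non-zero once a check fails.

# monitor CHECK_CMD INTERVAL [STARTUP_DELAY]
# Runs CHECK_CMD every INTERVAL seconds after an optional initial delay.
monitor() {
    check_cmd=$1
    interval=$2
    startup_delay=${3:-0}

    # Give a slow application time to come up before the first test.
    sleep "$startup_delay"

    # Keep looping for as long as the check succeeds.
    while $check_cmd; do
        sleep "$interval"
    done

    # Reaching this point means the check failed.
    return 1
}

# Example check (an assumption): the application is considered healthy
# while the PID stored in a hypothetical pid file is still alive.
app_is_alive() {
    [ -f /var/run/myapp.pid ] || return 1
    kill -0 "$(cat /var/run/myapp.pid)" 2>/dev/null
}

# When used as the Serviceguard service command, the script would end with:
#   monitor app_is_alive 30 60 || exit 1
```

With SERVICE_CMD[0] pointing at a script of this shape, the service process stays alive for as long as the check passes, so an exit of the script signals a problem with the application rather than normal completion.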
      <pubDate>Mon, 11 May 2009 11:44:42 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-linux/package-doesn-t-start/m-p/4417214#M56326</guid>
      <dc:creator>Matti_Kurkela</dc:creator>
      <dc:date>2009-05-11T11:44:42Z</dc:date>
    </item>
    <item>
      <title>Re: Package doesn't start</title>
      <link>https://community.hpe.com/t5/operating-system-linux/package-doesn-t-start/m-p/4417215#M56327</link>
      <description>Thank you for your help and thorough explanations.&lt;BR /&gt;I'm a beginner with SG, so I couldn't solve the problem myself.&lt;BR /&gt;Fortunately, I get the idea now.&lt;BR /&gt;&lt;BR /&gt;Regards&lt;BR /&gt;pfred</description>
      <pubDate>Thu, 14 May 2009 05:19:29 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-linux/package-doesn-t-start/m-p/4417215#M56327</guid>
      <dc:creator>pfred</dc:creator>
      <dc:date>2009-05-14T05:19:29Z</dc:date>
    </item>
  </channel>
</rss>

