<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: SecondaryServerMonitoringDaemon cannot start on the compute node in Server Clustering</title>
    <link>https://community.hpe.com/t5/server-clustering/secondaryservermonitoringdaemon-cannot-start-on-the-compute-node/m-p/6805788#M607</link>
    <description>&lt;P&gt;Hi Pradeep,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you for your hint to raise the debug level.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I found messages like the following in the MainMonitoringDaemon_asv11slt1.log:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;[26-Oct-2015_11:03:25] [CMUResultReceiver ]&amp;nbsp; Received data from &amp;lt;10.250.128.153&amp;gt; instead of &amp;lt;172.23.99.34&amp;gt;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;Will send single_halt order to SEC monitoring daemon of node &amp;lt;10.250.128.153&amp;gt; immediately CheckStopCondition&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;172.23.99.34 - that is the IP configured in the management network. This IP is configured in CMU for the compute node asv12slt1.&lt;/P&gt;&lt;P&gt;10.250.128.153 - that is the traffic IP address of the same node. This IP is configured in the DNS entry for the node asv12slt1.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As a workaround I changed the node name in CMU to asv12slt1-mgm. Now the CMU IP address and the DNS entry match, and the issue is resolved.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, I still see the errors for data collection. I assume I have to adapt the ActionAndAlertsFile to RHEL 7 (e.g. because the network interfaces are no longer named eth0, eth1). I attached the logs from the compute node with those errors.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Or can you provide me with an updated version of this configuration file for RHEL 7?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The project is a proof of concept for an automotive customer in Germany. 
Please contact me internally at HP if you would like more details.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Many thanks for your support!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Mon, 26 Oct 2015 16:32:16 GMT</pubDate>
    <dc:creator>rkrassow</dc:creator>
    <dc:date>2015-10-26T16:32:16Z</dc:date>
    <item>
      <title>SecondaryServerMonitoringDaemon cannot start on the compute node</title>
      <link>https://community.hpe.com/t5/server-clustering/secondaryservermonitoringdaemon-cannot-start-on-the-compute-node/m-p/6804712#M605</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I cannot enable CMU monitoring in my cluster. After starting the monitoring engine on the CMU server, the SecondaryServerMonitoringDaemon process cannot be started on the compute node. The process crashes after 5 seconds. As far as I understand, CMU tries to restart the process every 30 seconds.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;-&amp;gt; ssh-keys are distributed for user root&lt;/P&gt;&lt;P&gt;-&amp;gt; firewall is currently disabled&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The installed versions and the logs are provided below. Can somebody give me a hint on how to proceed here?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you in advance, Rostislaw&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;PS: where can I open a support ticket for CMU?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT face="arial black,avant garde" size="1"&gt;Environment - CMU Server:&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[root@asv11slt1 log]# uname -a&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;Linux asv11slt1 3.10.0-229.el7.x86_64 #1 SMP Thu Jan 29 18:37:38 EST 2015 x86_64 x86_64 x86_64 GNU/Linux&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[root@asv11slt1 log]# java -version&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;java version "1.7.0_75"&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;OpenJDK Runtime Environment (rhel-2.5.4.2.el7_0-x86_64 u75-b13)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;OpenJDK 64-Bit Server VM (build 24.75-b04, mixed mode)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[root@asv11slt1 log]# rpm -qa | grep cmu&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;cmu-7.3.2-1.x86_64&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;FONT
face="arial black,avant garde" size="1"&gt;Environment - compute node:&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[root@asv12slt1 log]# uname -a&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;Linux asv12slt1 3.10.0-229.el7.x86_64 #1 SMP Thu Jan 29 18:37:38 EST 2015 x86_64 x86_64 x86_64 GNU/Linux&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[root@asv12slt1 log]# java -version&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;java version "1.8.0_60"&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;Java(TM) SE Runtime Environment (build 1.8.0_60-b27)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;Java HotSpot(TM) 64-Bit Server VM (build 25.60-b23, mixed mode)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[root@asv12slt1 log]# rpm -qa | grep cmu&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;cmu_cn-7.3.2-1.x86_64&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT face="arial black,avant garde" size="1"&gt;# CMU Server log&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[root@asv11slt1 log]# tail MainMonitoringDaemon_asv11slt1.log&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:09:29] [CMUSecReelector&amp;nbsp;&amp;nbsp; ] Electing new SEC (asv12slt1 is no longer SEC for NE 0) MonitSrMainRoutine&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:09:30] [CMUSecReelector&amp;nbsp;&amp;nbsp; ]&amp;nbsp; spawning asv12slt1 as new SEC for NE 0 MonitSrMainRoutine&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:09:59] [CMUSecReelector&amp;nbsp;&amp;nbsp; ] Electing new SEC (asv12slt1 is no longer SEC for NE 0) MonitSrMainRoutine&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:10:00] 
[CMUSecReelector&amp;nbsp;&amp;nbsp; ]&amp;nbsp; spawning asv12slt1 as new SEC for NE 0 MonitSrMainRoutine&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:10:29] [CMUSecReelector&amp;nbsp;&amp;nbsp; ] Electing new SEC (asv12slt1 is no longer SEC for NE 0) MonitSrMainRoutine&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:10:30] [CMUSecReelector&amp;nbsp;&amp;nbsp; ]&amp;nbsp; spawning asv12slt1 as new SEC for NE 0 MonitSrMainRoutine&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:10:59] [CMUSecReelector&amp;nbsp;&amp;nbsp; ] Electing new SEC (asv12slt1 is no longer SEC for NE 0) MonitSrMainRoutine&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:11:00] [CMUSecReelector&amp;nbsp;&amp;nbsp; ]&amp;nbsp; spawning asv12slt1 as new SEC for NE 0 MonitSrMainRoutine&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:11:29] [CMUSecReelector&amp;nbsp;&amp;nbsp; ] Electing new SEC (asv12slt1 is no longer SEC for NE 0) MonitSrMainRoutine&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:11:30] [CMUSecReelector&amp;nbsp;&amp;nbsp; ]&amp;nbsp; spawning asv12slt1 as new SEC for NE 0 MonitSrMainRoutine&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;&lt;FONT face="arial black,avant garde" size="1"&gt;# compute node logs&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[root@asv12slt1 log]# cat SecondaryServerMonitoring_asv12slt1.log&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:10:04] [CMUstartup&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ]&amp;nbsp; thread test [START]...&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:10:04] [CMUstartup&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ]&amp;nbsp; thread 
test... [STOP]&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:10:04] [CMUFileLockTools&amp;nbsp; ]&amp;nbsp; mypid is 19306 CMUGetMonitoringDaemonLockFile&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:10:04] [CMUstartup&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ] monitoring synchro is on&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:10:04] [CMUstartup&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ] monitoring memlock is off&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:10:04] [CMUstartup&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ] monitoring realtime priority parameter is 0&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:10:04] [CMUFileLockTools&amp;nbsp; ]&amp;nbsp; mypid is 19310 CMUGetMonitoringDaemonLockFile&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:10:04] [CMUFileLockTools&amp;nbsp; ]&amp;nbsp; killing process 19306 : CMUKillDaemon&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:10:09] [CMUslaveListener&amp;nbsp; ] Halt single daemon msg received, exiting program MonitSlActOnMessageReceived&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:10:09] [CMUPthreadTools&amp;nbsp;&amp;nbsp; ] Fatal, thread_cancel failed could not find thread CMUThreadCancel&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:10:09] [CMUPthreadTools&amp;nbsp;&amp;nbsp; ] [Fatal] Error while trying to kill thread MonitRsKillThread&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:10:09] [Shutdown Module&amp;nbsp;&amp;nbsp; ]&amp;nbsp; Could not kill CS thread HaltMyThreadsAndDie&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT 
face="courier new,courier" size="1"&gt;[22-Oct-2015_16:10:09] [Shutdown Module&amp;nbsp;&amp;nbsp; ]&amp;nbsp; Stopping now HaltMyThreadsAndDie&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_16:10:09] [Shutdown Module&amp;nbsp;&amp;nbsp; ]&amp;nbsp; ------------ HaltMyThreadsAndDie&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier" size="1"&gt;[root@asv12slt1 log]# cat SmallMonitoringDaemon_asv12slt1.log&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_14:40:11] [CMUstartup&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ] Entering checkBigRegexpBuggy&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_14:40:11] [CMUstartup&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ]&amp;nbsp; size tested is 21000 checkBigRegexpBuggy&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_14:40:11] [CMUstartup&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ] Entering checkRedhat8Bug&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_14:40:11] [CMUstartup&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ] Entering checkThreadLibrary&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_14:40:11] [CMUstartup&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ]&amp;nbsp; thread test [START]...&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_14:40:11] [CMUstartup&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ]&amp;nbsp; thread test... 
[STOP]&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_14:40:11] cmuconf&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; =(null)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;,cmuconf_compl&amp;nbsp;&amp;nbsp; =(null)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;,cmu_cluster_conf=(null)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;,AAFile&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; =/opt/cmu/etc/ActionAndAlertsFile.txt&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;,MAFile&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; =(null)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;,debugLevelMMD&amp;nbsp;&amp;nbsp; =(null)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;,debugLevelSEC&amp;nbsp;&amp;nbsp; =(null)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;,debugLevelSMD&amp;nbsp;&amp;nbsp; =1&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;,timestep&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; =5000000&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;,master_host_ip&amp;nbsp; =172.23.99.34&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;,sec_ip&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; =(null)&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;,incomingSLPort&amp;nbsp; =48557&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;,outgoingMSRRPort=48560&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;,outgoingMSHellop=49074&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" 
size="1"&gt;,do_synchro&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; =1&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;,do_memlock&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; =0&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;,do_realtime&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; =0&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;,cmu_mgt_node_ip =172.23.99.16&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;,host_ip&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; =172.23.99.34&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;,nodes_file_path =(null)&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_14:40:11] [CMUFileLockTools&amp;nbsp; ]&amp;nbsp; mypid is 1293 CMUGetMonitoringDaemonLockFile&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_14:40:11] [CMUstartup&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ] monitoring synchro is on&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_14:40:11] [CMUstartup&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ] monitoring memlock is off&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_14:40:11] [CMUstartup&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ] monitoring realtime priority parameter is 0&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_14:40:11] [CMUFileLockTools&amp;nbsp; ]&amp;nbsp; mypid is 1293 CMUGetMonitoringDaemonLockFile&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_14:40:11] [SmallMonitorDaemon]&amp;nbsp; not starting collectl client Main&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;...&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier 
new,courier" size="1"&gt;[22-Oct-2015_14:40:11] [(4)CMUResultSender]&amp;nbsp; Could not extract numerical part of &amp;lt;NOK&amp;nbsp;&amp;nbsp; 14&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;&amp;gt; MonitRsConvertToDouble&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_14:40:11] [(4)CMUResultSender]&amp;nbsp; [Warning] Data conversion failed for Action &amp;lt;eth1_MB/s_tx&amp;gt;value was --&amp;gt;NOK 14&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;&amp;lt;--, fix /opt/cmu/etc/ActionAndAlertsFile.txt&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_14:40:11] [CMUslaveListener&amp;nbsp; ] Halt single daemon msg received, exiting program MonitSlActOnMessageReceived&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_14:40:11] [CMUPthreadTools&amp;nbsp;&amp;nbsp; ] [MonitRsKillThread] Thread_join failed &amp;lt;Invalid argument&amp;gt; CMUThreadJoin&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_14:40:11] [Shutdown Module&amp;nbsp;&amp;nbsp; ]&amp;nbsp; Stopping now HaltMyThreadsAndDie&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="1"&gt;[22-Oct-2015_14:40:11] [Shutdown Module&amp;nbsp;&amp;nbsp; ]&amp;nbsp; ------------ HaltMyThreadsAndDie&lt;/FONT&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 22 Oct 2015 14:41:36 GMT</pubDate>
      <guid>https://community.hpe.com/t5/server-clustering/secondaryservermonitoringdaemon-cannot-start-on-the-compute-node/m-p/6804712#M605</guid>
      <dc:creator>rkrassow</dc:creator>
      <dc:date>2015-10-22T14:41:36Z</dc:date>
    </item>
    <item>
      <title>Re: SecondaryServerMonitoringDaemon cannot start on the compute node</title>
      <link>https://community.hpe.com/t5/server-clustering/secondaryservermonitoringdaemon-cannot-start-on-the-compute-node/m-p/6805561#M606</link>
      <description>&lt;P&gt;Hi Rostislaw,&lt;BR /&gt;&lt;BR /&gt;Please increase the monitoring debug levels CMU_MAIN_MONITORING_DEBUG_LEVEL, CMU_SEC_MONITORING_DEBUG_LEVEL and CMU_SMD_MONITORING_DEBUG_LEVEL to 3 in the /opt/cmu/etc/cmuserver.conf file, then restart monitoring on the head node using the steps below.&lt;BR /&gt;&lt;BR /&gt;# /opt/cmu/tools/cmu_stop_monitoring&lt;BR /&gt;(Wait for a few minutes)&lt;BR /&gt;# /opt/cmu/tools/cmu_start_monitoring&lt;BR /&gt;&lt;BR /&gt;Please wait a few minutes and capture the logs below.&lt;BR /&gt;&lt;BR /&gt;From the head node:&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;/opt/cmu/log/MainMonitoringDaemon_&amp;lt;server_name&amp;gt;.log&lt;BR /&gt;From the compute nodes where SecMD ran during this test:&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;/opt/cmu/log/SecondaryServerMonitoring_&amp;lt;server_name&amp;gt;.log&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;/opt/cmu/log/SmallMonitoringDaemon_&amp;lt;server_name&amp;gt;.log&lt;BR /&gt;&lt;BR /&gt;Also, please send us the "/opt/cmu/etc/ActionAndAlertsFile.txt" file from the management node, along with the output of "rpm --verify cmu" from the management node.&lt;BR /&gt;&lt;BR /&gt;Can you also run the following command on the management node? In the command below, &amp;lt;compute-node&amp;gt; is the secondary server node or any other node.&lt;BR /&gt;&lt;BR /&gt;# time ssh &amp;lt;compute-node&amp;gt; hostname&lt;BR /&gt;&lt;BR /&gt;We have seen issues where monitoring is unable to start on compute nodes 
due to ssh login delays.&lt;BR /&gt;If ssh logins to any of the compute nodes take a long time (i.e. 5 seconds), monitoring fails to start on those nodes.&lt;BR /&gt;Incorrect DNS/gateway settings on the nodes are the reason for such ssh login delays.&lt;BR /&gt;For more details, please refer to "Section 5.25 Monitoring fails to start on compute nodes due to ssh login delays" in the Insight CMU v7.3.2 Release Notes.&lt;BR /&gt;&lt;BR /&gt;&amp;gt;PS: where can I open a support ticket for CMU?&lt;BR /&gt;Please raise a case by calling your local HP Support Center.&lt;BR /&gt;&lt;BR /&gt;What is the name of the customer? Please also let us know the customer details.&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;Pradeep Kumar A.&lt;/P&gt;</description>
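The ssh timing check suggested in this reply could be scripted roughly as follows. This is only a sketch, assuming Python 3 and passwordless ssh from the management node; the function names and the way the 5 second figure is applied as a threshold are illustrative assumptions and not part of CMU.

```python
import subprocess
import time

# Threshold taken from the reply above: logins of about 5 seconds cause trouble.
SLOW_LOGIN_SECONDS = 5.0


def timed_run(command):
    """Run a command and return (elapsed_seconds, exit_code)."""
    start = time.monotonic()
    completed = subprocess.run(
        command, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL
    )
    return time.monotonic() - start, completed.returncode


def is_slow(elapsed, threshold=SLOW_LOGIN_SECONDS):
    """True when the measured time reaches the threshold."""
    return elapsed >= threshold


def check_node(node):
    """Time 'ssh node hostname' (hypothetical helper) and report the result."""
    # BatchMode=yes makes ssh fail instead of prompting for a password.
    elapsed, code = timed_run(["ssh", "-o", "BatchMode=yes", node, "hostname"])
    verdict = "SLOW" if is_slow(elapsed) else "ok"
    return f"{node}: {elapsed:.2f}s exit={code} {verdict}"
```

Calling `print(check_node("asv12slt1"))` on the management node would give one line per node; a non-zero exit code means the login itself failed, which is a different problem from a slow login.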
      <pubDate>Mon, 26 Oct 2015 07:46:55 GMT</pubDate>
      <guid>https://community.hpe.com/t5/server-clustering/secondaryservermonitoringdaemon-cannot-start-on-the-compute-node/m-p/6805561#M606</guid>
      <dc:creator>Armugam_Pradeep</dc:creator>
      <dc:date>2015-10-26T07:46:55Z</dc:date>
    </item>
    <item>
      <title>Re: SecondaryServerMonitoringDaemon cannot start on the compute node</title>
      <link>https://community.hpe.com/t5/server-clustering/secondaryservermonitoringdaemon-cannot-start-on-the-compute-node/m-p/6805788#M607</link>
      <description>&lt;P&gt;Hi Pradeep,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank you for your hint to raise the debug level.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I found messages like the following in the MainMonitoringDaemon_asv11slt1.log:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;FONT face="courier new,courier"&gt;[26-Oct-2015_11:03:25] [CMUResultReceiver ]&amp;nbsp; Received data from &amp;lt;10.250.128.153&amp;gt; instead of &amp;lt;172.23.99.34&amp;gt;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier"&gt;Will send single_halt order to SEC monitoring daemon of node &amp;lt;10.250.128.153&amp;gt; immediately CheckStopCondition&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;172.23.99.34 - that is the IP configured in the management network. This IP is configured in CMU for the compute node asv12slt1.&lt;/P&gt;&lt;P&gt;10.250.128.153 - that is the traffic IP address of the same node. This IP is configured in the DNS entry for the node asv12slt1.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As a workaround I changed the node name in CMU to asv12slt1-mgm. Now the CMU IP address and the DNS entry match, and the issue is resolved.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;However, I still see the errors for data collection. I assume I have to adapt the ActionAndAlertsFile to RHEL 7 (e.g. because the network interfaces are no longer named eth0, eth1). I attached the logs from the compute node with those errors.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Or can you provide me with an updated version of this configuration file for RHEL 7?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The project is a proof of concept for an automotive customer in Germany. 
Please contact me internally at HP if you would like more details.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Many thanks for your support!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
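The mismatch described in this post (the node name resolving to the traffic IP while CMU expects the management IP) can be checked before starting monitoring. The following is a minimal sketch, assuming Python 3 on the management node; the helper names are hypothetical, and the check only compares what the local resolver returns with the IP you configured in CMU.

```python
import socket


def resolved_ip(hostname, resolver=socket.gethostbyname):
    """Return the IPv4 address that the resolver gives for hostname."""
    return resolver(hostname)


def matches_configured_ip(hostname, configured_ip, resolver=socket.gethostbyname):
    """True when name resolution agrees with the IP configured in CMU."""
    return resolved_ip(hostname, resolver) == configured_ip


def describe(hostname, configured_ip, resolver=socket.gethostbyname):
    """Build a one-line report for a node (hypothetical helper)."""
    actual = resolved_ip(hostname, resolver)
    status = "match" if actual == configured_ip else "MISMATCH"
    return f"{hostname}: resolves to {actual}, CMU expects {configured_ip} ({status})"
```

With the addresses from this thread, `describe("asv12slt1", "172.23.99.34")` would be expected to report a mismatch in the original setup, since that name resolved to 10.250.128.153. The `resolver` parameter exists only so the logic can be exercised without real DNS.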
      <pubDate>Mon, 26 Oct 2015 16:32:16 GMT</pubDate>
      <guid>https://community.hpe.com/t5/server-clustering/secondaryservermonitoringdaemon-cannot-start-on-the-compute-node/m-p/6805788#M607</guid>
      <dc:creator>rkrassow</dc:creator>
      <dc:date>2015-10-26T16:32:16Z</dc:date>
    </item>
    <item>
      <title>Re: SecondaryServerMonitoringDaemon cannot start on the compute node</title>
      <link>https://community.hpe.com/t5/server-clustering/secondaryservermonitoringdaemon-cannot-start-on-the-compute-node/m-p/6805967#M608</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This warning "&lt;FONT face="courier new,courier" size="1"&gt;[Warning] Data conversion failed for Action &amp;lt;eth1_MB/s_tx&amp;gt;value was --&amp;gt;NOK&lt;/FONT&gt;" is expected.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;It looks like your compute node NICs are not detected with legacy NIC names such as eth0, eth1, etc. I expect they follow the RHEL 7 persistent NIC naming schemes, such as eno1, eno2, ens1, ens3f0, ens3f1, em1, em2, etc.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This RHEL 7 naming scheme affects some of the NIC-related monitoring metrics, such as "eth0_MB/s_rx", "eth1_MB/s_rx", etc. Please add the appropriate network interface ACTION to the /opt/cmu/etc/ActionAndAlertsFile.txt file and restart monitoring.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For more details, please refer to Section 5.21 in the Insight CMU v7.3.2 Release Notes.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;There is no ActionAndAlertsFile that uses the RHEL 7 naming scheme. Please change the ACTION in ActionAndAlertsFile.txt and restart monitoring.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hope it helps.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Pradeep A.&lt;/P&gt;</description>
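To see which interface names a compute node actually exposes before editing the ACTION entries, something like the sketch below may help. It assumes Python 3 on a Linux node and reads the standard `/sys/class/net` directory; the function names are hypothetical, and the sketch deliberately does not generate any ActionAndAlertsFile.txt syntax, since that format is described in the release notes rather than here.

```python
import os
import re

# Legacy kernel names such as eth0, eth1.
LEGACY_NAME = re.compile(r"eth\d+$")


def split_by_naming(names):
    """Split interface names into (legacy eth-style, everything else)."""
    legacy = sorted(n for n in names if LEGACY_NAME.match(n))
    other = sorted(n for n in names if not LEGACY_NAME.match(n))
    return legacy, other


def list_interfaces(sysfs_dir="/sys/class/net"):
    """List network interface names known to the Linux kernel."""
    return os.listdir(sysfs_dir)
```

Running `split_by_naming(list_interfaces())` on the compute node shows whether any eth-style names remain; the names in the second list (other than `lo`) are the candidates to use in place of eth0 and eth1 in the metrics.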
      <pubDate>Tue, 27 Oct 2015 08:58:19 GMT</pubDate>
      <guid>https://community.hpe.com/t5/server-clustering/secondaryservermonitoringdaemon-cannot-start-on-the-compute-node/m-p/6805967#M608</guid>
      <dc:creator>Armugam_Pradeep</dc:creator>
      <dc:date>2015-10-27T08:58:19Z</dc:date>
    </item>
  </channel>
</rss>

