Operating System - HP-UX

cmquerycl fails with the error message: Error 231 (Software caused connection abort)

 
amit mehta_2
Regular Advisor


Hi,

While trying to form a 4-node cluster, the cmquerycl command fails with the following error message:

Error 231 (Software caused connection abort) performing security validation. Please verify that identd is running properly.
Unable to connect to node hpia4: Software caused connection abort.

The complete output of the cmquerycl command:
root@hpia4/ >cmquerycl -v -C /etc/cmcluster/ruby.ascii -q quorums -n hpia1 -n hpia2 -n hpia3 -n hpia4
Looking for other clusters ... Done
Gathering storage information
Error 231 (Software caused connection abort) performing security validation. Please verify that identd is running properly.
Unable to connect to node hpia4: Software caused connection abort
Found 45 devices on node hpia1
Found 51 devices on node hpia2
Found 51 devices on node hpia3
Analysis of 147 devices should take approximately 10 seconds
0%----10%----20%----30%----40%----50%----60%----70%----80%----90%----100%
Found 1 volume groups on node hpia1
Found 1 volume groups on node hpia2
Found 1 volume groups on node hpia3
Analysis of 3 volume groups should take approximately 1 seconds
0%----10%----20%----30%----40%----50%----60%----70%----80%----90%----100%
Note: Disks were discovered which are not in use by either LVM or VxVM.
Use pvcreate(1M) to initialize a disk for LVM or,
use vxdiskadm(1M) to initialize a disk for VxVM.
Gathering network information
Beginning network probing
Not probing node hpia4 as it is currently unreachable.
This may cause network partitions to be reported.
Completed network probing
Failed to gather configuration information.

I was able to rlogin from hpia4 to itself and to the other nodes (hpia1, hpia2, hpia3).

If I remove hpia4 from the query, it works fine and I get the ASCII file.

Final message from cmquerycl:
Writing cluster data to /etc/cmcluster/tmp.ascii.

So it seems something is wrong with this node only.
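
To make the failing node easy to spot in a long run, the "Unable to connect to node" lines can be pulled out of saved cmquerycl output. This is only a minimal sketch: the function name unreachable_nodes is made up for illustration, and it assumes the message format is exactly the one shown in the output above.

```shell
# Hypothetical helper: reads cmquerycl output on stdin and prints each
# node named in an "Unable to connect to node <name>:" line, once.
# Assumes the message wording shown in the output above.
unreachable_nodes() {
    sed -n 's/^Unable to connect to node \([^:]*\):.*/\1/p' | sort -u
}
```

For example, saving the verbose output to a file and running unreachable_nodes < that file would list hpia4 for the run above.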

The contents of /.rhosts and /etc/

root@hpia4/ >cat /.rhosts
+

root@hpia4/ >cat /etc/cmcluster/cmclnodelist
+


root@ruby8/ >tail /etc/inetd.conf

#ncpm-pm dgram udp wait root /opt/ncpm/bin/ncpmd ncpmd
#ncpm-hip dgram udp wait root /opt/ncpm/bin/hipd hipd

swat stream tcp nowait.400 root /opt/samba/bin/swat swat
recserv stream tcp nowait root /usr/lbin/recserv recserv -display :0
hacl-probe stream tcp nowait root /opt/cmom/lbin/cmomd /opt/cmom/lbin/cmomd -f /var/opt/cmom/cmomd.log -r /var/opt/cmom
registrar stream tcp6 nowait root /etc/opt/resmon/lbin/registrar /etc/opt/resmon/lbin/registrar
hacl-cfg dgram udp wait root /usr/lbin/cmclconfd cmclconfd -p
hacl-cfg stream tcp nowait root /usr/lbin/cmclconfd cmclconfd -c

Please let me know what other information I should provide.

~amit
Steven E. Protter
Exalted Contributor

Re: cmquery fails giving the error message:Error 231 (Software caused connection abort)

Shalom,

Your information is pretty complete, but it does not include the OS version (uname -a) or the version of Serviceguard.

swlist -l product | grep -i serviceguard

Certain newer versions require changes to the last two lines of your inetd.conf.

Recommendations:

Try the command again excluding the troublesome node. If it works, then concentrate on that node.

Remove the binary cluster configuration file from the troublesome node and try again with the full command.
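
One way to narrow this down is to run cmquerycl against each node on its own and note which ones fail. The following is a rough sketch only: probe_nodes and the CMQUERYCL variable are hypothetical names, and it assumes cmquerycl exits non-zero when it cannot gather configuration from a node, which should be checked on your release.

```shell
# Command to run; can be overridden to do a dry run with a stand-in.
CMQUERYCL=${CMQUERYCL:-cmquerycl}

# Hypothetical helper: query each node named on the command line by
# itself and report "ok" or "FAILED" based on the exit status.
# Assumption: a failed query returns a non-zero exit status.
probe_nodes() {
    for node in "$@"; do
        if "$CMQUERYCL" -n "$node" >/dev/null 2>&1; then
            echo "$node ok"
        else
            echo "$node FAILED"
        fi
    done
}
```

Called as probe_nodes hpia1 hpia2 hpia3 hpia4, it prints one line per node, so the node to concentrate on stands out.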

SEP
Steven E Protter
Owner of ISN Corporation
http://isnamerica.com
http://hpuxconsulting.com
Sponsor: http://hpux.ws
Twitter: http://twitter.com/hpuxlinux
Founder http://newdatacloud.com
amit mehta_2
Regular Advisor

Re: cmquery fails giving the error message:Error 231 (Software caused connection abort)

Hi Steven,

We figured out the problem!
One of the kernel modules (rng) had failed to load.
After loading it manually with
$ kcmodule rng=loaded
I could query the node (hpia4).
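
For anyone checking the same thing, here is a small sketch for reading the module state before loading it. The helper name module_state is made up, and it assumes that kcmodule rng prints a table whose first column is the module name and whose second column is the state; please verify that layout on your own system.

```shell
# Hypothetical helper: reads "kcmodule <name>" style output on stdin and
# prints the second column (assumed to be the state) for module $1.
# Returns non-zero if the module does not appear in the input.
module_state() {
    awk -v m="$1" '$1 == m { print $2; found = 1 } END { exit !found }'
}
```

Usage would be along the lines of kcmodule rng | module_state rng; if the state shown is not a loaded one, kcmodule rng=loaded as above loads it.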

~amit