Operating System - HP-UX
1834704 Members
2670 Online
110069 Solutions
New Discussion

Serviceguard w/ Sap Extentions not starting the remote app servers

 
jtfischer
Occasional Contributor

Serviceguard w/ Sap Extentions not starting the remote app servers

Hi,

Have a weird problem. Working with SG 3.12 on HPUX and trying to start 3 remote app servers running LINUX. I can connect with SSH just fine manually from root on the SG machine but when SG tries to do it...it wont work. I show

Node "gvh420": (start_addons_postci): TRACE POINT
Node "gvh420": (start_saposcol): TRACE POINT
Node "gvh420": (start_own_app): TRACE POINT
Node "gvh420": (start_own_app): Instance D00 on host gvl722 gets started
Node "gvh420": (start_app gvl722 00 p01adm LINUX r3 cleanipc 00 remove; sleep 10; D): TRACE POINT
Node "gvh420": (is_ip_local gvl722): TRACE POINT
Node "gvh420": (start_app gvl722 00 p01adm): Startup attempt on remote host...
Node "gvh420": (start_own_app): Instance D00 on host gvl723 gets started
Node "gvh420": (start_app gvl723 00 p01adm LINUX r3 cleanipc 00 remove; sleep 10; D): TRACE POINT
Node "gvh420": (watchdog): Watchdog initiated (PID: 25162 Timeout: 100 secs)
Node "gvh420": (is_ip_local gvl723): TRACE POINT
Node "gvh420": (start_app gvl723 00 p01adm): Startup attempt on remote host...
Node "gvh420": (watchdog): Watchdog initiated (PID: 25268 Timeout: 100 secs)
Node "gvh420": (start_own_app): Instance D00 on host gvl724 gets started
Node "gvh420": (start_own_app): Waiting for parallel startup sequence to finish...
Node "gvh420": (start_app gvl724 00 p01adm LINUX r3 cleanipc 00 remove; sleep 10; D): TRACE POINT
Node "gvh420": (wait_pidlist): TRACE POINT
Node "gvh420": (wait_pidlist): PIDlist watchdog initiated (PIDlist: 25029)
Node "gvh420": (wait_pidlist): PIDlist watchdog initiated (PIDlist: 25145)
Node "gvh420": (is_ip_local gvl724): TRACE POINT
Node "gvh420": (wait_pidlist): PIDlist watchdog initiated (PIDlist: 25277)
Node "gvh420": (start_app gvl724 00 p01adm): Startup attempt on remote host...
Node "gvh420": (watchdog): Watchdog initiated (PID: 25304 Timeout: 100 secs)
Node "gvh420": (start_own_app): Leaving parallel App-Server startup delay
Node "gvh420": (test_app gvl722 00 p01adm D 1): TRACE POINT
Node "gvh420": (is_ip_local gvl722): TRACE POINT
Node "gvh420": (test_app): Trying to connect via RFC to host gvl722 instance D00 ...
Node "gvh420": (watchdog): Watchdog initiated (PID: 25335 Timeout: 100 secs)
Node "gvh420": (test_app): No connection to instance D00
Node "gvh420": (test_app): Instance D00 not responding
Node "gvh420": (start_own_app): ERROR: Instance D00 startup problem detected
Node "gvh420": (test_app gvl723 00 p01adm D 1): TRACE POINT
Node "gvh420": (is_ip_local gvl723): TRACE POINT
Node "gvh420": (test_app): Trying to connect via RFC to host gvl723 instance D00 ...
Node "gvh420": (watchdog): Watchdog initiated (PID: 25439 Timeout: 100 secs)
Node "gvh420": (test_app): No connection to instance D00
Node "gvh420": (test_app): Instance D00 not responding
Node "gvh420": (start_own_app): ERROR: Instance D00 startup problem detected
Node "gvh420": (test_app gvl724 00 p01adm D 1): TRACE POINT
Node "gvh420": (is_ip_local gvl724): TRACE POINT
Node "gvh420": (test_app): Trying to connect via RFC to host gvl724 instance D00 ...
Node "gvh420": (watchdog): Watchdog initiated (PID: 25543 Timeout: 100 secs)
Node "gvh420": (test_app): No connection to instance D00
Node "gvh420": (test_app): Instance D00 not responding
Node "gvh420": (start_own_app): ERROR: Instance D00 startup problem detected
Node "gvh420": (start_addons_postciapp): TRACE POINT


config looks ok. It seems to be an issue with SSH on SG. Anyone else have tried to start a linux box via ssh and have it work successfully?

If I run

ssh p01adm@gvl722 startsap

it goes to the app server(gvl722) just fine and starts the dialog instance on the linux box. So ssh config doesnt seem to be an issue...just when its run from the sap extentions in SG.

Any help would be welcome.

Thanks,

John Fischer
4 REPLIES 4
Geoff Wild
Honored Contributor

Re: Serviceguard w/ Sap Extentions not starting the remote app servers

I don't use Linux for app servers nor ssh to start them...however - I may still be able to help...

Do you have a /etc/cmcluster/sap.function file?

Basically, I'm looking for the commands you use to ssh...

For example, with remsh, it looks like so:

remsh ${I} -l ${SIDADM} -n "csh -c \"${CLEANUP}~/startsap_${I}_${ASNR[$INR]}
r3 & \"" >/dev/null 2>&1

Rgds...Geoff
Proverbs 3:5,6 Trust in the Lord with all your heart and lean not on your own understanding; in all your ways acknowledge him, and he will make all your paths straight.
melvyn burnard
Honored Contributor

Re: Serviceguard w/ Sap Extentions not starting the remote app servers

you may want to check whether you have patched SgeSAP:
cumulative patch for SGeSAP 4.01:
HP-UX 11.11: PHSS_34636
HP-UX 11.23: PHSS_34637

cumulative patch for SGeSAP 3.12:
HP-UX 11.11: PHSS_30740
HP-UX 11.23: PHSS_30741
My house is the bank's, my money the wife's, But my opinions belong to me, not HP!
jtfischer
Occasional Contributor

Re: Serviceguard w/ Sap Extentions not starting the remote app servers

Thanks for all the help. Its was an issue in the customer.functions and it couldn't find where /bin/sh was on LINUX and failed.

working properly once it was told where the shell is and it found it.

Thanks,

John
jtfischer
Occasional Contributor

Re: Serviceguard w/ Sap Extentions not starting the remote app servers

Thanks for the help