rhel 5 cluster problem...........

 
Viral Ahire
Occasional Contributor

Hi,
I have configured a 2-node cluster (Red Hat Cluster Suite) with ProLiant 580 G4 servers and MSA500 storage.
The cluster is configured on RHEL 5 64-bit.
The problem is that when I stop the cluster application (Oracle), the node where the application was running leaves the cluster, its cluster service stops, and the server is rebooted due to fencing.
HP iLO is used for fencing.
I have attached the logs and the cluster.conf file.

Please help me solve this problem.

regards,
viral ahire
5 REPLIES
Ivan Ferreira
Honored Contributor

Re: rhel 5 cluster problem...........

It looks like your stop script needs some checking. It should return 0 if the database was stopped successfully.


Jun 30 11:21:03 psfhost1 clurgmgrd: [3767]: script:Oracle: stop of /etc/init.d/orcl failed (returned 127)
Jun 30 11:21:03 psfhost1 clurgmgrd[3767]: stop on script "Oracle" returned 1 (generic error)

There you can see that the script "Oracle" returned 1.

You must ensure that your start and stop scripts work correctly before using the cluster commands.
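One way to check the script outside the cluster is to run each action by hand and look at the exit status. The sketch below is only an illustration: the helper names `describe_status` and `check_action` are hypothetical, not part of RHCS. By shell convention, status 127 means "command not found" and 126 means "found but not executable", so the 127 in the log above suggests that the script, or something it calls, could not be found when rgmanager ran it.

```shell
#!/bin/sh
# Hypothetical helpers for testing an init script by hand.
# Nothing here is part of RHCS; the names are made up for this sketch.

# describe_status CODE: translate a shell exit status into words.
describe_status() {
    case "$1" in
        0)   echo "success" ;;
        126) echo "found but not executable" ;;
        127) echo "command not found" ;;
        *)   echo "failed with status $1" ;;
    esac
}

# check_action SCRIPT ACTION: run SCRIPT with ACTION, report and
# return its exit status (the value the cluster would also see).
check_action() {
    "$1" "$2" >/dev/null 2>&1
    rc=$?
    echo "$1 $2 returned $rc ($(describe_status "$rc"))"
    return "$rc"
}
```

For example, `check_action /etc/init.d/orcl stop` followed by `check_action /etc/init.d/orcl start`, run as root on the node that owns the service, should both report 0 before the script is handed to the cluster.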
Por que hacerlo dificil si es posible hacerlo facil? - Why do it the hard way, when you can do it the easy way?
Steven E. Protter
Exalted Contributor

Re: rhel 5 cluster problem...........

Shalom,

Any time a script called by the cluster returns a return code other than 0 (zero), it is game over for the service.

Often you will need to customize the script.

RHCS also does a stop on the service at startup, even if the service is not running.

The script must be modified to detect this condition and return code zero when it occurs.
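A minimal sketch of that idea follows, assuming the database can be detected through its `ora_pmon_<SID>` background process. The SID `orcl`, the `su - oracle -c dbshut` stop command, and the function names are assumptions for illustration, not taken from the attached script; adjust them to your installation.

```shell
#!/bin/sh
# Sketch of a stop action that treats "already stopped" as success.
# ORACLE_SID, RUNNING_CHECK and STOP_CMD are assumed values.
ORACLE_SID="${ORACLE_SID:-orcl}"
RUNNING_CHECK="${RUNNING_CHECK:-pgrep -f ora_pmon_${ORACLE_SID}}"
STOP_CMD="${STOP_CMD:-su - oracle -c dbshut}"

# is_running: exit 0 if the database appears to be up.
is_running() {
    $RUNNING_CHECK >/dev/null 2>&1
}

# stop_db: return 0 when the database is down afterwards, 1 otherwise.
stop_db() {
    if ! is_running; then
        echo "database not running, nothing to stop"
        return 0    # the cluster's initial stop must not count as a failure
    fi
    $STOP_CMD || return 1
    if is_running; then
        return 1    # stop command ran but the database is still up
    fi
    return 0
}

# main ACTION: dispatcher in the usual init-script style.
main() {
    case "$1" in
        stop)   stop_db ;;
        status) is_running ;;
        *)      echo "usage: $0 {stop|status}" >&2; return 2 ;;
    esac
}
# A real init script would end with:  main "$@"; exit $?
```

The point of the design is that the exit status describes the final state (database down), not whether the stop command had any work to do.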

SEP
Steven E Protter
Owner of ISN Corporation
http://isnamerica.com
http://hpuxconsulting.com
Sponsor: http://hpux.ws
Twitter: http://twitter.com/hpuxlinux
Founder http://newdatacloud.com
Viral Ahire
Occasional Contributor

Re: rhel 5 cluster problem...........

Hi Ivan,

I can start and stop Oracle manually. Even now the cluster is running Oracle in the live environment. The problem occurs only when I try to stop the cluster application (Oracle), so the current cluster setup will not be useful in a failover situation.

Actually, my 2-node cluster was working well, but I started facing this problem after I changed the hostname and IP address of both nodes. Because of these changes I had to reconfigure the cluster (through system-config-cluster).


Please help me resolve this problem.


regards,
Viral Ahire
skt_skt
Honored Contributor

Re: rhel 5 cluster problem...........

What process did you follow to change the hostname/IP?
Viral Ahire
Occasional Contributor

Re: rhel 5 cluster problem...........

To change the IP addresses, I edited the /etc/sysconfig/network-scripts/ifcfg-eth0 and ifcfg-eth1 files.

To change the hostname, I edited the /etc/sysconfig/network file.



Where are the experts?

I am waiting for a reply.


regards,
Viral Ahire