
S/W fail over

 
Lai Nee Shyang_1
Frequent Advisor

S/W fail over

Hi there,

I have a new system implementation with MCSG. It is a 3-node cluster. The special thing about this system is that one of the machines runs 3 Oracle instances, and the HP guys propose 3 MCSG packages so that each Oracle instance can fail over individually. They plan to implement process monitoring scripts to trigger the failover. I always thought that MCSG protects against hardware/network failures, never against software failures.

Has anyone out there done something similar?
Any comments please.


Thanks and have a nice day.

Lai
: )
If it doesn't work, We'll make it work. If it works, We'll make it work better.
7 REPLIES
Alexander M. Ermes
Honored Contributor
Solution

Re: S/W fail over

Hi there.
Yes, we have something similar here.
We have set up the cluster so that if one of the packages goes down on one node, it switches to the other node, which has access to the shared disks. The Oracle instances then run on the second node. This is very useful if you need minimal downtime for the users. The package switch takes about 2 minutes or less.
Rgds
Alexander M. Ermes
.. and all these memories are going to vanish like tears in the rain! final words from Rutger Hauer in "Blade Runner"
Animesh Chakraborty
Honored Contributor

Re: S/W fail over

Hi,
Of course.
It does s/w (package) failover too.
Once in my office someone killed one of the Oracle processes by mistake, and the package started successfully on another node.


Thanks
Animesh
Did you take a backup?
linuxfan
Honored Contributor

Re: S/W fail over

Hi Lai,

It is very common to have a package defined for an Oracle instance, and in your case to have 3 packages defined for the 3 instances. This provides flexibility and keeps each instance separate, and you could (depending on your configuration) run each instance on a different node.
This way rolling upgrades become easier: you can upgrade Oracle on one node while the package is failed over to another node (though this can be done with just one Oracle package too).
Not sure if you have seen the doc "Managing MC/SG":
http://docs.hp.com/hpux/pdf/B3936-90045.pdf

-Ramesh
They think they know but don't. At least I know I don't know - Socrates
linuxfan
Honored Contributor

Re: S/W fail over

Hi Lai,

One more thing: as far as monitoring the Oracle databases is concerned, within MC/SG the monitoring scripts can be configured to monitor the Oracle listener as well as individual processes (smon, pmon, etc.).

You can actually write your own control/monitoring scripts, but HP also provides the Oracle toolkit ($$$), which you can modify if you want to.
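
As a rough illustration of the kind of process check described above, here is a minimal Python sketch. It assumes the usual Unix naming of Oracle background processes (ora_pmon_<SID>, ora_smon_<SID>) and a listener binary called tnslsnr. The function names and the SID "PROD1" are made up for the example, and this is not the HP toolkit's script. A real monitor would feed it the live process list and exit non-zero when something is missing.

```python
import os


def program_name(cmdline):
    """Return the basename of the first word of a command line ('' if empty)."""
    parts = cmdline.split()
    return os.path.basename(parts[0]) if parts else ""


def missing_processes(running, sid, check_listener=True):
    """Return the required Oracle process names that are not in `running`.

    `running` is a list of command lines (for example, the lines of a
    process listing). Names are compared exactly, so SID "PROD1" does
    not match a process that belongs to SID "PROD10".
    """
    required = ["ora_pmon_" + sid, "ora_smon_" + sid]
    if check_listener:
        required.append("tnslsnr")  # assumed listener binary name
    names = {program_name(cmd) for cmd in running}
    return [name for name in required if name not in names]


if __name__ == "__main__":
    # Hypothetical process list: smon for PROD1 has died.
    sample = [
        "ora_pmon_PROD1",
        "/u01/app/oracle/bin/tnslsnr LISTENER -inherit",
    ]
    missing = missing_processes(sample, "PROD1")
    if missing:
        print("missing:", ", ".join(missing))
        raise SystemExit(1)  # non-zero exit signals a failed service
```

Running one such check per instance fits the one-package-per-instance layout discussed in this thread, since each package then reacts only to its own instance.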

-HTH
Ramesh
They think they know but don't. At least I know I don't know - Socrates
Rainer von Bongartz
Honored Contributor

Re: S/W fail over

One thing to think about when protecting yourself against SW problems with MC/SG: normally it makes no sense to switch the package over to another node because of a SW problem. If your nodes are configured the same way, the SW problem will also happen on the next box. So configure MC/SG to restart your package on the same box, and to restart it only once after a SW error.

I once had a SW problem where my database crashed immediately after startup and MC/SG restarted it, and then
crash, restart, crash, restart...
about 400 times during one night.
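
The difference between a restart limit and no limit can be modelled in a few lines of Python. This is only a sketch of the decision, not Serviceguard configuration; the names next_action and count_restarts and the default limit of 1 are made up for the illustration.

```python
def next_action(restarts_so_far, max_restarts=1):
    """Decide what to do after a service failure on the current node.

    max_restarts=None models an unlimited restart setting.
    """
    if max_restarts is None or restarts_so_far < max_restarts:
        return "restart"
    return "halt"


def count_restarts(crashes, max_restarts=1):
    """Count local restarts for a service that would crash `crashes` times in a row."""
    restarts = 0
    for _ in range(crashes):
        if next_action(restarts, max_restarts) == "halt":
            break
        restarts += 1
    return restarts


if __name__ == "__main__":
    print(count_restarts(400, max_restarts=1))     # limited: stops after one retry
    print(count_restarts(400, max_restarts=None))  # unlimited: restarts every time
```

With a limit of 1, a database that dies right after startup is retried once and then left halted for someone to look at, instead of cycling all night.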


Regards
Rainer

He's a real UNIX Man, sitting in his UNIX LAN making all his UNIX plans for nobody ...
Printaporn_1
Esteemed Contributor

Re: S/W fail over

Actually, you have just misunderstood the wording.
Hardware should be made redundant, for example by using a disk array.
In MCSG there is a standby LAN that provides local failover when the primary LAN fails: the package keeps running on the same machine, and the standby LAN becomes the active LAN.
If the primary node fails, the S/W package fails over to the adoptive node. This is S/W failover.
enjoy any little thing in my life
Lai Nee Shyang_1
Frequent Advisor

Re: S/W fail over

Hi People,

Thanks for your comments. They are really helpful.

Special thanks to Printaporn. You just cleared up another misunderstanding I had regarding MCSG definitions.

Thanks to all again.


Lai
: )
If it doesn't work, We'll make it work. If it works, We'll make it work better.