12-19-2005 08:05 PM
12-19-2005 08:09 PM
Re: One node shutdown on an OpenVMS clustered system
Welcome to the OpenVMS ITRC forum.
- Did you shut down the node with the REMOVE_NODE shutdown option?
- What are the settings of VOTES on the 2 nodes, and is a quorum disk in use?
If you've set up the cluster with 2 nodes with 1 vote each and EXPECTED_VOTES=2, and you shut down one node without REMOVE_NODE, the remaining node will NOT adjust quorum; it will hang, waiting for a second vote.
Volker.
12-19-2005 09:07 PM
Re: One node shutdown on an OpenVMS clustered system
Welcome from me as well.
Would you care to explain what exactly you mean by "secondary node" and by "Standalone shutdown"?
Is this a cluster with a boot node and a satellite, or two equivalent nodes?
In the first case, does the satellite have VOTES > 0?
In the latter case, is there a quorum disk?
As Volker asked, what are the values of VOTES (on both nodes) and of EXPECTED_VOTES (which should equal the sum of all votes, including QDSKVOTES, and be the same on every node)?
Most important, did you specify REMOVE_NODE as a shutdown option?
Please answer these, and we will be able to sort things out.
Success,
Proost.
Have one on me.
jpe
12-20-2005 02:48 AM
Re: One node shutdown on an OpenVMS clustered system
12-20-2005 01:46 PM
Re: One node shutdown on an OpenVMS clustered system
OK. I have a clustered system (OpenVMS 7.3.1) with 2 nodes: a primary and a secondary system. Both have their own IP addresses.
I did not use the REMOVE_NODE shutdown option. If I use the REMOVE_NODE option, will the secondary system rejoin the primary automatically when I boot it up again?
All I want to do is shut down ONLY the secondary system WITHOUT any impact on the primary system. Then, when I am ready, I want to rejoin the 2 systems as a cluster again.
Currently, when I use SHUTDOWN on the secondary system, the primary system hangs for some time (I am not sure how long it would last, so I have to restart the secondary machine).
Here is the SHOW CLUSTER output for both the primary and secondary systems.
View of Cluster from system ID 10241 node: MPRI 12-DEC-2005 15:55:12
+-------------------+---------+
| SYSTEMS | MEMBERS |
+--------+----------+---------+
| NODE | SOFTWARE | STATUS |
+--------+----------+---------+
| MPRI | VMS V7.3 | MEMBER |
| MSEC | VMS V7.3 | MEMBER |
+--------+----------+---------+
+-------------------------------------------------------------------------------
| CLUSTER
+--------+-----------+----------+---------+------------+-------------------+----
| CL_EXP | CL_QUORUM | CL_VOTES | QF_VOTE | CL_MEMBERS | FORMED | LA
+--------+-----------+----------+---------+------------+-------------------+----
| 4 | 3 | 4 | NO | 2 | 3-OCT-2005 15:39 | 8-
View of Cluster from system ID 10242 node: MSEC 12-DEC-2005 18:01:51
+-------------------+---------+
| SYSTEMS | MEMBERS |
+--------+----------+---------+
| NODE | SOFTWARE | STATUS |
+--------+----------+---------+
| MSEC | VMS V7.3 | MEMBER |
| MPRI | VMS V7.3 | MEMBER |
+--------+----------+---------+
+-------------------------------------------------------------------------------
| CLUSTER
+--------+-----------+----------+---------+------------+-------------------+----
| CL_EXP | CL_QUORUM | CL_VOTES | QF_VOTE | CL_MEMBERS | FORMED | LA
+--------+-----------+----------+---------+------------+-------------------+----
| 4 | 3 | 4 | NO | 2 | 3-OCT-2005 15:39 | 8-
The device info for MPRI and MSEC is shown below:
MPRI:
Device Device Error Volume Free Trans Mnt
Name Status Count Label Blocks Count Cnt
DSA100: Mounted 0 DATA1 53815929 202 2
$10$DKD0: (MPRI) Mounted 0 ALPHASYS_PRI 48014892 489 1
$10$DKD1: (MPRI) ShadowSetMember 0 (member of DSA100:)
$10$DQA0: (MPRI) Online 0
$10$DQA1: (MPRI) Online 1
$10$DQB0: (MPRI) Online 1
$10$DQB1: (MPRI) Online 1
$11$DKD0: (MSEC) Mounted 0 (remote mount) 1
$11$DKD1: (MSEC) ShadowSetMember 0 (member of DSA100:)
$11$DQA0: (MSEC) Online 0
$11$DQA1: (MSEC) Online 0
$11$DQB0: (MSEC) Online 0
$11$DQB1: (MSEC) Online 0
MSEC:
Device Device Error Volume Free Trans Mnt
Name Status Count Label Blocks Count Cnt
DSA100: Mounted 0 DATA1 53815860 1 2
$10$DKD0: (MPRI) Mounted 0 (remote mount) 1
$10$DKD1: (MPRI) ShadowSetMember 0 (member of DSA100:)
$10$DQA0: (MPRI) Online 0
$10$DQA1: (MPRI) Online 0
$10$DQB0: (MPRI) Online 0
$10$DQB1: (MPRI) Online 0
$11$DKD0: (MSEC) Mounted 0 ALPHASYS_SEC 47586126 629 1
$11$DKD1: (MSEC) ShadowSetMember 0 (member of DSA100:)
$11$DQA0: (MSEC) Online 0
$11$DQA1: (MSEC) Online 1
$11$DQB0: (MSEC) Online 1
$11$DQB1: (MSEC) Online 1
One more question: how do I know what other resources (besides disks) are shared between the 2 nodes?
Thanks.
12-20-2005 05:50 PM
Re: One node shutdown on an OpenVMS clustered system
It looks like both nodes have 2 VOTES each and there is no quorum disk. This makes EXPECTED_VOTES=4 and QUORUM=3. If you shut down one node WITHOUT the REMOVE_NODE option, the other node will hang with CL_VOTES = 2 < CL_QUORUM = 3 until you bring back the stopped node.
If you use REMOVE_NODE, CL_EXP will be reduced to 2 and CL_QUORUM will be reduced to 2, so the other node can continue.
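The arithmetic behind these numbers can be sketched as follows. This is only an illustration, assuming the usual OpenVMS Cluster quorum rule, quorum = (EXPECTED_VOTES + 2) / 2 with the fraction discarded; the function names `quorum` and `can_continue` are made up for this example and are not VMS commands.

```python
def quorum(expected_votes: int) -> int:
    """Quorum derived from EXPECTED_VOTES (assumed rule, integer division)."""
    return (expected_votes + 2) // 2


def can_continue(votes_present: int, expected_votes: int) -> bool:
    """The cluster keeps running only while the votes present reach quorum."""
    return votes_present >= quorum(expected_votes)


# Both nodes up: 2 + 2 votes present, EXPECTED_VOTES = 4.
print(quorum(4), can_continue(4, 4))

# MSEC shut down without REMOVE_NODE: expected votes stay at 4,
# but only MPRI's 2 votes are present.
print(quorum(4), can_continue(2, 4))

# MSEC shut down with REMOVE_NODE: expected votes drop to 2.
print(quorum(2), can_continue(2, 2))
```

The second case is the hang described above: 2 votes present against a quorum of 3. The third case shows why REMOVE_NODE lets the remaining node carry on.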
If you were able to use a quorum disk (a shared, non-shadowed disk directly accessible by both nodes; this does not seem possible in your config, which apparently only has local SCSI buses), you could do without the REMOVE_NODE option during shutdown, but it would increase your cluster state transition time a bit.
Did you know about the IPC interrupt for recalculating quorum on a system hung due to quorum loss?
It goes like this:
Press HALT to get to the console prompt
>>> D SIRR C
>>> C
IPC> Q
IPC>
Or you could use DECamds / Availability Manager and its Fix Quorum function.
The above is only necessary if you FORGOT the REMOVE_NODE option during shutdown or if one of the systems suddenly breaks down.
Besides the disks (and the files on them), the lock manager database is also shared between the 2 nodes.
Volker.
12-21-2005 06:52 PM
Re: One node shutdown on an OpenVMS clustered system
First, thank you very much for your valuable advice. I appreciate it.
If I would like to rejoin the secondary node (which was brought down using the REMOVE_NODE option) to the cluster, does that mean I have to do the below on the secondary machine? With the below, it will bring CL_VOTES back to 4, which will be greater than the quorum value (3), right?
Press HALT to get to the console prompt
>>> D SIRR C
>>> C
IPC> Q
IPC>
12-21-2005 06:57 PM
Re: One node shutdown on an OpenVMS clustered system
12-21-2005 08:06 PM
Solution... assuming you haven't made any SYSGEN parameter changes that might affect the cluster or the system that was just shut down.
Kind Regards
John.
12-22-2005 04:01 AM
Re: One node shutdown on an OpenVMS clustered system
As John already said, just boot the secondary node normally and it will join the cluster. The primary will then continue, because there are now enough votes to satisfy quorum.
The IPC interrupt could have been used to revive the primary node once it hung after the secondary node was shut down (without the REMOVE_NODE shutdown option).
Volker.