- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - OpenVMS
- >
- Clustering question
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-14-2007 08:12 AM
01-14-2007 08:12 AM
Clustering question
Node A is set up from the beginning as a cluster member, booting from [SYS0] over shared SCSI.
Node B is destined to be added into the cluster. When booted locallyand running CLUSTER_CONFIG_LAN on NodeB, it will join the cluster.
For booting it from the common system disk, I ran CLUSTER_CONFIG_LAN on node A, adding Node B. [SYS1] will then be created (has been verified) and next Node A requests Node B to boot (from [SYS1]). It will get up, trying to contact Node A to reauest formation of the cluster. Node A will confirm but for some reason, Node B will never accept it. At one point it does signal there it has contacted node A but forming the cluster will not happen.
When booted from it's local system disk, after having CLUSTER_AUTHORIZATIOn.DAT to the system, it will boot, but node A will not be contacted, and the system will start up as a single-node cluster.
The disks on the shared SCSI that are mounted by Node A, are seen as "remote mount" on node B, with multiple paths. So far, so good - as long as the disks are not mounted loaclly. If so, I get "disk offline" and "mount verification" messages on both systems.
The controller might be an issue - I have installed PZPBA-CY on both machines, but I tried KZPSA as well, with the same outcome.
Node A has two NIC's but one has been disabled after the system is started since it's not (yet) connected ($ NCL DISABLE CSMA STATION CMSMA-CD1). Node B has one NIC
I may have missed something - but what?
OpenVMS Developer & System Manager
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-14-2007 10:47 AM
01-14-2007 10:47 AM
Re: Clustering question
If it's shared SCSI, make sure you've got Port Allocation Classes enabled, and the ports for the shared bus with matching allocation classes on both systems.
The disks on the shared bus MUST have the same names on both systems. With PAC enabled, all SCSI drives will be named $a$DKAnnn where only "a" varies between busses. (of course, your applications only access disks via logical names, so any change in the physical name of a drive is easily dealt with, right?... ;-)
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-15-2007 08:05 AM
01-15-2007 08:05 AM
Re: Clustering question
I know - and this has been set up some time ago already. All disks accessed by NodeA (except for page- and swapfiles) are on the shared SCSI.
OpenVMS Developer & System Manager
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-15-2007 10:37 AM
01-15-2007 10:37 AM
Re: Clustering question
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-15-2007 11:24 AM
01-15-2007 11:24 AM
Re: Clustering question
Check sysgen parameters VOTES and EXPECTED_VOTES. A system will note join a cluster if it's EXPECTED_VOTES will cause quorum to be lost.
Andy
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-16-2007 04:36 AM
01-16-2007 04:36 AM
Re: Clustering question
I agree completely with the port allocation classes needing to be set up properly. When I work with them, I usually set up a cluster common file that has all of the settings for every node in the cluster in one SYS$DEVICES.DAT file and propagate that to every SYS$COMMON:[SYSEXE] in the cluster.
Robert
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-21-2007 01:49 AM
01-21-2007 01:49 AM
Re: Clustering question
Next, it won't boot anymore, see this file (second part, as yesterday, after ">>>init").
It _might_ have to do with the SCSI card (KZPBA-CY) that doesn't support shered SCSI, though the documentation isn't clear at all. Some state KZPSA (could try that but I don't have sufficient cards for all intended members), some state KZPBA-CB - where I use KZPBA-CY said by a collegue to be feasable for the job - his machines work with that as well - admitted though he wasnt sure..
A file containing output of NodeA (session running cluster_config_lan, and of an "operator terminal") and NodeB's console is attached.
OpenVMS Developer & System Manager
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-21-2007 06:40 AM
01-21-2007 06:40 AM
Re: Clustering question
(Two different names for the same physical disk is bad. Very bad. This could conceivably lead to disk data corruptions.)
The %x910 errors are File Not Found errors. That's not usually a good sign for a system disk.
I also see a mix of $2$ and $116$. Is this the same bus?
There's no gentle way to ask this: how current is your most recent system disk BACKUP?
Should the node start up as a single node cluster, then it is either the only node with votes (and all other nodes should be wedged), or the configuration is problematic and the settings for VOTES and EXPECTED_VOTES might well be incorrect.
I encountered the results of booting into a partitioned clusters -- where two nodes booted from the same system root on the same disk, and whomever had set it up had not set VOTES and EXPECTED_VOTES correctly -- and the disk corruptions were quite impressive. Each node thought it had quorum, could not reach the other node due to the duplicate network address, and both proceeded to write to the disks sans any and all coordination. The owner (and perpetrator of the incorrect settings) told me he ended up reloading the system disk.
And as for the hardware and based on a quick look, it looks like the KZPBA-CY controller will probably work for the configuration. I don't see immediate evidence that it's officially supported, however, nor do I know that it will work.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-21-2007 07:02 AM
01-21-2007 07:02 AM
Re: Clustering question
$116$DKA100 and $2$DKA100 are different disks: the first the one on shared scsi (116 comes straight from the book) and the other the local disk on NodeB. I think the system disk is not corrupted, backup is very recent (just a few days), and DKA100 - well, I don't really care, it's a newly installed, and hardly modified 8.3 version that can easily be rebuilt from scratch.)
I will dig into the prevention of a partitioned cluster some day, this is not a major issue at the moment. It's neither a problem to reboot all machines if required.
It's very well possible that NodeA will address some disks on shared SCSI and nodeB others - but the ability to 'take over' in case of emergency is wanted. It doesn't have to be automated now - some manual work is no problem.
So at this moment: no issue ;-)
OpenVMS Developer & System Manager
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
01-21-2007 08:17 AM
01-21-2007 08:17 AM
Re: Clustering question
OpenVMS Developer & System Manager