
How to increase the filesystem which is under redhat cluster directly?

 
bullz
Super Advisor

How to increase the filesystem which is under redhat cluster directly?

Hello Experts,

I got an activity to increase the size of a file system on RHEL 5.2:

/dev/mapper/xyz1 179G 59G 111G 35% /xyz

1) As per my observation, the server is configured with Red Hat Cluster; if this mount point goes down, it will fail over to the alternative node.
2) /xyz is not in fstab; I understand that this is controlled by the cluster.
3) The storage is from the centralized HP storage; I can recognize these LUNs via the command multipath -l (on both nodes).


• In case I want to increase the file system size, is it possible to do it online, after increasing the LUN from the storage end?
• Since it's a cluster with another node, what are the changes to be made at the other server?
• Do you have any other method to carry out this activity? Please throw some light on this from your experience.
Matti_Kurkela
Honored Contributor

Re: How to increase the filesystem which is under redhat cluster directly?

Your anonymized device name /dev/mapper/xyz1 suggests this might not be an LVM PV, but a traditional PC-style partition. If that's true, you can extend partition /dev/mapper/xyz1 online only if no other partitions exist on the LUN.

If you don't have LVM, you will have to edit the LUN's partition table while the LUN is mounted, which is always a little bit scary to me.

With LVM, it would be possible to add a completely new LUN to the clustered volume group, which would be safer than extending the existing LUN.
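If LVM were in use, that expansion would look roughly like this (a sketch only; the PV/VG/LV names are made-up examples, and with a clustered VG the clvmd daemon must be running on all nodes so the metadata change propagates):

pvcreate /dev/mapper/newlun              # initialize the new LUN as an LVM PV
vgextend clustervg /dev/mapper/newlun    # add it to the clustered VG
lvextend -L +50G /dev/clustervg/xyzlv    # grow the LV (here: by 50 GB)
resize2fs /dev/clustervg/xyzlv           # grow the ext3 filesystem online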

Anyway, the procedure would be something like this:

1.) Extend the LUN in the storage system.

2.) Run "partprobe" on both nodes and see if the nodes can detect the new LUN size. If the new size cannot be detected, you may have to reboot the cluster.

3.) Go to the node that has the filesystem mounted, and extend the partition with parted/cfdisk/fdisk/your favorite partitioning tool.
Be *extremely* careful with this step!

4.) Run "partprobe" on both nodes and verify the nodes can see the new partition size (using "cat /proc/partitions" or whatever)

5.) To make the filesystem use the new space, run the filesystem resizer tool appropriate for your filesystem type.

For example, if it's an ext3 filesystem, run:

resize2fs /dev/mapper/xyz1

6.) Done!
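(Steps 2-5 as a command sketch, with "xyz" as the whole-LUN multipath device and "xyz1" as partition 1; adjust the names to your system:)

partprobe                       # both nodes: re-read the device sizes
cat /proc/partitions            # verify the new LUN size is visible
fdisk /dev/mapper/xyz           # ONE node only: delete partition 1 and
                                #   re-create it with the SAME starting
                                #   sector and a larger end -- risky!
partprobe                       # both nodes: re-read the partition table
resize2fs /dev/mapper/xyz1      # grow the ext3 filesystem online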

MK
Steven E. Protter
Exalted Contributor

Re: How to increase the filesystem which is under redhat cluster directly?

Shalom,

I recommend adding a new LUN to expand file systems. It avoids reboots, as Linux sometimes has trouble recognizing a size change on an existing LUN.

SEP
Steven E Protter
Owner of ISN Corporation
http://isnamerica.com
http://hpuxconsulting.com
Sponsor: http://hpux.ws
Twitter: http://twitter.com/hpuxlinux
Founder http://newdatacloud.com
bullz
Super Advisor

Re: How to increase the filesystem which is under redhat cluster directly?

But how can it help in this case?

The existing mount point is not under LVM :(

Experts,

I have a very strong doubt:

multipath -l shows me the WWN number, but it got converted to /dev/mapper/xyz1 by LVM?

Will it be from CLVM (cluster LVM)?
I can see that some clvm RPM is installed, but I am not aware of its configuration and commands.

* It's a Red Hat cluster on both nodes.
Matti_Kurkela
Honored Contributor
Solution

Re: How to increase the filesystem which is under redhat cluster directly?

> multipath -l shows me the WWN number, but it got converted to /dev/mapper/xyz1 by LVM?

/dev/mapper/xyz1 does not look like an LVM device name. An LVM device name usually has two parts: the first is the Volume Group name, the second is the Logical Volume name. They combine like this:

/dev/mapper/<vgname>-<lvname>

A traditional PC-style partition on a multipathed device is normally named like this:

/dev/mapper/<WWID or alias>p<partition number>

Is "xyz1" the real name of the device, or have you censored some information? I understand the requirement to keep the actual filesystem names secret, but the name format includes some important details about how the filesystem is set up.

It might be that the system has been configured to add a custom name "xyz1" for the multipath device - if this is the case, the association between the WWID and the name "xyz1" is configured using either /etc/multipath.conf or /var/lib/multipath/bindings.

If that's true, there might be neither LVM nor a partition table on the LUN, just a filesystem on the whole-LUN device. This is not a very good practice, as it makes storage migration rather more difficult, but the filesystem expansion should be relatively simple:

1.) extend the LUN from the storage end
2.) run partprobe on all nodes, verify that they all see the new size of the LUN
3.) run the filesystem extension tool (resize2fs or whatever is appropriate for the filesystem type) on the node that currently has the filesystem active.
4.) complete!
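(As commands, the whole-LUN case would look something like this sketch:)

partprobe                      # all nodes, after the LUN has been grown
cat /proc/partitions           # confirm the new size is visible everywhere
resize2fs /dev/mapper/xyz1     # on the node where the filesystem is mounted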

If you're still uncertain, run "pvs": if it does not list /dev/mapper/xyz1, then xyz1 is not a Linux LVM physical volume (neither CLVM nor plain LVM).

Then run "dmsetup ls --tree".
If /dev/mapper/xyz1 is just a whole-LUN multipath device, it should display as a two-level hierarchy like this:

xyz1 (253:<minor>)
 |- (<major>:<minor>)
 \- (<major>:<minor>)

The numbers within the parentheses are major:minor device number pairs. If the major number is 8, they are /dev/sd* devices.

If there is a partition table or LVM is used on the LUN, there will be a three-level hierarchy:

xyz1p1 (253:<minor>)
 \- xyz1 (253:<minor>)
    |- (<major>:<minor>)
    \- (<major>:<minor>)

i.e. the top level is the partition, the second level is the whole-LUN multipath device, and the third level includes all the individual /dev/sd* paths to the LUN.

MK
bullz
Super Advisor

Re: How to increase the filesystem which is under redhat cluster directly?

Hey MK,

Amazing. I wish I could give you more points; unfortunately, there is no option for it. I really wonder at your ability.

BTW, let me come to the point.

Storage: HP MSA 2000
OS:- RHEL 4.2

I get the output as follows.

xyzp1 (253:20)
 \- FERSIT (253:18)
    |- (8:80)
    \- (8:32)
abcp1 (253:19)
 \- FERDBS (253:17)
    |- (8:64)
    \- (8:16)

So I understand that the LUNs are under some manager's control, but I am still unable to dig out how they got converted from LUNs to /dev/mapper/***

I verified very well that it's not from local LVM (as far as I can see in the lvs/vgs commands).


Option 1)
Thank you so much for your opinion on expanding the volume at the storage and OS ends. But users feel that the extend process on the MSA is very, very slow, and there is risk involved.

Option 2)
Present an additional LUN and move some data to the new volume.

In that case, can you explain how we can bring the new disk under cluster control? I am able to find only the file /etc/cluster/cluster.conf, which is related to the cluster setup.
Matti_Kurkela
Honored Contributor

Re: How to increase the filesystem which is under redhat cluster directly?

xyzp1 (253:20)
 \- FERSIT (253:18)
    |- (8:80)
    \- (8:32)

The top level looks like a partition (...p1), then there is a whole-LUN multipath device FERSIT, and finally two /dev/sd* paths to the LUN.

(8:80) is /dev/sdf and (8:32) is /dev/sdc.
Run "ls -l /dev/sd*" to see the major and minor numbers of disk devices. These numbers are systematic and will use the same numbering sequence in most Linux systems.

> but still I am unable to dig it, how it got converted from LUN to /dev/mapper/***

Look into /etc/multipath.conf. Is there something like this?
-----
multipaths {
    multipath {
        wwid <WWID of the LUN>
        alias FERSIT
    }
}
-----
That would assign the custom name /dev/mapper/FERSIT to a multipath device with a given WWID.

The same effect can be achieved by editing the /var/lib/multipath/bindings file, but /etc/multipath.conf is the recommended way to do it.

In /var/lib/multipath/bindings, the name would be specified simply like:

FERSIT <WWID of the LUN>
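If you need to look up the LUN's WWID in the first place, on RHEL 4/5 something like this should print it (here /dev/sdc stands for one of the paths to the LUN):

scsi_id -g -u -s /block/sdc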

Now we know that the name of the whole-LUN device is /dev/mapper/FERSIT. The next step is to verify its partition table. Please run this command:

fdisk -l /dev/mapper/FERSIT

I still don't know why the partition-level name in the dmsetup listing was "xyzp1": the default name would be FERSITp1. (Or if you were trying to censor something, I think I just managed to reconstruct the censored information :-)

In general, the system builds up a device like this in the following sequence:

1.) The SAN HBA driver detects the LUNs visible to each HBA and presents them as /dev/sd* devices. Each path to the LUN gets its own device node: this layer does not care about multipathing at all.

2.) The dm-multipath subsystem detects that /dev/sdc and /dev/sdf both have the same WWID, and sets up a multipath device for it. The multipath configuration includes a custom name FERSIT for the multipath device, so the multipath device is named /dev/mapper/FERSIT.

3.) As the new multipath device is created, it causes a udev event. The udev/hotplug subsystem will detect that /dev/mapper/FERSIT seems to contain a partition table, and automatically runs "kpartx -a /dev/mapper/FERSIT" to create corresponding multipath devices for each partition found.

The default name for the first partition on /dev/mapper/FERSIT would be /dev/mapper/FERSITp1; if this partition device has a different name, there is probably a udev rule somewhere in /etc/udev/rules.d/ that assigns the custom name to the partition. Run "grep FERSIT /etc/udev/rules.d/*" to find it.

> But users feel that the extend process on the MSA is very, very slow, and there is risk involved.

You probably mean the LUN extension process? The MSA does not care about the filesystem: it extends only the raw LUN. The filesystem extension is done by the "resize2fs" (or whatever) command on the host.

Nevertheless, your users may be right. A more common strategy would be to present a new LUN from the MSA. If LVM were used, it would be easy to add the new LUN to the existing VG, and then extend the existing LVs as necessary.

> I am able to find only the file /etc/cluster/cluster.conf, which is related to the cluster setup.

Yes, that is the only configuration file for the RedHat Cluster Suite (at least for RHEL 5.x; the Cluster Suite for RHEL 4.x might have been different). See the RedHat Cluster Suite documentation.

--------

I would recommend RedHat's "RedHat Enterprise Clustering and Storage Management" course for you. The course code is RH436. It includes 4 days of lectures and labs, and optionally a certification exam (code EX436) on the 5th day.

I sat the course last November, and it covered much of the information required to answer your questions in this thread. But if you want to take the EX436 exam, you must get the basic RHCE certification first: you cannot take the exam unless you already have an RHCE certification.

MK
bullz
Super Advisor

Re: How to increase the filesystem which is under redhat cluster directly?

Perfect, MK. Thanks for your inputs as always.

I have a few things that need to be clarified.

Current setting:

2 nodes are in a Red Hat cluster; each node has one mount point, available in the cluster configuration.

Node 1: /XYZ
Node 2: /ABC

New mount point: /ABC1

The request is for one more mount point, which needs to be added as part of a cluster resource.

1) Can I edit /etc/cluster/cluster.conf while freezing the cluster service?
2) I have attached a txt with the new configuration added to /etc/cluster/cluster.conf; please see if the changes are OK.
3) Do I need to restart the cluster service after editing the configuration file?


Note: I can get a time window for this activity.
bullz
Super Advisor

Re: How to increase the filesystem which is under redhat cluster directly?

A minor change:

Can I edit /etc/cluster/cluster.conf without freezing the cluster service?
Matti_Kurkela
Honored Contributor

Re: How to increase the filesystem which is under redhat cluster directly?

1.) You should never edit the cluster configuration file directly when the cluster is running.

Instead, make a copy of the cluster configuration file:

cp /etc/cluster/cluster.conf /tmp/cluster.conf.new

Make your changes to the new copy:

vi /tmp/cluster.conf.new

Increment the configuration version number (on the 2nd line of the cluster.conf file)

... config_version="18" ...
=>
... config_version="19" ...

Then run this command to apply the update to all cluster members simultaneously:

ccs_tool update /tmp/cluster.conf.new
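Afterwards, you can verify which configuration version each node is actually using with something like:

cman_tool status | grep -i version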

2.) You should not re-use the same fsid number with two different filesystems.

(The fsid is essentially a random number that is important if the filesystem is exported over NFS. If two exported filesystems have the same fsid number, the NFS clients might get confused. If you aren't NFS-exporting the filesystem, it's not such a big deal, but it's bad practice to create duplicate fsids.)

Other than that, it looks OK to me.

3.) The safest choice would be to stop the service before doing the "ccs_tool update" command, then restart the server and do a test failover, so that you have positive proof that the new configuration works.

It *may* be possible to add the filesystem without stopping & restarting the service, but I would not recommend it.

MK
bullz
Super Advisor

Re: How to increase the filesystem which is under redhat cluster directly?

Hey MK,

A small query.

"Then run this command to apply the update to all cluster members simultaneously:

ccs_tool update /tmp/cluster.conf.new"

As you said, the above is to update cluster.conf on all the cluster nodes simultaneously; but in point 3, you recommended stopping the service before doing the "ccs_tool update" command, then restarting the server and doing a test failover, so that there is positive proof that the new configuration works.

1) Stop the service. Do you mean cluster service?

service qdiskd stop
service rgmanager stop
service fenced stop
service cman stop
service ccsd stop

and

service ccsd start
service qdiskd start
service cman start
service fenced start
service rgmanager start


Also, FYI, I do not have ccsd and fenced in /etc/init.d/. Are these startup scripts located anywhere else?

I could figure out that the processes are running.

# ps -ef | grep -i ccsd
root 3830 1 0 2009 ? 00:43:18 /sbin/ccsd
root 13005 7384 0 08:24 pts/0 00:00:00 grep -i ccsd
#ps -ef | grep -i fence
root 3855 1 0 2009 ? 00:00:00 /sbin/fenced
root 13074 7384 0 08:24 pts/0 00:00:00 grep -i fence

2) As per my understanding: if I restart the cluster service without syncing cluster.conf on the other cluster node via "ccs_tool update /tmp/cluster.conf.new" after adding my new file system to the configuration file, how will my new filesystem fail over to the other cluster node, since the new filesystem configuration is missing on that node?

Please correct me if I am wrong.

3) By default, if I shut down the cluster service on one node, the failover will happen automatically, right?
4) "then restart the server and do a test failover": do you mean a server restart?
5) After editing the /var/lib/multipath/bindings file, can I run "/etc/init.d/multipath restart" to change the name alias?
6) If I freeze the cluster service and then do this, will the activity go smoothly?



Also, please find the plan below. Please give your point of view on how we can proceed further.


• Assign the LUNs from the storage to both servers
• Do multipath -l and make sure the assigned LUNs are visible
• Do the name alias
• Partition
• Mount it
• Freeze the cluster service (if required) please advise ***
• Edit /etc/cluster/cluster.conf.new
• ccs_tool update /tmp/cluster.conf.new
• Restart the cluster service (if required) please advise ***
• Do a test failover and check if the new filesystem fails over to the other cluster node please advise ***
Matti_Kurkela
Honored Contributor

Re: How to increase the filesystem which is under redhat cluster directly?

1.) Sorry, I tried to stay with RedHat Cluster Suite terminology.

The RedHat cluster documentation uses the words "cluster service" (or "service" for short) for what is known as "package" in Serviceguard.

So I meant: keep the cluster infrastructure services (qdiskd, rgmanager, fenced, cman, ccsd...) running, but stop the cluster service/package that uses the filesystem(s). In other words, I meant running "clusvcadm -d ABC".

But I looked at your configuration again, and now I see a problem: you're using the new filesystem to define a completely new service (=package). You're also trying to use the same IP with two different services. This won't work.

If the new filesystem is intended to extend the old service/package, you should add it to the existing service block, instead of adding a new service: otherwise there will be no guarantee that the filesystems will fail over together.






For example, roughly like this (schematic only; I don't know your real resource names, so adjust them to your configuration):

<service autostart="1" name="ABC">
    <ip ref="x.x.x.x"/>
    <fs ref="ABC"/>
    <fs ref="ABC1"/>
    <script ref="script_ABC"/>
</service>




In this case, you should stop the service/package ABC before running "ccs_tool update".

If you're trying to create a completely new service/package, you should pick a unique IP address for it.


2.) You're right: it won't.
If you edit /etc/cluster/cluster.conf directly, you *must* shut down the entire cluster first, and you *must* perform exactly the same edits in each node. But this is not recommended.

The recommended way is to use the "ccs_tool update" while the cluster is running (but the packages may be up or down, depending on the situation). The ccs_tool will perform some sanity checks, and will coordinate the configuration change over the entire cluster.

The internal procedure of "ccs_tool update" will work a bit like this:

ccs_tool: "Attention all nodes. Here's a new configuration file, version 19. Do not use it yet; just store it, check it and acknowledge if it looks OK."
node 1: "OK."
node 2: "OK."
ccs_tool: "All nodes have acknowledged, OK."
(If even one node does not acknowledge the change, the change is stopped and the new configuration won't take effect.)
ccs_tool: "Attention all nodes. Prepare to start using configuration version 19. The new configuration shall take effect in 3... 2... 1... NOW."

When a cluster node rejoins a cluster, it checks its configuration file version. If the majority of the cluster has a newer version than this node, the node will receive an updated configuration from the other cluster members before starting cluster operations.

If the majority of the cluster has an _older_ version than the rejoining node, the rejoin process will be stopped and an error message will be logged, saying that the configuration file is out of sync. It will be the responsibility of the cluster administrator to verify the correct configuration file version and to downgrade the configuration in the single node before rejoining it to the rest of the cluster.

If the configuration file version is identical, but the content of the configuration file is different... the cluster will be very confused, and might not work sensibly at all.

3.) Yes... and the other node will typically fence the node that has abruptly stopped the cluster communication. In other words, if you just kill/stop the cluster daemons without using the correct procedure, the node may be powered off suddenly, with no warning.

4.)
Assuming that the package ABC normally runs on database01, I meant something like running:

clusvcadm -r ABC -m database02

clusvcadm -r ABC -m database01


A complete server restart would certainly achieve the same thing, but I think it would be overkill.

5.) The recommended way to create name aliases would be to use /etc/multipath.conf instead of /var/lib/multipath/bindings... but yes, you can and should restart the multipath service ("service multipathd restart") after changing either of those files.

6.) "Freezing" the cluster service means only that the failover and auto-start actions are blocked for the service. When you're modifying a cluster service, it is recommended to shutdown that cluster service first ("clusvcadm -d ABC").


Updated plan:

- Assign the LUNs from the storage to both servers

- Do multipath -l and make sure the assigned LUNs are visible (on both servers)

- Do name alias. Run "service multipathd restart". (on both servers)

- Partition (on one server only)

- Run "partprobe" and verify the server sees the new partitioning (on the other server)

- Create the mountpoint directory (on both servers)

- Edit /tmp/cluster.conf.new (on one server only). Don't forget to increment the configuration version number.

- Shutdown the cluster service: "clusvcadm -d ABC"

- Update cluster configuration: "ccs_tool update /tmp/cluster.conf.new" (on one server only: will automatically update both servers)

- restart the cluster service "clusvcadm -e ABC"
* the new filesystem should be mounted automatically, and the application should start successfully.

- do a test failover:
* clusvcadm -r ABC -m database02
* check that the cluster service is running successfully on database02
* clusvcadm -r ABC -m database01
* check that the cluster service is again running successfully on database01

MK
bullz
Super Advisor

Re: How to increase the filesystem which is under redhat cluster directly?

Great MK.

I am almost done. 99%!

Now I realize my mistake: my previous configuration was something that would create an additional service.
I have now attached the recent one, as per your advice. I hope it's good to go. Please advise me.

I feel the below line is necessary for the reference, apart from the service configuration.

<fs ... options="" self_fence="0"/>

Do I need to sync the updated version with "cman_tool version -r v18"? (or) Will it be taken care of by "ccs_tool update /etc/cluster/cluster.conf.new"?
bullz
Super Advisor

Re: How to increase the filesystem which is under redhat cluster directly?

Hey MK,

Any luck?