GreenLake Administration
- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - HP-UX
- >
- SG problem?
Operating System - HP-UX
1846372
Members
4467
Online
110256
Solutions
Forums
Categories
Company
Local Language
back
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Forums
Discussions
back
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Blogs
Information
Community
Resources
Community Language
Language
Forums
Blogs
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
06-08-2001 05:50 AM
06-08-2001 05:50 AM
SG problem?
I got a bunch of messages from syslog in every 30 mins.
Jun 8 08:12:49 cmcld[2549]: Attempting to adjust cluster membership
Jun 8 08:12:52 cmcld[2549]: Obtaining Cluster Lock
Jun 8 08:12:53 cmcld[2549]: Turning off safety time protection since the cluster
Jun 8 08:12:53 cmcld[2549]: now consists of a single node. If ServiceGuard
Jun 8 08:12:53 cmcld[2549]: fails, this node will not automatically halt
This is my configuration. I have one pri lan, one standby lan, and another to be crossover lan between two nodes. I brought pri lan to be a heartbeat also. I have do nothing to crossover lan.
flags: 12 (single cluster lock)
heartbeat interval: 1.00 (seconds)
node timeout: 2.00 (seconds)
heartbeat connection timeout: 4.00 (seconds)
auto start timeout: 600.00 (seconds)
network polling interval: 2.00 (seconds)
Someone told me to tune up node timeout from 2 seconds to 5+ seconds. I agreed with this point that will elimiate the syslog message. However, I would like to configure the crossover lan cable to have an another heartbeat running. Is it possible? If so, I still want to keep node timeout to 2 second, and add crossover to have an second heartbeat, does it eliminate the problem?
SG experts, pl let me know.
Boonchu Ngampairoijpibul
Jun 8 08:12:49
Jun 8 08:12:52
Jun 8 08:12:53
Jun 8 08:12:53
Jun 8 08:12:53
This is my configuration. I have one pri lan, one standby lan, and another to be crossover lan between two nodes. I brought pri lan to be a heartbeat also. I have do nothing to crossover lan.
flags: 12 (single cluster lock)
heartbeat interval: 1.00 (seconds)
node timeout: 2.00 (seconds)
heartbeat connection timeout: 4.00 (seconds)
auto start timeout: 600.00 (seconds)
network polling interval: 2.00 (seconds)
Someone told me to tune up node timeout from 2 seconds to 5+ seconds. I agreed with this point that will elimiate the syslog message. However, I would like to configure the crossover lan cable to have an another heartbeat running. Is it possible? If so, I still want to keep node timeout to 2 second, and add crossover to have an second heartbeat, does it eliminate the problem?
SG experts, pl let me know.
Boonchu Ngampairoijpibul
Boonchu Ngampairoijpibul
2 REPLIES 2
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
06-08-2001 06:14 AM
06-08-2001 06:14 AM
Re: SG problem?
Hi:
The general guideline is to keep the NODE_TIMEOUT at, or slightly above 5-seconds. This gives the 'cmcld' daemon reasonable assurance of getting processor cycles to accomodate its needs.
...JRF...
The general guideline is to keep the NODE_TIMEOUT at, or slightly above 5-seconds. This gives the 'cmcld' daemon reasonable assurance of getting processor cycles to accomodate its needs.
...JRF...
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
06-08-2001 06:30 AM
06-08-2001 06:30 AM
Re: SG problem?
The widely agreed opinion is that NODE_TIMEOUT should be in the range between 5-8 seconds. This is a general statement which is independent from you SG configuration. The reason why we recommend this, is that we feel that this setting will effectively avoid cluster reformations triggered by short hic-ups of the system (system hangs that starve out SG's main process cmcld from getting CPU), but still guarantees a short failover time.
We do not recommend to increase the NODE_TIMEOUT beyond 8s, even if the cluster still runs into reformations. In this case it is definitely necessary to identify the root cause of the problem.
From the messages you get, we cannot deduce the cause of the cluster reformation. Basically we deal with the following problems:
1) network problems: cmcld doesn't receive heartbeats and can therefore not update the safety timer
2) system hangs and related: cmcld doesn't get CPU time to update the safety timer
Adding a private heartbeat network (Yes, crossover cables ARE supported!) helps for problems of the category 1 only.
If cmcld is not getting CPU time (category 2), the new heartbeat network will not help.
My recommendation is to do both: Adding the crossover lan and to increase NODE_TIMEOUT.
Carsten
We do not recommend to increase the NODE_TIMEOUT beyond 8s, even if the cluster still runs into reformations. In this case it is definitely necessary to identify the root cause of the problem.
From the messages you get, we cannot deduce the cause of the cluster reformation. Basically we deal with the following problems:
1) network problems: cmcld doesn't receive heartbeats and can therefore not update the safety timer
2) system hangs and related: cmcld doesn't get CPU time to update the safety timer
Adding a private heartbeat network (Yes, crossover cables ARE supported!) helps for problems of the category 1 only.
If cmcld is not getting CPU time (category 2), the new heartbeat network will not help.
My recommendation is to do both: Adding the crossover lan and to increase NODE_TIMEOUT.
Carsten
-------------------------------------------------------------------------------------------------
In the beginning the Universe was created. This has made a lot of people very angry and been widely regarded as a bad move. -- HhGttG
In the beginning the Universe was created. This has made a lot of people very angry and been widely regarded as a bad move. -- HhGttG
The opinions expressed above are the personal opinions of the authors, not of Hewlett Packard Enterprise. By using this site, you accept the Terms of Use and Rules of Participation.
Company
Events and news
Customer resources
© Copyright 2026 Hewlett Packard Enterprise Development LP