HPE GreenLake Administration
- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - HP-UX
- >
- Re: Problem with Cluster
Operating System - HP-UX
1833757
Members
3056
Online
110063
Solutions
Forums
Categories
Company
Local Language
back
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
back
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Blogs
Information
Community
Resources
Community Language
Language
Forums
Blogs
Topic Options
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-05-2010 12:51 AM
05-05-2010 12:51 AM
Problem with Cluster
Hi. I have two node in Cluster Service Guard.
I had change IP address of interface NODE1.
I did this step by step.
1.Stop the cluster that you are running for reconfiguring from old to new IP address.
# cmhaltcl -f
2.change proper the IP address for heartbeat in /etc/cmcluster/cluster.ascii ( cluster configuration file)
# vi /etc/cmcluster/cluster.ascii
--------------------------------------------
NETWORK_INTERFACE lan1
HEARTBEAT_IP xxx.xxx.xxx.xxx <- new
FIRST_CLUSTER_LOCK_PV /dev/dsk/c7t0d4
--------------------------------------------
3. Check the changed configuration file in the cluster.
# cmcheckconf -C /etc/cmcluster/cluster.ascii
4. copy and apply for making binary file for taking effect
# cmapplyconf -C /etc/cmcluster/cluster.ascii
5. run the cluster and monitor the control logs.
# cmruncl
When cluster start TWO NODE will rebooted.
After NODE1 will rebooted once again.
Log NODE1 (When i change ip address):
17:54 Tue May 04 2010. Reboot after panic: SafetyTimer expired, INIT, IIP:0x00000707fc4a2b60 IFA:0xe0000001205cfd28
18:04 Tue May 04 2010. Reboot after panic: SafetyTimer expired, INIT, IIP:0x00000707fc4b0910 IFA:0xe0000001205cfd28
_______________________________________________
Message from syslogd@NODE1 at Tue May 4 17:49:54 2010 ...
vparcher cmcld[9565]: Halting vparcher to preserve data integrity
May 4 17:49:54 vparcher cmcld[9565]: Reason: A crucial package failed
May 4 17:49:54 vparcher cmcld[9565]: Reason: A crucial package failed
Message from syslogd@NODE1 at Tue May 4 17:49:54 2010 ...
NODE1 cmcld[9565]: Reason: A crucial package failed
INIT occurs.
INIT: make crash event table.
INIT: Waiting for processors to save state.
INIT: Invoking callbacks.
Calling function e00000000160c700 for Shutdown State 9 type 0x10
Calling function e0000000020304e0 for Shutdown State 9 type 0x10
SafetyTimer expired, INIT, IIP:0x00000707fc4a2b60 IFA:0xe0000001205cfd28
INIT: Executing platform dependent procedures.
INIT: Begin crashdump.
i 0 pfn 0x1080000 pages 0x7cdd4
i 1 pfn 0x10fce7c pages 0x172
i 2 pfn 0x1100000 pages 0x180000
i 3 pfn 0x1780000 pages 0x200000
*** Not enough CPUS for a compressed dump ***
*** A system crash has occurred. (See the above messages for details.)
*** The system is now preparing to dump physical memory to disk, for use
*** in debugging the crash.
*** The dump will be a SELECTIVE dump with
compression OFF and concurrency ON: 2067 of 16350 megabytes.
*** To change this dump type, press any key within 10 seconds.
*** Proceeding with selective dump, with compression off and concurrency on.
Primary Dump Header Location :
Device details:
Major number: 31 Minor number:0x30100
Offset: 2349920.
*** The dump may be aborted at any time by pressing ESC.
*** Dumping: 100% complete (2067 of 2067 MB)
time: 35 seconds, Number of Dump units: 1
INIT[0]: OS_INIT ends. Resetting the system.
Initializing IO Devices ...
LBA Cell 03 (12): Occupied PCI-X 133MHz
Scan PCI:
Rope Slot Seg Bus Dev Fun Card
====================================================================
12 08 0x39 0x00 0x01 0x00 PCI Bridge (0x01a7,0x1014)
12 08 0x39 0x01 0x04 0x00 Ethernet (0x1079,0x8086)
12 08 0x39 0x01 0x04 0x01 Ethernet (0x1079,0x8086)
12 08 0x39 0x01 0x06 0x00 Ethernet (0x1079,0x8086)
12 08 0x39 0x01 0x06 0x01 Ethernet (0x1079,0x8086)
LBA Cell 03 (04): Occupied PCIe x8
Scan PCI:
Rope Slot Seg Bus Dev Fun Card
====================================================================
04 03 0x33 0x00 0x00 0x00 PCIe Root Port (0x403b,0x103c)
04 03 0x33 0x01 0x00 0x00 Fibre Channel (0x2532,0x1077)
LBA Cell 03 (02/03): Occupied PCIe x8
Scan PCI:
Rope Slot Seg Bus Dev Fun Card
====================================================================
02 02 0x32 0x00 0x00 0x00 PCIe Root Port (0x403b,0x103c)
02 02 0x32 0x01 0x00 0x00 Fibre Channel (0x2532,0x1077)
LBA Cell 03 (00): Occupied PCI 33MHz
Scan PCI:
Rope Slot Seg Bus Dev Fun Card
====================================================================
00 00 0x30 0x00 0x01 0x00 Network (0xb921,0x1133)
Complete
Log NODE2
17:57 Tue May 04 2010. Reboot after panic: SafetyTimer expired, INIT, IIP:0x00000707fc4a2b60 IFA:0xe0000001205cfd28
I had change IP address of interface NODE1.
I did this step by step.
1.Stop the cluster that you are running for reconfiguring from old to new IP address.
# cmhaltcl -f
2.change proper the IP address for heartbeat in /etc/cmcluster/cluster.ascii ( cluster configuration file)
# vi /etc/cmcluster/cluster.ascii
--------------------------------------------
NETWORK_INTERFACE lan1
HEARTBEAT_IP xxx.xxx.xxx.xxx <- new
FIRST_CLUSTER_LOCK_PV /dev/dsk/c7t0d4
--------------------------------------------
3. Check the changed configuration file in the cluster.
# cmcheckconf -C /etc/cmcluster/cluster.ascii
4. copy and apply for making binary file for taking effect
# cmapplyconf -C /etc/cmcluster/cluster.ascii
5. run the cluster and monitor the control logs.
# cmruncl
When cluster start TWO NODE will rebooted.
After NODE1 will rebooted once again.
Log NODE1 (When i change ip address):
17:54 Tue May 04 2010. Reboot after panic: SafetyTimer expired, INIT, IIP:0x00000707fc4a2b60 IFA:0xe0000001205cfd28
18:04 Tue May 04 2010. Reboot after panic: SafetyTimer expired, INIT, IIP:0x00000707fc4b0910 IFA:0xe0000001205cfd28
_______________________________________________
Message from syslogd@NODE1 at Tue May 4 17:49:54 2010 ...
vparcher cmcld[9565]: Halting vparcher to preserve data integrity
May 4 17:49:54 vparcher cmcld[9565]: Reason: A crucial package failed
May 4 17:49:54 vparcher cmcld[9565]: Reason: A crucial package failed
Message from syslogd@NODE1 at Tue May 4 17:49:54 2010 ...
NODE1 cmcld[9565]: Reason: A crucial package failed
INIT occurs.
INIT: make crash event table.
INIT: Waiting for processors to save state.
INIT: Invoking callbacks.
Calling function e00000000160c700 for Shutdown State 9 type 0x10
Calling function e0000000020304e0 for Shutdown State 9 type 0x10
SafetyTimer expired, INIT, IIP:0x00000707fc4a2b60 IFA:0xe0000001205cfd28
INIT: Executing platform dependent procedures.
INIT: Begin crashdump.
i 0 pfn 0x1080000 pages 0x7cdd4
i 1 pfn 0x10fce7c pages 0x172
i 2 pfn 0x1100000 pages 0x180000
i 3 pfn 0x1780000 pages 0x200000
*** Not enough CPUS for a compressed dump ***
*** A system crash has occurred. (See the above messages for details.)
*** The system is now preparing to dump physical memory to disk, for use
*** in debugging the crash.
*** The dump will be a SELECTIVE dump with
compression OFF and concurrency ON: 2067 of 16350 megabytes.
*** To change this dump type, press any key within 10 seconds.
*** Proceeding with selective dump, with compression off and concurrency on.
Primary Dump Header Location :
Device details:
Major number: 31 Minor number:0x30100
Offset: 2349920.
*** The dump may be aborted at any time by pressing ESC.
*** Dumping: 100% complete (2067 of 2067 MB)
time: 35 seconds, Number of Dump units: 1
INIT[0]: OS_INIT ends. Resetting the system.
Initializing IO Devices ...
LBA Cell 03 (12): Occupied PCI-X 133MHz
Scan PCI:
Rope Slot Seg Bus Dev Fun Card
====================================================================
12 08 0x39 0x00 0x01 0x00 PCI Bridge (0x01a7,0x1014)
12 08 0x39 0x01 0x04 0x00 Ethernet (0x1079,0x8086)
12 08 0x39 0x01 0x04 0x01 Ethernet (0x1079,0x8086)
12 08 0x39 0x01 0x06 0x00 Ethernet (0x1079,0x8086)
12 08 0x39 0x01 0x06 0x01 Ethernet (0x1079,0x8086)
LBA Cell 03 (04): Occupied PCIe x8
Scan PCI:
Rope Slot Seg Bus Dev Fun Card
====================================================================
04 03 0x33 0x00 0x00 0x00 PCIe Root Port (0x403b,0x103c)
04 03 0x33 0x01 0x00 0x00 Fibre Channel (0x2532,0x1077)
LBA Cell 03 (02/03): Occupied PCIe x8
Scan PCI:
Rope Slot Seg Bus Dev Fun Card
====================================================================
02 02 0x32 0x00 0x00 0x00 PCIe Root Port (0x403b,0x103c)
02 02 0x32 0x01 0x00 0x00 Fibre Channel (0x2532,0x1077)
LBA Cell 03 (00): Occupied PCI 33MHz
Scan PCI:
Rope Slot Seg Bus Dev Fun Card
====================================================================
00 00 0x30 0x00 0x01 0x00 Network (0xb921,0x1133)
Complete
Log NODE2
17:57 Tue May 04 2010. Reboot after panic: SafetyTimer expired, INIT, IIP:0x00000707fc4a2b60 IFA:0xe0000001205cfd28
2 REPLIES 2
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-05-2010 03:01 AM
05-05-2010 03:01 AM
Re: Problem with Cluster
hi Goriik,
I think monitored subnet is lost.
Please check for subnet enrty in .ascii file Whether it is true for your new IP
I think monitored subnet is lost.
Please check for subnet enrty in .ascii file Whether it is true for your new IP
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
05-05-2010 04:17 AM
05-05-2010 04:17 AM
Re: Problem with Cluster
Hi Goriik,
if you see, you panic occured due to "Safety timer expiration", which calls INIT to reboot the server and safety timer expiration comes, when cmcld is not able to communicate with the cluster nodes.
1) is your new IP address having the same subnet as your second node has?
2) i agree with S.N.S, check your monitored subnet. it might have lost.
if you see, you panic occured due to "Safety timer expiration", which calls INIT to reboot the server and safety timer expiration comes, when cmcld is not able to communicate with the cluster nodes.
1) is your new IP address having the same subnet as your second node has?
2) i agree with S.N.S, check your monitored subnet. it might have lost.
The opinions expressed above are the personal opinions of the authors, not of Hewlett Packard Enterprise. By using this site, you accept the Terms of Use and Rules of Participation.
Company
Events and news
Customer resources
© Copyright 2025 Hewlett Packard Enterprise Development LP