Safety timer TOC, with SG on Integrity VM
04-30-2008 02:53 AM
I have built a small test environment on one box, with 2 Integrity VM guests (IVM version 3.50) running HP-UX 11.23. I have installed and configured SG 11.17 and have a cluster running with no packages. When I halt the cluster, or just one node, the other node TOCs, and the shutdown log shows a safety timer panic.
Here is the panic and some config:
Apr 30 2008. Reboot after panic: SafetyTimer expired, INIT, IIP:0xe0000000014123d0 IFA:0x0000000000000045
This is what I get in the syslog of the machine that doesn't TOC:
Apr 29 18:15:37 iumtest2 cmcld[5118]: Request from root on node iumtest3 to halt the cluster on this node
Apr 29 18:15:37 iumtest2 cmcld[5118]: Turning off safety time: node halting
Apr 29 18:15:37 iumtest2 cmcld[5118]: Service cmlvmd terminated due to an exit(0).
On the machine that TOCs I get no info at all.
The heartbeat networks and the Quorum server are working OK.
I increased the node timeout, but I still have the same problem.
CPU and memory are OK, because the machines are completely idle.
Any idea what else I can check?
04-30-2008 03:01 AM
Re: Safety timer TOC, with SG on Integrity VM
Apr 30 12:51:16 iumtest2 cmcld[3441]: Global Cluster Information:
Apr 30 12:51:16 iumtest2 cmcld[3441]: Heartbeat Interval is 1.00 seconds.
Apr 30 12:51:16 iumtest2 cmcld[3441]: Logging level changed to level 0.
Apr 30 12:51:16 iumtest2 cmcld[3441]: Node Timeout is 12.00 seconds.
Apr 30 12:51:16 iumtest2 cmcld[3441]: Network Polling Interval is 1.00 seconds.
Apr 30 12:51:16 iumtest2 cmcld[3441]: IO Timeout Extension is 70.00 seconds.
Apr 30 12:51:16 iumtest2 cmcld[3441]: Auto Start Timeout is 600.00 seconds.
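When comparing timing settings across nodes or across test runs, it can help to pull these values out of syslog automatically. The following is a minimal Python sketch, assuming the cmcld lines keep the "<name> is <value> seconds." shape shown above; the function name and the regular expression are illustrative and not part of Serviceguard.

```python
import re

# Assumption: timing lines look like "cmcld[PID]: <name> is <number> seconds."
PARAM_RE = re.compile(
    r"cmcld\[\d+\]: (?P<name>.+?) is (?P<value>\d+(?:\.\d+)?) seconds\."
)


def parse_cluster_timings(lines):
    """Return {parameter name: seconds} for every cmcld timing line found."""
    timings = {}
    for line in lines:
        match = PARAM_RE.search(line)
        if match:
            timings[match.group("name")] = float(match.group("value"))
    return timings


# The syslog excerpt from the post above, copied verbatim.
SAMPLE = """\
Apr 30 12:51:16 iumtest2 cmcld[3441]: Global Cluster Information:
Apr 30 12:51:16 iumtest2 cmcld[3441]: Heartbeat Interval is 1.00 seconds.
Apr 30 12:51:16 iumtest2 cmcld[3441]: Logging level changed to level 0.
Apr 30 12:51:16 iumtest2 cmcld[3441]: Node Timeout is 12.00 seconds.
Apr 30 12:51:16 iumtest2 cmcld[3441]: Network Polling Interval is 1.00 seconds.
Apr 30 12:51:16 iumtest2 cmcld[3441]: IO Timeout Extension is 70.00 seconds.
Apr 30 12:51:16 iumtest2 cmcld[3441]: Auto Start Timeout is 600.00 seconds.
""".splitlines()

if __name__ == "__main__":
    for name, seconds in parse_cluster_timings(SAMPLE).items():
        print(f"{name}: {seconds:g} s")
```

Lines that do not match the pattern, such as the "Logging level" message, are simply skipped.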
04-30-2008 05:50 AM
Solution
Do you face the same issue if you halt the other node, the one that is TOCing now?
As long as one node leaves the cluster gracefully, there shouldn't be any issues on the other node.
04-30-2008 06:48 AM
Re: Safety timer TOC, with SG on Integrity VM
cmhaltcl --> the node where I run the command TOCs
cmhaltnode iumtest3 --> iumtest2 TOCs
cmhaltnode iumtest2 --> iumtest3 TOCs
04-30-2008 06:52 AM
Re: Safety timer TOC, with SG on Integrity VM
cmruncl -n iumtest3
and then:
cmhaltcl
the cluster stops with no problem.
04-30-2008 11:19 AM
Re: Safety timer TOC, with SG on Integrity VM
Please check the output of cmscancl as well.
You can also check the flight recorder logs for more details. See whether the node that is TOCing is saving a crash dump.
05-05-2008 12:11 AM
Re: Safety timer TOC, with SG on Integrity VM
In the OLDsyslog from the machine that TOCs there is nothing, not even the node halt command. On the other node you get:
Apr 29 18:15:37 iumtest2 cmcld[5118]: Request from root on node iumtest3 to halt the cluster on this node
Apr 29 18:15:37 iumtest2 cmcld[5118]: Turning off safety time: node halting
Apr 29 18:15:37 iumtest2 cmcld[5118]: Service cmlvmd terminated due to an exit(0).
and nothing else.
It's not leaving any cluster core dumps either.
Everything looks OK in the cmscancl output. I have attached it if you want to have a look.
Thanks
05-05-2008 04:24 AM
Re: Safety timer TOC, with SG on Integrity VM
I took a look at the cmscancl output. Nothing stands out except the heartbeat: it is attached to only one network, 10.132.75.0 on lan1. I would also attach it to 10.10.10.0 on lan0.
I don't understand how that could be the issue, but I found this in "HP Integrity Virtual Machines Installation, Configuration, and Administration Version A.03.50", page 154:
-----------------------------
Whether Serviceguard is installed on the VM Host system or on the guest, HP
recommends that you configure every LAN as a heartbeat LAN.
-----------------------------
So try replacing STATIONARY_IP with HEARTBEAT_IP for lan0 and tell us what happens.
Eric
05-05-2008 06:17 AM
Re: Safety timer TOC, with SG on Integrity VM
The same thing happens. This time a little more info was available on the machine that didn't panic:
May 5 15:53:05 iumtest2 cmcld[3110]: Request from root on node iumtest3 to halt the cluster on this node
May 5 15:53:05 iumtest2 cmcld[3110]: Request from node iumtest3 to disable node switching for package test1 on node iumtest2.
May 5 15:53:05 iumtest2 cmcld[3110]: Request from node iumtest3 to disable global switching for package test1.
May 5 15:53:05 iumtest2 cmcld[3110]: (iumtest3) Halted package test1 on node iumtest3.
May 5 15:53:05 iumtest2 cmcld[3110]: Request from node iumtest3 to enable global switching for package test1.
May 5 15:53:05 iumtest2 cmcld[3110]: Package test1 cannot run on this node because switching has been disabled for this node
May 5 15:53:05 iumtest2 cmcld[3110]: Turning off safety time: node halting
May 5 15:53:05 iumtest2 cmcld[3110]: Service cmlvmd terminated due to an exit(0).
May 5 15:56:19 iumtest2 cmcld[3110]: HB connection to 10.10.10.2 not responding, closing
May 5 15:56:19 iumtest2 cmcld[3110]: HB connection to 10.132.75.47 not responding, closing
May 5 15:56:19 iumtest2 cmcld[3110]: GS connection to 10.10.10.2 not responding, closing
May 5 15:56:19 iumtest2 cmcld[3110]: GS connection to 10.132.75.47 not responding, closing
May 5 15:56:19 iumtest2 cmcld[3110]: Service cmnetassistd terminated due to an exit(0).
It looks like the whole machine goes AWOL and freezes as soon as you type the cmhaltcl command.
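To put a number on the freeze, one can measure the time between two syslog entries, for example the "Turning off safety time" message and the first "HB connection ... not responding" message in the excerpt above. The following is a rough Python sketch. The helper names are made up for illustration, the year has to be supplied because syslog omits it, and `%b` assumes English month abbreviations.

```python
from datetime import datetime


def syslog_time(line, year):
    """Parse the leading 'Mon D HH:MM:SS' stamp of a syslog line.

    Syslog stamps carry no year, so the caller passes it in.
    """
    month, day, clock = line.split()[:3]
    return datetime.strptime(
        f"{year} {month} {day} {clock}", "%Y %b %d %H:%M:%S"
    )


def seconds_between(first_line, second_line, year):
    """Elapsed seconds from first_line to second_line (same year assumed)."""
    delta = syslog_time(second_line, year) - syslog_time(first_line, year)
    return delta.total_seconds()


# Two lines copied from the iumtest2 syslog excerpt above.
HALT = "May 5 15:53:05 iumtest2 cmcld[3110]: Turning off safety time: node halting"
HB_LOST = "May 5 15:56:19 iumtest2 cmcld[3110]: HB connection to 10.10.10.2 not responding, closing"

if __name__ == "__main__":
    print(seconds_between(HALT, HB_LOST, 2008))
```

For these two lines the gap is 3 minutes 14 seconds. Repeating the measurement after changing the node timeout would show whether the gap follows that setting.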
05-05-2008 10:35 AM
Re: Safety timer TOC, with SG on Integrity VM
What about the stability of the setup if you don't halt a node? Does it run without error messages like these on either node?
If the heartbeat is not responding, then the cluster lock disk comes into the picture. Do you have a cluster lock VG and PV in place to break a tie between the nodes?
05-05-2008 11:52 PM
Re: Safety timer TOC, with SG on Integrity VM
When it comes to the cluster lock, I have tried both a Quorum server and a lock disk, with the same result.
As a test I increased the node timeout up to 10 minutes. The machine on which you issue the command freezes for 10 minutes and then TOCs.
05-06-2008 12:00 AM
Re: Safety timer TOC, with SG on Integrity VM
regards,
ivan
05-06-2008 11:18 AM
Re: Safety timer TOC, with SG on Integrity VM
Sorry for my late post: I was mostly out of the office today.
Well, I have no obvious idea of what is happening. Maybe you could post some more information:
- "hpvmnet", and "hpvmnet -S XXX" for each virtual switch XXX
- "hpvmstatus", and "hpvmstatus -P YYY" for each guest YYY
- cluster configuration file, package configuration file and package control script
Please post it as an attachment, in a gzipped tar file: it is more readable and easier to use ;-)
Have you placed a call with HP? And have you searched for all patches for HPVM and MCSG?
Regards
Eric
05-07-2008 11:42 PM
Re: Safety timer TOC, with SG on Integrity VM
At the moment I am updating the host and client patches, and also HPIVM to 3.5, and I am going to try using the AVIO LAN drivers.
I will post an update when I have finished and tried out the cluster.