- Re: System hang detected via timer popping on clus...
10-07-2003 09:14 PM
This is so strange.
We have two rp7400s running ServiceGuard (HP-UX 11.00), sharing a dual-controller va7400.
Unfortunately one of the controllers in the VA is down, so we reconfigured it to use a single controller.
The cmclconfig.ascii has FIRST_CLUSTER_LOCK_PV configured on the device file that targets the controller which is down. The cluster runs fine, but each time one of the two nodes (let's say the standby) gets rebooted, the main node follows pretty soon with "System hang detected via timer popping", after dumping memory.
Is there any explanation for that? When I cmhalted the standby node before rebooting it, the problem did not occur. I'll change the lock to point to the controller which is up, but this looks to me like a SPOF in such a redundant environment. Could you give me some feedback?
Thanks !
Mike
10-07-2003 09:22 PM
Re: System hang detected via timer popping on cluster nodes
You should definitely log a call with your local HP support service.
The 'timer popping' issue can have several root causes.
You will need to provide:
- the GSP logs
- the kernel dump, if there is one
- the syslog.log, OLDsyslog.log and dmesg outputs
Hope this helps, Bye.
Francis DERDEYN - HP-UX ASCE.
10-07-2003 09:29 PM
Re: System hang detected via timer popping on cluster nodes
I'd be fine with that if only one of the two nodes was involved, but actually, when both nodes are part of the cluster, whichever one reboots, the other will crash.
If I cmhaltnode the standby first, the situation is OK. It looks to me like an issue with the way the cluster lock (which is unobtainable because the disk controller is down) is handled in such a circumstance.
Thanks,
Mike
10-07-2003 09:31 PM
Re: System hang detected via timer popping on cluster nodes
Your cluster will run fine without FIRST_CLUSTER_LOCK_PV being accessible as long as there is no cluster reformation. When a cluster reformation happens, i.e. a node leaves or joins the cluster, all the online nodes will try to access the FIRST_CLUSTER_LOCK_PV, and failing to do so results in unexpected behaviour such as hanging.
It is the same in your case: your cluster is fine as long as the other node is not rebooted. As soon as you reboot it, the server that is still online tries to access the FIRST_CLUSTER_LOCK_PV, and since it can't, the node hangs. The cmcld daemon is in a hung state.
The first thing to do is make sure the FIRST_CLUSTER_LOCK_PV disk can be seen by both nodes.
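As a starting point for that check, here is a minimal Python sketch that lists which lock device each node is configured to use. It assumes the usual layout of the cluster ASCII file, with one FIRST_CLUSTER_LOCK_PV line under each NODE_NAME entry. The cluster name, node names and device paths in SAMPLE are hypothetical, not taken from this thread.

```python
def lock_pvs_by_node(config_text):
    """Map each NODE_NAME to the FIRST_CLUSTER_LOCK_PV listed under it."""
    result = {}
    node = None
    for raw in config_text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        parts = line.split()
        if len(parts) < 2:
            continue
        key, value = parts[0], parts[1]
        if key == "NODE_NAME":
            node = value
        elif key == "FIRST_CLUSTER_LOCK_PV" and node is not None:
            result[node] = value
    return result


# Hypothetical excerpt; real names and device files will differ.
SAMPLE = """\
CLUSTER_NAME            demo_cluster      # hypothetical
FIRST_CLUSTER_LOCK_VG   /dev/vglock
NODE_NAME               nodeA
  FIRST_CLUSTER_LOCK_PV /dev/dsk/c4t0d1
NODE_NAME               nodeB
  FIRST_CLUSTER_LOCK_PV /dev/dsk/c6t0d1
"""

if __name__ == "__main__":
    for name, pv in lock_pvs_by_node(SAMPLE).items():
        print(name, pv)
```

The script only reports what the configuration says. Whether each path actually reaches the disk through a working controller still has to be verified on the node itself.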
10-07-2003 09:35 PM
Re: System hang detected via timer popping on cluster nodes
If both nodes are showing the same message, I would strongly recommend that you log a call.
In my view, this could well be related to firmware issues: PDC, GSP.
Hope this helps, Bye.
Francis DERDEYN - HP-UX ASCE.
10-07-2003 09:37 PM
Just realised that by "hanging" I meant "crash".
This happens because when one of your nodes leaves the cluster, a cluster reformation takes place. Since the node that is still up cannot access the cluster lock disk to become the cluster manager, a TOC happens on that node to halt it, and that is the normal behaviour of MC/SG.
Another thing you might want to consider:
If you just have a 2-node cluster, why do you want a cluster lock disk?
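The behaviour discussed in this thread can be summarised as a small decision model. This is a deliberately simplified sketch, not Serviceguard's actual algorithm. It assumes a clean cmhaltnode-style departure needs no tie-break, a surviving group with more than half the nodes keeps running, and a group with exactly half must obtain the cluster lock or take a TOC. The function and parameter names are made up for illustration.

```python
def reformation_outcome(total_nodes, surviving_nodes, graceful, lock_reachable):
    """Return 'run' or 'TOC' for the surviving group in this toy model."""
    if graceful:
        # The departing node was halted cleanly, so no tie-break is needed.
        return "run"
    if 2 * surviving_nodes > total_nodes:
        # A strict majority survives and can reform on its own.
        return "run"
    if 2 * surviving_nodes == total_nodes and lock_reachable:
        # Exactly half survive: the cluster lock acts as tie-breaker.
        return "run"
    # Half without the lock, or a minority: the node halts itself.
    return "TOC"


if __name__ == "__main__":
    # The situation from this thread: 2 nodes, one rebooted, lock unreachable.
    print(reformation_outcome(2, 1, graceful=False, lock_reachable=False))
    # The same reboot after halting the node cleanly first.
    print(reformation_outcome(2, 1, graceful=True, lock_reachable=False))
```

In a 2-node cluster, any unplanned loss of one node leaves exactly half the membership. Under this model, that is why an unreachable lock disk turns a single reboot into a crash of the other node.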
10-07-2003 09:44 PM
Re: System hang detected via timer popping on cluster nodes
Regarding the need for a lock: this is actually an old configuration that has been in place for a long time. We could take this chance to think about changing it :-)
Thanks again for your quick participation!
Mike
10-07-2003 10:02 PM
Re: System hang detected via timer popping on cluster nodes
Running without a cluster lock is not an option in a 2-node cluster.
HTH
Duncan
I am an HPE Employee

10-07-2003 11:17 PM
Re: System hang detected via timer popping on cluster nodes
Took some time to convince hardware support they had to do anything though.
SEP
Owner of ISN Corporation
http://isnamerica.com
http://hpuxconsulting.com
Sponsor: http://hpux.ws
Twitter: http://twitter.com/hpuxlinux
Founder http://newdatacloud.com