- Re: Load balancing bonding and DEADMAN timeout und...
Operating System - Linux
12-14-2006 07:35 AM
Hi,
We are having serious problems with a Serviceguard (A.11.16.02-0) cluster running under Red Hat Enterprise Linux AS 4.0 (x86_64) on two DL585 G1 boxes.
From time to time a server hangs and is rebooted by the ASR timeout after 10 minutes. When it starts up again, the bonded network fails.
The network bonding mode I have configured is "balance-tlb" (mode=5). I read that SG does not support load-balancing modes.
1. Is this still true with Serviceguard A.11.16.02 and Red Hat 4?
When the server hangs, it is rebooted by the ASR timeout and not by the DEADMAN timeout.
2. What is the value of the DEADMAN timeout?
By the way, I have NODE_TOC_BEHAVIOR="reboot".
Any help or suggestion is highly appreciated.
TIA.
Kind Regards,
Rui Vilao.
	"We should never stop learning"_________ rui.vilao@rocketmail.com
			
			
12-14-2006 06:26 PM
1. You said 4.0. If this is the base Red Hat 4 release, it is not supported. The certification matrix shows RH4 support starting with Update 1.
See ftp://ftp.compaq.com/pub/products/servers/ha/linux/svcguard-certmatrix.pdf
The deadman driver timeout varies with the heartbeat timeouts. But if the hang is that "hard", the deadman driver will not run at all. The deadman driver is there to catch the system after it "unhangs", so that it causes no problems. ASR rebooting the system is therefore to be expected in some cases. The key question from a Serviceguard perspective is: did the packages fail over?
I think the ASR timeout is configurable, so drop it to a lower value if you wish.
On bonding: if I remember correctly, modes 0 and 1 are supported. I believe this is in the docs; try a search for "mode" in Acrobat. It is too late here for me to double-check.
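To make the mode numbers concrete, here is a minimal Python sketch that maps the Linux bonding driver's mode numbers to their names and checks an options line against the modes recalled above. The set of "supported" modes (0 and 1) is only what this reply remembers, not something checked against the Serviceguard documentation, and the function names and the sample options line are made up for illustration.

```python
import re

# Mode numbers and names used by the Linux bonding driver.
BONDING_MODES = {
    0: "balance-rr",
    1: "active-backup",
    2: "balance-xor",
    3: "broadcast",
    4: "802.3ad",
    5: "balance-tlb",
    6: "balance-alb",
}

# Assumption: taken from the reply above ("modes 0 and 1"),
# not verified against the Serviceguard documentation.
RECALLED_SUPPORTED = {0, 1}


def parse_bonding_mode(options_line):
    """Return the numeric mode from a line such as
    'options bond0 mode=5 miimon=100', or None if no known mode is given."""
    match = re.search(r"\bmode=(\S+)", options_line)
    if match is None:
        return None
    value = match.group(1)
    if value.isdigit():
        number = int(value)
        return number if number in BONDING_MODES else None
    # The mode may also be given by name, e.g. mode=balance-tlb.
    for number, name in BONDING_MODES.items():
        if name == value:
            return number
    return None


def is_recalled_supported(options_line):
    """True if the line names a mode in RECALLED_SUPPORTED."""
    return parse_bonding_mode(options_line) in RECALLED_SUPPORTED
```

With the configuration from the question, `is_recalled_supported("options bond0 mode=5 miimon=100")` is false, while the same line with `mode=1` or `mode=active-backup` passes the check.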
01-19-2007 03:22 AM
Re: Load balancing bonding and DEADMAN timeout under SG
Thanks!
					
				
			
			
				
		
		
	
	
	
	"We should never stop learning"_________ rui.vilao@rocketmail.com
			
			
				
			
			
			
			
			
			
