- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - OpenVMS
- >
- Re: What kills X sessions?
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-07-2009 11:33 AM
08-07-2009 11:33 AM
What kills X sessions?
We are seeing X sessions being killed, typically a few 10s of seconds after they logged in, after DECterm windows have been opened, whether or not the user is busy doing things. Sometimes the session is killed even before one has had a chance to login.
The X host (client) systems are a variety of Alphas running OpenVMS 8.3 (update 8) in a cluster, with Multinet TCP/IP v5.2.
The problem has been seen both with Linux display systems (using gdm in chooser mode and xorg) and with NCD X terminals.
When it happens, the NCDs generate a popup menu asking if the session should be killed, giving the user the ability to keep it from happening. The Linux display systems, however, simply kill all the windows and redisplay the XDMCP chooser. We have not found any option in xorg to prevent this.
Thanks for whatever help anyone can provide.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-07-2009 12:11 PM
08-07-2009 12:11 PM
Re: What kills X sessions?
A suggestion: Check the accounting logs on the OpenVMS system and see if there is anything about why the process terminated.
- Bob Gezelter, http://www.rlgsc.com
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-07-2009 01:21 PM
08-07-2009 01:21 PM
Re: What kills X sessions?
%SYSTEM-F-EXITFORCED, forced exit of image or process by SYS$DELPRC
and the processes associated with FTA devices are terminating with
%CLI-S-NORMAL, normal successful completion
The last few lines in DECW$SM.LOG are
=====================================
X connection to WSA136: broken (explicit kill or server shutdown).
Fatal error detected, image exiting -- final message:
Exiting DECW$STARTSM.COM
==============================
But we essentially already knew that. What we don't know is what is causing the X session termination.
For the weekend we've reconfigured one of the Linux systems so people can use ssh and then invoke CREATE/TERMINAL and other X programs. Jobs started that way don't have this abnormal termination problem. That's really not ideal, though, since the operations staff are much more familiar with the VMS login procedures than Linux.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-07-2009 04:10 PM
08-07-2009 04:10 PM
Re: What kills X sessions?
Are you running any type of idle process killer applications on your system? One that comes to mind is something like "hitman" in which if not configured correctly will kill some decwindows processes such as the Decw$te_nnnn processes or any process that shows up as "Disconnected" like the Motif Window Manager will sometimes.
How exactly are you starting your X Session on the OpenVMS system?
Are you using XDMCP, to start a session to display to the Linux system?
Are you using manually starting running DECW$STARTSM.COM via a command procedure or some type of application launcher?
When you Start your X Session from the OpenVMS system are you using CDE or Traditional DECWindows?
Between Decwindows and Linux, there can only be one window manager running to manage all of the windows on the display. Is it possible that the window managers are colliding with each other causing the CDE or Decwindows Motif Window manager to exit taking down the X Session as well?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-08-2009 12:57 AM
08-08-2009 12:57 AM
Re: What kills X sessions?
I'm also guessing that the network isn't heavily loaded (I've seen situations where Xsessions die because someone was doing copies of windows system disk images around the network. I wasn't happy that they were screwing my systems up!)
Finally, you haven't been editing any of the login command procedures to explicitly do a LOGOUT where they never used to do them?
Steve
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-08-2009 06:53 AM
08-08-2009 06:53 AM
Re: What kills X sessions?
... clean out LOGIN.COM and SYLOGIN.COM and DECW$LOGIN.COM. Make it all go away, just for the testing.
... remove some pieces. Start by removing xdm.
... add parallel pieces. Add an OpenVMS-based DECwindows via IP display.
... look at the network level. Switches and vlans and firewalls can be set to block traffic, and can cause weirdness.
... confirm that the IP traffic is enabled underneath DECwindows. (That transport was optional and had to be lit manually for eons; I haven't looked to see if it's now lit by default.)
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-10-2009 08:54 AM
08-10-2009 08:54 AM
Re: What kills X sessions?
We're not running any idle process killer
The failing sessions are started using the Gnome Display Manager's chooser, which uses XDMCP to bring up the standard VMS login screen.
We're using the traditional Motif window manager, not CDE.
There is no manager collision: the linux systems are not running any window manager.
Steve,
We're not using captive accounts.
The X clients are running at 10Mbits/half duplex. The servers are running at 100Mbits/full duplex. This is intentional to make sure they have plenty of bandwidth available. The session shutdowns don't seem to be related to network traffic levels.
The times that the sessions die seem to be at random times, unrelated to when the login command files were run. Sometimes before the login prompt appeared, sometimes tens of seconds after the login completed.
Hoff,
Part of our frustration is that there have been no known system-level changes between the time jobs were logging in OK and now, when they die.
In particular, there have been no changes to login.com of the affected accounts in recent weeks, since before the problem started. The problem is seen with accounts with sophisticated logins and accounts with near-empty ones. The things in common between them have been in place for years.
One of the common threads does seem to be that XDMCP is involved in starting the jobs. Windows opened in other waya seem to be unaffected.
Since the XDMCP server is part of Multinet, I'll try posting to info-multinet to see if anyone there has any thoughts.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-10-2009 10:16 PM
08-10-2009 10:16 PM
Re: What kills X sessions?
>> =====================================
>> X connection to WSA136: broken (explicit kill or server shutdown).
>> Fatal error detected, image exiting -- final message:
>> Exiting DECW$STARTSM.COM
I would pay more attention to the display side! I guess the culprit is the display server WSA136 (or whatever the active display is) is pointing to. Check the logs of the X-server there and the logs of the transport protocol (TCP/IP).
Edwin
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-10-2009 11:41 PM
08-10-2009 11:41 PM
Re: What kills X sessions?
I took a tcp trace and just found that both sides (KEA1X and VMS) agreed to stop. No explaining message at all. Problem was never solved, just phased out.
You could try a tcptrace of a session and try to understand that.
Wim
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-18-2009 09:35 AM
08-18-2009 09:35 AM
Re: What kills X sessions?
Its likely that your clients are using XDMCP keepalives, and missing some responses. The X-server then goes and kills all the client connections.
Have a look at the XDMCP request/response traffic on one of the Linux clients. It typically should be one request/response every 15 seconds.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
08-27-2009 11:57 AM
08-27-2009 11:57 AM
Re: What kills X sessions?
The network traces are very strange. Here's what our Linux expert reports.
===============================
Theproblem now seems to be jumping between cesr27 and cesr28.
Various logs of packets can be found in /home/dab66/xdmcp_packets/.
I captured packets for two separate attempts using:
tshark -R "ip.addr == 192.168.1.27"
I started an XDMCP sesssion on cesr27 as follows.
/usr/X11R6/bin/X -ac -once -query cesr27
On both attempts, the login window for cesr27 appeared for more than
10 seconds before the session terminated automatically. You can find
the packet captures in cesr27.log and cesr27.1.log.
Later on, the connections succeeded. A log of the packets from a
successful session can be found at cesr27_successful.log.
And finally, a log of packets from a successful connection to cesr28
is in cesr28_successful.log.
========================
Some of the failing sessions have keepalives in the packetlogs and some do not.
Some of the working sessions have keepalives in the packetlogs and some do not.
I can provide the logs to anyone who is willing to take a look.
*sigh*
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-01-2009 08:50 AM
09-01-2009 08:50 AM
Re: What kills X sessions?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-22-2009 02:09 AM
09-22-2009 02:09 AM
Re: What kills X sessions?
$ if (f$trnlnm ("Decw$Display") .nes. "")
$ then
$ TmpSetOn = (f$environ ("ON_SEVERITY") .nes. "NONE")
$ Set NoOn
$
$ Define /User Sys$output NLA0:
$ Show Display /Symbol
$ if (DECW$DISPLAY_TRANSPORT .nes. "LAT")
$ then
$ Set Display /Create /Node= "''DECW$DISPLAY_NODE'" -
/Transport= "''DECW$DISPLAY_TRANSPORT'" -
/Screen= "''DECW$DISPLAY_SCREEN'" -
/Server= "''DECW$DISPLAY_SERVER'"
$ endif
$
$ if (TmpSetOn) then Set On
$ endif
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-22-2009 12:16 PM
09-22-2009 12:16 PM
Re: What kills X sessions?
(we were at 3)
@Olivier: the timeouts happen even with just the login prompt on the screen, before logging in, so I fear that setting the display variables won't help :(
I sent a tcpdump to Process software (the Multinet vendors). They're puzzled, too.
More *sigh*s.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-22-2009 02:00 PM
09-22-2009 02:00 PM
Re: What kills X sessions?
have you looked at what OPCOM is telling you when the jobs die?
have you tried increasing quotas to see if it's actually a quotas related issue?