<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Processes Mysteriously Being Deleted in Operating System - OpenVMS</title>
    <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088086#M25194</link>
    <description>Robert, was there a log file associated with this batch job ?</description>
    <pubDate>Tue, 15 Jan 2008 22:12:02 GMT</pubDate>
    <dc:creator>Thomas Ritter</dc:creator>
    <dc:date>2008-01-15T22:12:02Z</dc:date>
    <item>
      <title>Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088081#M25189</link>
      <description>We have a problem that's now happened for a 3rd year in a row. Over the Christmas period a couple of our batch jobs have terminated, like this :-&lt;BR /&gt;&lt;BR /&gt;Entry&amp;nbsp; Jobname&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Username&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Blocks&amp;nbsp; Status&lt;BR /&gt;&amp;nbsp; -----&amp;nbsp; -------&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; --------&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; ------&amp;nbsp; ------&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; 324&amp;nbsp; GBSINVRUN&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; BATCHUSER&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Retained on error&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; %SYSTEM-F-EXITFORCED, forced exit of image or process by SYS$DELPRC&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; On available batch queue BETA$OPERATIONS&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Completed 31-DEC-2007 07:45:52.97 on queue BETA$OPERATIONS&lt;BR /&gt;&lt;BR /&gt;We cannot find a reason for this. Is only effects a couple of processes each year, and only happens at Christmas.&lt;BR /&gt;&lt;BR /&gt;We Autogen/reboot every month, so it's not a 'build-up' type issue.&lt;BR /&gt;&lt;BR /&gt;What I'd like to know is if anyone else has seen anything that matches this profile - could there be a VMS virus that only effects a few systems?&lt;BR /&gt;&lt;BR /&gt;Rob.&lt;BR /&gt;</description>
      <pubDate>Tue, 15 Jan 2008 15:50:39 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088081#M25189</guid>
      <dc:creator>Robert Atkinson</dc:creator>
      <dc:date>2008-01-15T15:50:39Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088082#M25190</link>
      <description>&amp;gt;&amp;gt; Over the Christmas period a couple of our batch jobs have terminated&lt;BR /&gt;&lt;BR /&gt;So this happened over multiple days?&lt;BR /&gt;So we can not blame say an idle process killer which got nervous (miscalculated) year-end? Anyway, the example shown is too far away from new-year even fudging in UTC consideration (looks like local time = utc for you).&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&amp;gt;&amp;gt; this. Is only effects a couple of processes each year, and only happens at Christmas.&lt;BR /&gt;&lt;BR /&gt;I think someone wants to go home early?!&lt;BR /&gt;Sorry, not obvious explanation.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&amp;gt;&amp;gt; We Autogen/reboot every month, so it's not a 'build-up' type issue.&lt;BR /&gt;&lt;BR /&gt;Why would you want to do that? Oh well.&lt;BR /&gt;Does that reboot come with process deletes?&lt;BR /&gt;Are the processes targetted just (much) longer running than anticapted aroudn Xmass?&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&amp;gt;&amp;gt; could there be a VMS virus that only effects a few systems?&lt;BR /&gt;&lt;BR /&gt;Ah, finally a simple, one word, answer: NO.&lt;BR /&gt;Two words? NO WAY!&lt;BR /&gt;&lt;BR /&gt;grins,&lt;BR /&gt;Hein.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 15 Jan 2008 17:23:32 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088082#M25190</guid>
      <dc:creator>Hein van den Heuvel</dc:creator>
      <dc:date>2008-01-15T17:23:32Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088083#M25191</link>
      <description>You'll want to enable auditing of privileges or auditing of process control system services, so that you can capture the $delprc or $forcex call for next year.  Or set yourself a calendar event for mid December 2008, and enable the requisite auditing then.  (You might want to have at least some of this auditing enabled as a matter of course.)&lt;BR /&gt;&lt;BR /&gt;If the processes are the approximate same set each year (check last year's accounting, if you still have it on-line or in your BACKUP archives), it is easily possible that the code itself is detecting the end of the year as part of its normal processing (as I'm guessing the code is date-related, based on that name), and it's failing and/or forcing an exit.&lt;BR /&gt;&lt;BR /&gt;Look for patterns.&lt;BR /&gt;&lt;BR /&gt;Look for application code messing with scheduling or with system time, or with time-change or TDF-related events, or other such code.  (If you don't have application code, check with the vendor or whomever has the code.)&lt;BR /&gt;&lt;BR /&gt;As for malware, an underlying operating system bug, or most any other out-of-band and external trigger for an error such as this, a site-local tool or procedure or a latent application error is a far more likely trigger for this sort of behavior.&lt;BR /&gt;&lt;BR /&gt;The so-called "built-up" issues are generally also signs of application errors, or process errors.   OpenVMS is quite capable of running for a decade, barring the standard maintenance and requisite patches.  I'd not bother with a monthly reboot and an AUTOGEN pass, save for cases where there are large-scale changes in load.&lt;BR /&gt;&lt;BR /&gt;I would look to cases such as file versions approaching 32K (tools such as DFU, or updates to the DIRECTORY command in recent OpenVMS versions can help here), and toward cruft "building up" in indexed files, and related.  There are cases where cruft can build up, but these are generally due to the way the application operates, or operates with OpenVMS.  Reboots won't generally cure these cases, either.&lt;BR /&gt;&lt;BR /&gt;Malware is certainly quite possible on OpenVMS, but you do need to ask yourself "am I really a target?" and "am I the first site to see malware in the wild on OpenVMS since, well, CCC and WANK?".  And you need to ask yourself "how did I get infested?", as transmission is rather difficult using the typical PC vectors of mail and web.  If you're handling boatloads of assets, you might well be a specific malware target.  But otherwise, hearing hoof beats usually mean horses, and not zebras.  This means application and process bugs, then maybe OpenVMS or LP bugs, and not malware.&lt;BR /&gt;</description>
      <pubDate>Tue, 15 Jan 2008 19:21:04 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088083#M25191</guid>
      <dc:creator>Hoff</dc:creator>
      <dc:date>2008-01-15T19:21:04Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088084#M25192</link>
      <description>&lt;!--!*#--&gt;Is it possible you have some type of process watcher that is looking for "runaway" processes and it is deleting the processes?  Perhaps due to seasonality, the processes are taking more resources than they normally do, and thus appear to be in a loop.&lt;BR /&gt;&lt;BR /&gt;Have you looked at accounting records for the resources used by the processes that are being deleted?   If you have image accounting turned on, this would also be helpful.  Even more helpful would be audit records if &lt;BR /&gt;&lt;BR /&gt;$ set audit/audit/enable=(process=all)&lt;BR /&gt;&lt;BR /&gt;is in effect, or for only monitoring $DELPRC events:&lt;BR /&gt;&lt;BR /&gt;$ set audit/audit/enable=(process=delprc)&lt;BR /&gt;&lt;BR /&gt;Before next year's Christmas period, you may want to turn on process auditing and possibly image accounting.  &lt;BR /&gt;&lt;BR /&gt;Is there special year end processing that must complete before 1-jan?  (Looking at the 31-DEC-2007 date).&lt;BR /&gt;&lt;BR /&gt;A bit more about image accounting:&lt;BR /&gt;&lt;BR /&gt;Image accounting can consume disk space quickly if you have many image rundowns.  But disk space is the primary resource it consumes.  We run with image accounting on all the time.  A side benefit of having image accounting turned on is that the CTL$xx_I* cells in the process control region are populated when images startup, so for example it is possible to determine when an image started while it is currently running by using something like this:&lt;BR /&gt;&lt;BR /&gt;SDA&amp;gt; set proc/in=20207802&lt;BR /&gt;SDA&amp;gt; exam/time ctl$gq_istart&lt;BR /&gt;15-JAN-2008 13:12:07.46&lt;BR /&gt;SDA&amp;gt; &lt;BR /&gt;&lt;BR /&gt;To enable image accounting:&lt;BR /&gt;&lt;BR /&gt;$ SET ACCOUNTING /ENABLE=IMAGE&lt;BR /&gt;&lt;BR /&gt;See help set accounting for more info.&lt;BR /&gt;&lt;BR /&gt;Image accounting records can be surprisingly useful for troubleshooting.  So can audit records.&lt;BR /&gt;&lt;BR /&gt;Jon&lt;BR /&gt;</description>
      <pubDate>Tue, 15 Jan 2008 19:50:47 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088084#M25192</guid>
      <dc:creator>Jon Pinkley</dc:creator>
      <dc:date>2008-01-15T19:50:47Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088085#M25193</link>
      <description>Rob,&lt;BR /&gt;&lt;BR /&gt;  Since you're only getting once a year shots at this, you should turn on more auditing than you think you need to make sure you can follow leads. In addition to PROCESS audits as already suggested, I'd also recommend both LOGIN and LOGOUT audits:&lt;BR /&gt;&lt;BR /&gt;$ SET AUDIT/AUDIT=(LOGIN=ALL,LOGOUT=ALL)&lt;BR /&gt;&lt;BR /&gt;  So, when your enable=(process=delprc) audit tells you some random process did it, you can determine when and from where that process was created. Since it's OpenVMS, you should probably enable your audits well in advance (perhaps just BEFORE a reboot prior to the expected event). That way you can be certain you'll have a record of all the processes that might be responsible.&lt;BR /&gt;&lt;BR /&gt; If you have sufficient storage and processing power, you could take the thermonuclear option and enable ALL auditing for the suspect period. (believe it or not, I've seen systems with full auditing enabled, but they were specifically sized with extra CPUs and storage to cope with the auditing load, and if I told you where they were I'd have to shoot you ;-)</description>
      <pubDate>Tue, 15 Jan 2008 21:16:38 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088085#M25193</guid>
      <dc:creator>John Gillings</dc:creator>
      <dc:date>2008-01-15T21:16:38Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088086#M25194</link>
      <description>Robert, was there a log file associated with this batch job ?</description>
      <pubDate>Tue, 15 Jan 2008 22:12:02 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088086#M25194</guid>
      <dc:creator>Thomas Ritter</dc:creator>
      <dc:date>2008-01-15T22:12:02Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088087#M25195</link>
      <description>Did you do a:&lt;BR /&gt;$ show que/all/full &lt;QUEUE_NAME&gt;&lt;BR /&gt;&lt;BR /&gt;on this queue?  Or if you leave out the queue name, it will list all of the information for every queue on your system.  You can scroll back and see what file was submitted, what time it was submitted, and what qualifiers were used.  Then you can work your way backwards and forwards to figure out what the job is doing and why it might be getting stopped.  You could add commands to the command file that is being run to identify when and where it is getting stopped.&lt;/QUEUE_NAME&gt;</description>
      <pubDate>Wed, 16 Jan 2008 01:47:26 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088087#M25195</guid>
      <dc:creator>DECxchange</dc:creator>
      <dc:date>2008-01-16T01:47:26Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088088#M25196</link>
      <description>If you analyze the moment of process deletion with VPA or whatever, you could find which programs were also active around that moment and may be find the guilty one (just hope it's called christmas_process_del.exe).&lt;BR /&gt;&lt;BR /&gt;Also accounting could reveal something.&lt;BR /&gt;&lt;BR /&gt;Bush would say "It's Al-Qaeda. Lets bomb Iran.".&lt;BR /&gt;&lt;BR /&gt;Wim</description>
      <pubDate>Wed, 16 Jan 2008 07:50:34 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088088#M25196</guid>
      <dc:creator>Wim Van den Wyngaert</dc:creator>
      <dc:date>2008-01-16T07:50:34Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088089#M25197</link>
      <description>Many thanks for all of the responses. If I can deal with some of the simple answers first.....&lt;BR /&gt;&lt;BR /&gt;We don't use a process monitor/killer anywhere on the system, so I think this can be ruled out.&lt;BR /&gt;&lt;BR /&gt;We used to Autogen every week, but have dropped to monthly - 2nd week in the month. There are pros and cons to this frequency, which I don't want to discuss here and detract from the initial problem.&lt;BR /&gt;&lt;BR /&gt;The application we run does the same thing, day in, day out. The same sets of programs are used, running the same data against the same code.&lt;BR /&gt;&lt;BR /&gt;It's very unlikely that something exists in the application software, especially as some of the DELPRC events have occurred whilst the bagtch job has been running DCL WAIT commands.&lt;BR /&gt;&lt;BR /&gt;Completely agree with enabling auditing towards the end of 2008, and we've switched some extra events on already, however, we only see this problem near the end of the year.&lt;BR /&gt;&lt;BR /&gt;I'm doing some extra checking on exactly when this problem has ocurred, as it may be we're running a program during our yearend that effects VMS in some way, and shows up a day or two later.&lt;BR /&gt;&lt;BR /&gt;Sorry for not including this earlier, but this is what DIAG thinks of it :-&lt;BR /&gt;&lt;BR /&gt;**** V3.4  ********************* ENTRY  533 ******************************** &lt;BR /&gt;&lt;BR /&gt; &lt;BR /&gt;&lt;BR /&gt; &lt;BR /&gt;&lt;BR /&gt;Logging OS                        1. OpenVMS &lt;BR /&gt;&lt;BR /&gt;System Architecture               2. Alpha &lt;BR /&gt;&lt;BR /&gt;OS version                           V7.3-2   &lt;BR /&gt;&lt;BR /&gt;Event sequence number         18983. &lt;BR /&gt;&lt;BR /&gt;Timestamp of occurrence              31-DEC-2007 07:45:52 &lt;BR /&gt;&lt;BR /&gt;Time since reboot                    15 Day(s) 16:50:36 &lt;BR /&gt;&lt;BR /&gt;Host name                            BETA     &lt;BR /&gt;&lt;BR /&gt; &lt;BR /&gt;&lt;BR /&gt;System Model                         AlphaServer ES45 Model 2B &lt;BR /&gt;&lt;BR /&gt; &lt;BR /&gt;&lt;BR /&gt;Entry Type                       40. System Bugcheck &lt;BR /&gt;&lt;BR /&gt; &lt;BR /&gt;&lt;BR /&gt;Bugcheck Minor class              2. System Bugcheck &lt;BR /&gt;&lt;BR /&gt; &lt;BR /&gt;&lt;BR /&gt;Bugcheck Msg                         RMSBUG, RMS has detected an invalid &lt;BR /&gt;&lt;BR /&gt;                                     condition &lt;BR /&gt;&lt;BR /&gt;Process ID                x007E0097 &lt;BR /&gt;&lt;BR /&gt;Process Name                         GBSINVRUN &lt;BR /&gt;&lt;BR /&gt;KSP                       x000000007FF88000 &lt;BR /&gt;&lt;BR /&gt;ESP                       x000000007FF8BC74 &lt;BR /&gt;&lt;BR /&gt;SSP                       x000000007FF9CD70 &lt;BR /&gt;&lt;BR /&gt;USP                       x000000007AD51A70 &lt;BR /&gt;&lt;BR /&gt;R0                        x0000000000000001 &lt;BR /&gt;&lt;BR /&gt;R1                        x0000000000000001 &lt;BR /&gt;&lt;BR /&gt;R2                        xFFFFFFFFFFFFFFFA &lt;BR /&gt;&lt;BR /&gt;R3                        xFFFFFFFF8A9022A0 &lt;BR /&gt;&lt;BR /&gt;R4                        xFFFFFFFF8A9025B0 &lt;BR /&gt;&lt;BR /&gt;R5                        x0000000000000000 &lt;BR /&gt;&lt;BR /&gt;R6                        x0000000000018001 &lt;BR /&gt;&lt;BR /&gt;R7                        x0000000000000000 &lt;BR /&gt;&lt;BR /&gt;R8                        x000000007FFCEE24 &lt;BR /&gt;&lt;BR /&gt;R9                        x000000007FFD0218 &lt;BR /&gt;&lt;BR /&gt;R10                       x0000000000000001 &lt;BR /&gt;&lt;BR /&gt;R11                       x000000007FFD00B8 &lt;BR /&gt;&lt;BR /&gt;R12                       x000000007FFCDA60 &lt;BR /&gt;&lt;BR /&gt;R13                       xFFFFFFFF8A90DEA8 &lt;BR /&gt;&lt;BR /&gt;R14                       xFFFFFFFF819E6080 &lt;BR /&gt;&lt;BR /&gt;R15                       x000000007AE4DE20 &lt;BR /&gt;&lt;BR /&gt;R16                       x00000000000003A0 &lt;BR /&gt;&lt;BR /&gt;R17                       xFFFFFFFFFFFFFFFA &lt;BR /&gt;&lt;BR /&gt;R18                       x0000000000000004 &lt;BR /&gt;&lt;BR /&gt;R19                       x000000007FFD0010 &lt;BR /&gt;&lt;BR /&gt;R20                       x000000007FFD0010 &lt;BR /&gt;&lt;BR /&gt;R21                       xFFFFFFFF8A914360 &lt;BR /&gt;&lt;BR /&gt;R22                       x00000000FFFFFFFF &lt;BR /&gt;&lt;BR /&gt;R23                       x0000000000000064 &lt;BR /&gt;&lt;BR /&gt;R24                       x0000000000000001 &lt;BR /&gt;&lt;BR /&gt;R25                       x0000000000000001 &lt;BR /&gt;&lt;BR /&gt;R26                       xFFFFFFFF8049A200 &lt;BR /&gt;&lt;BR /&gt;R27                       xFFFFFFFF8A90DEA8 &lt;BR /&gt;&lt;BR /&gt;R28                       xFFFFFFFF8049B354 &lt;BR /&gt;&lt;BR /&gt;FP                        x000000007FF8BC80 &lt;BR /&gt;&lt;BR /&gt;SP                        x0041474D24415445 &lt;BR /&gt;&lt;BR /&gt;PC                        xFFFFFFFF804A3550 &lt;BR /&gt;&lt;BR /&gt;PS                        x0000000000000009&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Last year, our support company suggested this may have been caused by PIOPAGES being set too low, so we upped it to 2000. As you can see, it hasn't helped so far.&lt;BR /&gt;&lt;BR /&gt;Rob.</description>
      <pubDate>Wed, 16 Jan 2008 13:38:51 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088089#M25197</guid>
      <dc:creator>Robert Atkinson</dc:creator>
      <dc:date>2008-01-16T13:38:51Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088090#M25198</link>
      <description>&lt;BR /&gt;Ah, but that's an RMS Bugcheck.&lt;BR /&gt;&lt;BR /&gt;Specifically&lt;BR /&gt;&lt;BR /&gt;FFFFFFFA == ASBALLFAIL  == couldn't allocate an asb (rm0stall)&lt;BR /&gt;&lt;BR /&gt;RMS ran out of stack more or less.&lt;BR /&gt;&lt;BR /&gt;Are you up to date on OpenVMS and patches?&lt;BR /&gt;&lt;BR /&gt;RMS crashed zap the process.&lt;BR /&gt;For full analysis you may have to run with SYSGEN BUGCHECK_FATAL = 1.&lt;BR /&gt;This will crash the syste, on an RMS bugcheck.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;more later... gotta run&lt;BR /&gt;&lt;BR /&gt;Hein.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 16 Jan 2008 14:30:18 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088090#M25198</guid>
      <dc:creator>Hein van den Heuvel</dc:creator>
      <dc:date>2008-01-16T14:30:18Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088091#M25199</link>
      <description>&amp;gt; Are you up to date on OpenVMS and patches?&lt;BR /&gt;&lt;BR /&gt;Yes, VMS 7.3-2 fully patched - we will probably be on VMS 8.3 when this comes around next year.&lt;BR /&gt;&lt;BR /&gt;BUGCHECKing the system is probably not an option, as it happens at our busiest time of the year - we're a book distributor!&lt;BR /&gt;&lt;BR /&gt;Rob.</description>
      <pubDate>Wed, 16 Jan 2008 14:33:33 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088091#M25199</guid>
      <dc:creator>Robert Atkinson</dc:creator>
      <dc:date>2008-01-16T14:33:33Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088092#M25200</link>
      <description>&lt;BR /&gt;Robert,&lt;BR /&gt;&lt;BR /&gt;   Do the programs process future data times?  Is it possible that at 7:45 AM they are processing data associated with the following year for the first time?&lt;BR /&gt;   If you're calculating something based upon differences in time (that effects memory allocation or some other resource allocation), and you get a negative number because the program is just using month and day and not paying attention to year, I can see a problem that would occur every late December.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 16 Jan 2008 14:52:24 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088092#M25200</guid>
      <dc:creator>Gregg Parmentier</dc:creator>
      <dc:date>2008-01-16T14:52:24Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088093#M25201</link>
      <description>The prgramming language takes care of dates/times, so that wouldn't cause a problem, plus it wouldn't explain why the error can occurr in DCL.</description>
      <pubDate>Wed, 16 Jan 2008 15:03:48 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088093#M25201</guid>
      <dc:creator>Robert Atkinson</dc:creator>
      <dc:date>2008-01-16T15:03:48Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088094#M25202</link>
      <description>It doen't have to be DCL. SYS$DELPRC - something stopped the process:&lt;BR /&gt;&lt;BR /&gt;Process ID x007E0097 &lt;BR /&gt;&lt;BR /&gt;Process Name GBSINVRUN &lt;BR /&gt;&lt;BR /&gt;A few thoughts:&lt;BR /&gt;&lt;BR /&gt;I think accounting won't tell more, just DELPRC. If a RMS bugcheck forces process deletion using SYS$DELPRC, it's nasty if that information is lost....&lt;BR /&gt;&lt;BR /&gt;Out of stack + busiest time of the year: If it is an RMS issue, I think it might be something in the program causing the stack to get exhausted, where 'normal' load willbe within limits. Too many concureent reads/writes, perhaps? Too many channels/files open?</description>
      <pubDate>Fri, 18 Jan 2008 12:55:08 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088094#M25202</guid>
      <dc:creator>Willem Grooters</dc:creator>
      <dc:date>2008-01-18T12:55:08Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088095#M25203</link>
      <description>Willem, I think you misunderstood my last comment.&lt;BR /&gt;&lt;BR /&gt;What I meant was that the failure can happen inside an application program and whilst the batch job is running DCL commands, so it's probably nothing to do with the application.&lt;BR /&gt;&lt;BR /&gt;Rob.</description>
      <pubDate>Fri, 18 Jan 2008 12:59:32 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088095#M25203</guid>
      <dc:creator>Robert Atkinson</dc:creator>
      <dc:date>2008-01-18T12:59:32Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088096#M25204</link>
      <description>Agreed - a batch process runs a DCL procedure - but the procedure can run an image. The problem could happen in the image, causing the process to be deleted (to prevent even more damage). The message is clear enough: "forced exit of image or process". So it might well be the problem is within the image, ifthis RMS bugcheck kills the process.&lt;BR /&gt;&lt;BR /&gt;Mu thought is that if this happens in a top-load period, AND it seems to be something related to RMS, there is some issue in either the DCL code or the image concerning file access. IMHO, date or time are no issues here, assuming your statement that the progream does handle future times properly.</description>
      <pubDate>Fri, 18 Jan 2008 13:16:22 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088096#M25204</guid>
      <dc:creator>Willem Grooters</dc:creator>
      <dc:date>2008-01-18T13:16:22Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088097#M25205</link>
      <description>I read your remark on DCL WAIT when it happened.... Weird, indeed.</description>
      <pubDate>Fri, 18 Jan 2008 13:20:51 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088097#M25205</guid>
      <dc:creator>Willem Grooters</dc:creator>
      <dc:date>2008-01-18T13:20:51Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088098#M25206</link>
      <description>Rob,&lt;BR /&gt;&lt;BR /&gt;Perhaps you weren't telling us all you knew about the problem at the start to see if anyone could come up with an alternate possibility than what your support organization suggested.&lt;BR /&gt;&lt;BR /&gt;However, now that the RMS BUGCHECK is out of the bag, can you please tell us more?&lt;BR /&gt;&lt;BR /&gt;Timestamps provide good circumstantial evidence that the termination of the GBSINVRUN job and the RMS BUGCHECK were related.&lt;BR /&gt;&lt;BR /&gt;As Hein stated, an RMS BUGCHECK will definitely kill the process that detected the inconsistency.  &lt;BR /&gt;&lt;BR /&gt;Was there an RMS BUGCHECK at the time the process in a DCL WAIT was deleted?  (BTW, how did you determine that was the state of the process?  WAIT is a CLIROUTINE, and as such doesn't have an image associated with it.)  What other events are showing up in the errlog?&lt;BR /&gt;&lt;BR /&gt;What changed 3 years ago?   Is that when you upgraded to the ES45?  &lt;BR /&gt;&lt;BR /&gt;Are you running T4 or some other system data collection software?&lt;BR /&gt;&lt;BR /&gt;Issues caused by lack of proper synchronization are most likely to become evident when the system is busy, so it would not surprise me if the underlying cause is synchronization related.  Unfortunately, these are in general not easy problems to debug, as often the detection of the corrupted data isn't discovered until later, and all VMS can do when it finds the inconsistency is to bugcheck.  The point being that the process that detects the inconsistency may not be related to the cause.  Since you don't want to allow a crash, is there a time that you can stress the system with BUGCHECKFATAL set to 1?&lt;BR /&gt;&lt;BR /&gt;Is there other software involved you are neglecting to tell us about?  Are you using RMS Journaling, Rdb, some third party remote journaling or shadowing product, etc.  Especially anything that does stuff in exec or kernel mode.&lt;BR /&gt;&lt;BR /&gt;If these are related to Exec mode bugchecks, I am not sure auditing is going to tell you much.  Definitely won't hurt,  &lt;BR /&gt;&lt;BR /&gt;Jon&lt;BR /&gt;</description>
      <pubDate>Fri, 18 Jan 2008 22:54:58 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088098#M25206</guid>
      <dc:creator>Jon Pinkley</dc:creator>
      <dc:date>2008-01-18T22:54:58Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088099#M25207</link>
      <description>&amp;gt; how did you determine that was the state of the process&lt;BR /&gt;&lt;BR /&gt;I can see from the batch logfile that the process was at a WAIT statement at the same time the BUGCHECK ocurred.&lt;BR /&gt;&lt;BR /&gt;As far as I'm aware, nothing in particular changed 3 years ago. We we're running HSG80/ES40 when this first ocurred. We copied the system disk over to the new hardware, so if the problem is VMS related, it would have been moved over.&lt;BR /&gt;&lt;BR /&gt;Although it is the end of the year this happens, the system is no busier than at any other point in time. The end of November is our peak processing point, so if it were load related, this is when I'd expect problems.&lt;BR /&gt;&lt;BR /&gt;We have T4 installed, but I'm not sure what data it's collecting.&lt;BR /&gt;&lt;BR /&gt;I checked the timing of the previous errors against this year, and some happen before our yearend processing, and some after, so that seems to rule out a rogue yearend program.&lt;BR /&gt;&lt;BR /&gt;My main query with the group was to see if anyone else had seen anything like this at yearend, which seems to not be the case. I'm glad of everyones input here, especially the audit suggestions, so I'm happy to close this unless anyone would like to make more comments/suggestions?&lt;BR /&gt;&lt;BR /&gt;Rob.</description>
      <pubDate>Mon, 21 Jan 2008 09:24:57 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088099#M25207</guid>
      <dc:creator>Robert Atkinson</dc:creator>
      <dc:date>2008-01-21T09:24:57Z</dc:date>
    </item>
    <item>
      <title>Re: Processes Mysteriously Being Deleted</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088100#M25208</link>
      <description>Rob,&lt;BR /&gt;&lt;BR /&gt;RE:"I can see from the batch logfile that the process was at a WAIT statement at the same time the BUGCHECK ocurred."&lt;BR /&gt;&lt;BR /&gt;Does that mean that the logfile ended with something like:&lt;BR /&gt;&lt;BR /&gt;$ WAIT 00:05:00&lt;BR /&gt;&lt;BR /&gt;and that was the last line in the file?&lt;BR /&gt;&lt;BR /&gt;Much more conclusive would be if &lt;BR /&gt;&lt;BR /&gt;$ SET PREFIX "(!8%T) "&lt;BR /&gt;&lt;BR /&gt;was in effect and you have something like:&lt;BR /&gt;&lt;BR /&gt;(23:03:18) $ WAIT 00:05:00&lt;BR /&gt;&lt;BR /&gt;and you knew the bugcheck occurred at 23:04:39, i.e. before the wait had expired.&lt;BR /&gt;&lt;BR /&gt;I would not be surprised if the logfile cannot be trusted to have flushed the buffers, because that's an RMS function, and there was an RMS bugcheck.   Hein or Volker will know.  My point is that I think there is a good possibility that code after the WAIT had completed was actually executing at the time of the Bugcheck.&lt;BR /&gt;&lt;BR /&gt;Is every one of these unexplained process deletion events paired with an RMS Bugcheck?&lt;BR /&gt;If so, you really need to try to determine what the cause is.  Making the EXEC mode Bugcheck fatal will give you the best chance of being able to determine the cause, but before you do that, make 100% sure your system is setup to be able to save the crash dump.  It would be a waste to take the crash but not end up with a valid dump.&lt;BR /&gt;&lt;BR /&gt;RE:"Although it is the end of the year this happens, the system is no busier than at any other point in time. The end of November is our peak processing point, so if it were load related, this is when I'd expect problems."  &lt;BR /&gt;&lt;BR /&gt;I was just going by your statement that you couldn't allow an avoidable crash because it was the busiest time of the year.  Perhaps it is all the activity in the files at the end of Nov that is causing the files (I am assuming indexed files here) to be badly tuned, although that in itself shouldn't cause a Bugcheck.  I am not familiar enough with RMS internals or decoding the errorlog contents to be able to know if anything useful can be obtained from your errorlog entries.  Hein was able to derive some info, how much more is possible, I have no idea.&lt;BR /&gt;&lt;BR /&gt;RE: "We have T4 installed, but I'm not sure what data it's collecting."   &lt;BR /&gt;&lt;BR /&gt;Having it installed without collecting data is possible, but not very useful.  The default collection command procedure will collect a lot of useful info, but the raw data normally gets deleted after a relatively short period, so it is questionable if you still have the raw monitor.dat file.&lt;BR /&gt;&lt;BR /&gt;RE:"I checked the timing of the previous errors against this year, and some happen before our yearend processing, and some after, so that seems to rule out a rogue yearend program."  &lt;BR /&gt;&lt;BR /&gt;Well it is possible that the fact these only happen at year end is a coincidence, but I would at least look for something that is triggering the bugchecks.&lt;BR /&gt;&lt;BR /&gt;Is it always the same job that is affected, of just random batch jobs.  If more than one job is affected, are there common files the jobs are using?&lt;BR /&gt;&lt;BR /&gt;Jon</description>
      <pubDate>Mon, 21 Jan 2008 11:05:03 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/processes-mysteriously-being-deleted/m-p/5088100#M25208</guid>
      <dc:creator>Jon Pinkley</dc:creator>
      <dc:date>2008-01-21T11:05:03Z</dc:date>
    </item>
  </channel>
</rss>

