<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Looping installed image in Operating System - OpenVMS</title>
    <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653714#M40592</link>
    <description>&amp;gt; Spoke to our DBA (that's Adam) &lt;BR /&gt;&lt;BR /&gt;And I'm sure you used the term loosely ;-)&lt;BR /&gt;&lt;BR /&gt;&amp;gt; who tells me that the looping process started after he extended &lt;BR /&gt;&amp;gt; the statement database so as to accomodate a further 5 years of data&lt;BR /&gt;&lt;BR /&gt;This is normally what I'd refer to as the "Ta-Dah!" moment. I can't say for sure that this is the cause of your problem but I'd certanly budget a few hours investigation on the strength of STM(N?)T_DB being verballed! (Statement limits? Statement Line Limits?) Having said that I believe you only show/retrieve one month at a time so I guess it shouldn't be a new problem.&lt;BR /&gt;&lt;BR /&gt;&amp;gt; - but if this was the problem I would expect to see issues through-out&lt;BR /&gt;&amp;gt; the day and not just after 6pm (there have not been any looping &lt;BR /&gt;&amp;gt; processes for the past 2 days so I have not been able to get anymore&lt;BR /&gt;&amp;gt; PC details).&lt;BR /&gt;&lt;BR /&gt;Yes and no. That's like saying "Why doesn't it happen at every 6pm?". IIRC you audit the query every user does with the option and the key value, so the OSIP for Fred User at 6:00pm might have the account number detail to check and attempt a reproducer? &lt;BR /&gt;&lt;BR /&gt;Then again being in the middle of a query and clicking |X| should be easy to test as well. Are there sometimes response problems and the user getting fed up waiting at knock-off time? (But statement retrieval was by design, and I'm sure still is, lightening fast.) Having said that, I do remember all those Rdb Freeze-Locks and Recovery Processes from other ACMS options. I'm sure Adam is "monitoring".&lt;BR /&gt;&lt;BR /&gt;Possible scenario: -&lt;BR /&gt;&lt;BR /&gt;1) User does a query&lt;BR /&gt;2) Database response poor as Rdb grabs a freeze lock and attempts to rollback txn from other impatient user.&lt;BR /&gt;3) User 1 gets fed up and shutsdown&lt;BR /&gt;4) When the process comes back FDV calls start failing due to lack of terminal&lt;BR /&gt;5) Loop (Why hasn't ACMS killed the process or exited the image?)&lt;BR /&gt;&lt;BR /&gt;&amp;gt; We have a process that runs every hour that does an ACMS/CANCEL &lt;BR /&gt;&amp;gt; on users that are inactive &lt;BR /&gt;&lt;BR /&gt;Yep, that's one way to do it.&lt;BR /&gt;&lt;BR /&gt;&amp;gt; I think the issue is related to FMS and returned statuses not being&lt;BR /&gt;&amp;gt; checked correctly - I have a vague recollection of seeing a looping &lt;BR /&gt;&amp;gt; bug many years ago when the workspace loaded into the FMS form was&lt;BR /&gt;&amp;gt; larger than the expected value (i.e. the COIN select field was &lt;BR /&gt;&amp;gt; 'set up' and returned to calling program). &lt;BR /&gt;&lt;BR /&gt;Certainly sounds plausible. Also 6:00pm does look suspiciously like a knock-off time and, as Hein suggested, something upsetting the UI at close. &lt;BR /&gt;&lt;BR /&gt;Have to wait until it happens again I suppose.&lt;BR /&gt;&lt;BR /&gt;Just a thought but, if you are rebuilding, it could be worth putting /CHECK=(bounds or all) Could also set that FDV$set_errors_to_signal call but there's probably half a dozen other errors that should be ligitimately ignored that would end up running amoc with a COBOL program not ready for errors to be signalled.&lt;BR /&gt;&lt;BR /&gt;Good-hunting!&lt;BR /&gt;&lt;BR /&gt;Cheers Richarde Maher&lt;BR /&gt;&lt;BR /&gt;PS. Please advise Zorba that he can thank me for the ability to simply add a few storage areas and then do an ALTER STORAGE MAP (with the mixed page format, clustered parent/child set-up, if it helps anybody?)&lt;BR /&gt;&lt;BR /&gt;PPS. Only 5 more years room? How many times have we seen/heard "COIN will be dead in 2 years!" before?&lt;BR /&gt;&lt;BR /&gt;PPPS. Hope Jae and the kids are well and you're as happy as I am that the World Cup, Wimbledon, and Tour de Boredom have wiped cricket off the map!</description>
    <pubDate>Fri, 02 Jul 2010 22:46:02 GMT</pubDate>
    <dc:creator>Richard J Maher</dc:creator>
    <dc:date>2010-07-02T22:46:02Z</dc:date>
    <item>
      <title>Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653687#M40565</link>
      <description>I have an installed image running under ACMS (DCL server with dynamic username) that intermitently loops (CPU bound not IO) and then needs to be stopped (killed).&lt;BR /&gt;&lt;BR /&gt;I have looked at the code and can see no obvious reason for the loop and need to know if there is any way to 'trace' (for want of a better word) where the program is looping.&lt;BR /&gt;&lt;BR /&gt;Note: Image is installed and runs against and RDB as an ACMS DCL server under dynamic username.&lt;BR /&gt;&lt;BR /&gt;Any help appreciated.&lt;BR /&gt;&lt;BR /&gt;OpenVMS V8.3&lt;BR /&gt;COBOL V2.8-1286&lt;BR /&gt;RDB 7.2-321&lt;BR /&gt;ACMS 5.1B&lt;BR /&gt;</description>
      <pubDate>Mon, 28 Jun 2010 13:01:05 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653687#M40565</guid>
      <dc:creator>Mick O'Brien</dc:creator>
      <dc:date>2010-06-28T13:01:05Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653688#M40566</link>
      <description>There are a number of methods for this, perhaps the simplest is to use SDA to examine the PCs of the looping process.  There is a trace utility within SDA, but I have found that it is too quick.  I usually create an X.COM file with 10-20 "exam PC" commands in it and set the SDA focus to the process that is looping.  Do an "@X.COM" to a file or the screen and map the PCs to the actual image.&lt;BR /&gt;&lt;BR /&gt;While little tedious, it gives you a gross idea of the code flow.  Once you narrow that part down, the PC trace utility can give you more details.&lt;BR /&gt;&lt;BR /&gt;Dan</description>
      <pubDate>Mon, 28 Jun 2010 13:11:04 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653688#M40566</guid>
      <dc:creator>abrsvc</dc:creator>
      <dc:date>2010-06-28T13:11:04Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653689#M40567</link>
      <description>&amp;gt;I have looked at the code and can see no obvious reason for the loop and need to know if there is any way to 'trace' (for want of a better word) where the program is looping.&lt;BR /&gt;&lt;BR /&gt;Run the application and collect the pc sampling data.&lt;BR /&gt;To Collect pc sampling:&lt;BR /&gt;$anal/sys&lt;BR /&gt;SDA&amp;gt;pcs load&lt;BR /&gt;SDA&amp;gt;pcs start trace&lt;BR /&gt;SDA&amp;gt;pcs stop trace&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;Bhadresh&lt;BR /&gt;</description>
      <pubDate>Mon, 28 Jun 2010 13:16:10 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653689#M40567</guid>
      <dc:creator>Bhadresh</dc:creator>
      <dc:date>2010-06-28T13:16:10Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653690#M40568</link>
      <description>Hi Mark,&lt;BR /&gt;&lt;BR /&gt;&amp;gt;&amp;gt; I have looked at the code and can see no obvious reason for the loop and&lt;BR /&gt;&amp;gt;&amp;gt; need to know if there is any way to 'trace' (for want of a better word)&lt;BR /&gt;&amp;gt;&amp;gt; where the program is looping.&lt;BR /&gt;Yes, there is.&lt;BR /&gt;You can make use of the PC sampling feature in order to find out what the&lt;BR /&gt;looping process is doing. From the PC sampling output, you should see some&lt;BR /&gt;set of PC's getting repeatedly logged in case of a loop in the program.&lt;BR /&gt;Mapping such a PC back to your source code would tell you where in the&lt;BR /&gt;program you have the loop.&lt;BR /&gt;&lt;BR /&gt;PC Sampling Usage -&lt;BR /&gt;$ ANALYZE/SYSTEM&lt;BR /&gt;SDA&amp;gt; PCS LOAD&lt;BR /&gt;SDA&amp;gt; PCS START TRACE&lt;BR /&gt;... wait for some time ...&lt;BR /&gt;SDA&amp;gt; PCS STOP TRACE&lt;BR /&gt;SDA&amp;gt; SET OUTPUT PC.DAT&lt;BR /&gt;SDA&amp;gt; PCS SHOW TRACE&lt;BR /&gt;SDA&amp;gt; PCS UNLOAD&lt;BR /&gt;SDA&amp;gt; EXIT&lt;BR /&gt;&lt;BR /&gt;Then analyze the PC.DAT file for PC's that are getting repeatedly logged.&lt;BR /&gt;&lt;BR /&gt;If you know the PID of the process, you can narrow down the PC sampling by&lt;BR /&gt;making the PCS to sample PC's corresponding to a particular process.&lt;BR /&gt;&lt;BR /&gt;i.e. In the above command,&lt;BR /&gt;SDA&amp;gt; PCS START TRACE --&amp;gt; Traces all PC's of all process&lt;BR /&gt;&lt;BR /&gt;Instead use&lt;BR /&gt;SDA&amp;gt; PCS START TRACE/PID=XXX --&amp;gt; Traces all PC's of PID XXX&lt;BR /&gt;&lt;BR /&gt;This would cause the PCS to sample PC's for only the process with specified PID.&lt;BR /&gt;&lt;BR /&gt;Hope this helps.&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;Murali</description>
      <pubDate>Mon, 28 Jun 2010 13:30:27 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653690#M40568</guid>
      <dc:creator>P Muralidhar Kini</dc:creator>
      <dc:date>2010-06-28T13:30:27Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653691#M40569</link>
      <description>Murali,&lt;BR /&gt;&lt;BR /&gt;Next time the process loops I will use the code you supplied - very clearly explained.  Once I get the PC trace file I will no doubt be asking more questions.&lt;BR /&gt;&lt;BR /&gt;Mick</description>
      <pubDate>Mon, 28 Jun 2010 13:36:17 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653691#M40569</guid>
      <dc:creator>Mick O'Brien</dc:creator>
      <dc:date>2010-06-28T13:36:17Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653692#M40570</link>
      <description>Hi Mark,&lt;BR /&gt;&lt;BR /&gt;Also, for more information on the VMS SDA Extensions, Refer&lt;BR /&gt;* OpenVMS SDA Extensions&lt;BR /&gt;&lt;A href="http://www.connect-community.de/Events/OpenVMS2009/folien/05-sda_extensions.html" target="_blank"&gt;http://www.connect-community.de/Events/OpenVMS2009/folien/05-sda_extensions.html&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;For PCS sampling related information, in the above link, refer section&lt;BR /&gt;"PC Sampling Utility PCS commands:"&lt;BR /&gt;&lt;BR /&gt;It also has a example of PCS sampling usage with corresponding output.&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;Murali</description>
      <pubDate>Mon, 28 Jun 2010 13:36:18 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653692#M40570</guid>
      <dc:creator>P Muralidhar Kini</dc:creator>
      <dc:date>2010-06-28T13:36:18Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653693#M40571</link>
      <description>With respect to the other respondents and their literal (and correct) replies here, the PC stuff (in conjunction with the link maps and compiler listings) will get you into the general area of the loop.   &lt;BR /&gt;&lt;BR /&gt;Which is interesting.   &lt;BR /&gt;&lt;BR /&gt;But not entirely useful.&lt;BR /&gt;&lt;BR /&gt;While the tools cited are all functional, it's also easily feasible to snag a few PCs via a simple SHOW PROCESS /CONTINUOUS command.  &lt;BR /&gt;&lt;BR /&gt;A half-dozen PCs are often enough data.  Particularly if you see a repeated PC or two somewhere in P0 space, then you're usually ready to proceed. &lt;BR /&gt;&lt;BR /&gt;See where those addresses exist within the application. &lt;BR /&gt;&lt;BR /&gt;How to get to the source code from a virtual address?  How to translate from those PCs you have?  The sequence is described here:&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://labs.hoffmanlabs.com/node/800" target="_blank"&gt;http://labs.hoffmanlabs.com/node/800&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;If that source code not pointing to an obvious trigger, then you can use more specific tools and techniques.  In particular, run this application under the debugger.  Yes, you can debug detached processes.  Here's how:&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://labs.hoffmanlabs.com/node/803" target="_blank"&gt;http://labs.hoffmanlabs.com/node/803&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;You won't be able to debug an installed image, but you can set up the same privileged context for the detached process and run without the INSTALL. &lt;BR /&gt;&lt;BR /&gt;If necessary, you can program the debugger if you can identify a trigger but not its cause.  The debugger can then run in the background, waiting for the initial conditions for the loop to be met, and you'll then have some visibility into the run-up to the trigger.&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://labs.hoffmanlabs.com/node/848" target="_blank"&gt;http://labs.hoffmanlabs.com/node/848&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;If you're having difficulty with spotting the trigger with the debugger and with the debugger-level programming, then you can also build the application with the debugger activated in a signal handler, and send the debug signal over.&lt;BR /&gt;&lt;BR /&gt;And if you're using the word "killed" or analogous in conjunction with this effort, then consider switching techniques and using the process dump mechanisms.   Dial back the brute-force setting slightly.   The debugging sequence and the preference for creating process dumps or crashdumps is analogous to using the &amp;gt;&amp;gt;&amp;gt; BOOT rather than the SRM crashdump command; sure, the immediate freak-out is fixed, but these tend to arise anew, and, well, you can choose to repeat the &amp;gt;&amp;gt;&amp;gt; BOOT or you can write a dump and go looking.  In other words, capture a dump or other evidence, and +then+ restart.&lt;BR /&gt;&lt;BR /&gt;As for how, you can use the SET PROCESS /DUMP command, or toss the appropriate $forcex signal over.  &lt;BR /&gt;&lt;BR /&gt;That signal can be allowed to cause the dump, or can captured by a signal handler.  There's a simple signal handler (in C) here:&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://labs.hoffmanlabs.com/node/1438" target="_blank"&gt;http://labs.hoffmanlabs.com/node/1438&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;For typical non-trivial production applications, it's usually best to instrument the code, too.  To integrate debugging.&lt;BR /&gt;&lt;BR /&gt;And given the use of COBOL (which avoids many of the foibles of C or Macro or Bliss), my initial suspicion would be in ACMS and system service and RTL error handling; not checking the return status or IOSB, for instance.   Even so-called optional IOSB arguments aren't really optional; you should always specify the IOSB, ad always check the return status and the IOSB.  Otherwise, errors can tend to accrete, and get weird.&lt;BR /&gt;&lt;BR /&gt;The other and ancillary question would involve why the image is installed.  If it is installed for security (privilege) reasons, then subsystem identifiers can be a good alternative.&lt;BR /&gt;&lt;BR /&gt;Stephen Hoffman&lt;BR /&gt;HoffmanLabs LLC</description>
      <pubDate>Mon, 28 Jun 2010 14:49:41 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653693#M40571</guid>
      <dc:creator>Hoff</dc:creator>
      <dc:date>2010-06-28T14:49:41Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653694#M40572</link>
      <description>I start out with a quick MONI MODE... all USER? EXEC?&lt;BR /&gt;&lt;BR /&gt;PC Samples are typically highly useful as others replied. I uses SHOW PROC/CONT, SDA&amp;gt; PCS, or SDA&amp;gt; PRF as I see fit. &lt;BR /&gt;But for for my (COBOL) customer the top hits are often in OTS$mumblfratz routines. So be sure to look for infrequenm but lower PC values to find out where in the code the program was when it call the OTS helpers.&lt;BR /&gt;&lt;BR /&gt;Or, just cut out the (PC trace) middle man and try to find out how the process got there.&lt;BR /&gt;How? $ ANALYZE / SYSTEM ... SET PROC ACMSxxxSPxx  ... SHOW STAC/USER/SUMM&lt;BR /&gt;&lt;BR /&gt;Repeat 2 - 5 times. Pattern?&lt;BR /&gt;&lt;BR /&gt;Good luck!&lt;BR /&gt;Hein&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 29 Jun 2010 11:53:29 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653694#M40572</guid>
      <dc:creator>Hein van den Heuvel</dc:creator>
      <dc:date>2010-06-29T11:53:29Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653695#M40573</link>
      <description>&amp;gt;&amp;gt; my initial suspicion would be in ACMS &lt;BR /&gt;&lt;BR /&gt;No way. In this setup ACMS only launches.&lt;BR /&gt;&lt;BR /&gt;But it reminded me... maybe you can use :&lt;BR /&gt;$ ACMS /DEBUG /SERVER ACMSxxxSPxxx&lt;BR /&gt;&lt;BR /&gt;With that you can break in to a running ACMS SP process and the DBG$INPUT and DBG$OUTPUT will be managed for you.&lt;BR /&gt;&lt;BR /&gt;I happen to have used it yesterday, but for a procedure server process, on a DCL server. I needed it because ACMS V5.1 on Itanium does not support its normal ACMS/DEBUG and we had (have!) and issue with TDMS not returning filled fields for a specific request with a long input list. Using %ALL worked.&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://h71000.www7.hp.com/doc/721final/6607/6607pro_017.html" target="_blank"&gt;http://h71000.www7.hp.com/doc/721final/6607/6607pro_017.html&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Hein&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 29 Jun 2010 12:04:36 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653695#M40573</guid>
      <dc:creator>Hein van den Heuvel</dc:creator>
      <dc:date>2010-06-29T12:04:36Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653696#M40574</link>
      <description>&amp;gt;&amp;gt; my initial suspicion would be in ACMS &lt;BR /&gt;&lt;BR /&gt;Might have misread the "and"?   I was pointing to the potential for errors in the error handling from all the wheels that are spinning here.  Not to ACMS itself.&lt;BR /&gt;</description>
      <pubDate>Tue, 29 Jun 2010 12:30:56 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653696#M40574</guid>
      <dc:creator>Hoff</dc:creator>
      <dc:date>2010-06-29T12:30:56Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653697#M40575</link>
      <description>Abrsvc , Bhadresh , Murali, Stephen (Hoff)&lt;BR /&gt;&lt;BR /&gt;I had another looping process last night and did as suggested and got out some PC details.  I noticed that there were a lot of calls to FDVSHR (sorry forgot to mention process uses FMS) which seems to indicate that process is hung up on a screen somewhere.  These looping processes only happen in the evening but when I talked to looping process owner this morning I was advised that they definitely logged out before going home.  Ignoring the end-users comments I was wondering whether a default timeout on an FDV$GETAL call is not being handled correctly within code thus causing the CPU loop - I looked for an FDV$STIME but could not find one.  Does anyone know if FDV$GETAL has a default timeout?&lt;BR /&gt;&lt;BR /&gt;Hein,&lt;BR /&gt;&lt;BR /&gt;I like the idea of sticking the server into debug but I suspect I would not be allowed to run debug on a production server within SLA (service line agreement) just in case it brings the rest of the application down!  Instead next time I'll do what you suggested to "...try to find out how the process got there."&lt;BR /&gt;&lt;BR /&gt;Mick</description>
      <pubDate>Tue, 29 Jun 2010 12:45:54 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653697#M40575</guid>
      <dc:creator>Mick O'Brien</dc:creator>
      <dc:date>2010-06-29T12:45:54Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653698#M40576</link>
      <description>Without seeing the actual PCs involved and looking at the code, I can sat this much:&lt;BR /&gt;&lt;BR /&gt;I ran into a problem where a mis-handled error condition would cause the SMG routines to loop forever in a COM state.  The problem in initially tracking this down was that it occurred during a process shutdown.  At the time of the loop, the majority of the process context was broken down.  This means that "normal" debugging techniques were useless.&lt;BR /&gt;&lt;BR /&gt;Check for return status checks that don't handle all status possibilities that might corrupt some buffers. &lt;BR /&gt;&lt;BR /&gt;If you can post parts of the listings here with the PC values, we can perhaps provide some specific suggestions.&lt;BR /&gt;&lt;BR /&gt;Dan</description>
      <pubDate>Tue, 29 Jun 2010 12:55:42 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653698#M40576</guid>
      <dc:creator>abrsvc</dc:creator>
      <dc:date>2010-06-29T12:55:42Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653699#M40577</link>
      <description>Hi Mick,&lt;BR /&gt;&lt;BR /&gt;&amp;gt;&amp;gt; I noticed that there were a lot of calls to FDVSHR (sorry forgot to&lt;BR /&gt;&amp;gt;&amp;gt; mention process uses FMS) &lt;BR /&gt;Now that you got the PC sample, the intresting thing would be to know the&lt;BR /&gt;source code to which the repeated PC values map.&lt;BR /&gt;Are you seeing some set of PC values getting repeatedly logged in the&lt;BR /&gt;PC sampling output ??&lt;BR /&gt;If Yes, to which source code line do those PC values map to ??&lt;BR /&gt;&lt;BR /&gt;This would give a idea as to where exactly in your source code there is a loop.&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;Murali</description>
      <pubDate>Tue, 29 Jun 2010 13:06:13 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653699#M40577</guid>
      <dc:creator>P Muralidhar Kini</dc:creator>
      <dc:date>2010-06-29T13:06:13Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653700#M40578</link>
      <description>Hi Mike,&lt;BR /&gt;&lt;BR /&gt;&amp;gt;&amp;gt; Does anyone know if FDV$GETAL has a default timeout?&lt;BR /&gt;FDV$STIME can be used to specify timeout value but you said that you did not&lt;BR /&gt;find that in your code.&lt;BR /&gt;&lt;BR /&gt;Also, Refer the following link -&lt;BR /&gt;&lt;A href="http://h71000.www7.hp.com/doc/73final/6619/6619pro_004.html" target="_blank"&gt;http://h71000.www7.hp.com/doc/73final/6619/6619pro_004.html&lt;/A&gt;&lt;BR /&gt;-&amp;gt; Section - 5.3.14.2 DECforms Timeout Values&lt;BR /&gt;&lt;BR /&gt;Check if thats applicable in your case.&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;Murali</description>
      <pubDate>Tue, 29 Jun 2010 13:35:06 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653700#M40578</guid>
      <dc:creator>P Muralidhar Kini</dc:creator>
      <dc:date>2010-06-29T13:35:06Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653701#M40579</link>
      <description>I suspect you'll have to roll up your sleeves and compile with debugging enabled and launch the debugger and isolate the error.  &lt;BR /&gt;&lt;BR /&gt;Either directly.  &lt;BR /&gt;&lt;BR /&gt;Or by instrumenting the code.&lt;BR /&gt;&lt;BR /&gt;Or both.&lt;BR /&gt;&lt;BR /&gt;The virtual addresses that other replies discuss will usually help you find the loop, but there's still work to be done in the source code to isolate the trigger for the loop.&lt;BR /&gt;&lt;BR /&gt;I've seen all manner of network and lower-level errors triggering this sort of misbehavior, and those can lead application code into all manner of dark corners.  &lt;BR /&gt;&lt;BR /&gt;And yes, application errors are most definitely entirely in play here, too.  &lt;BR /&gt;&lt;BR /&gt;Old code often has issues with its error handling.  The old stuff probably once worked fine, but just wasn't designed for modern systems and modern networks and modern problems.&lt;BR /&gt;&lt;BR /&gt;Folks don't necessarily think about how much stuff has actually changed within the whole of the stack, and what that means for the applications.  &lt;BR /&gt;&lt;BR /&gt;As a testament to the degree of compatibility that has been achieved here, comparatively simple user applications that once used FMS to chat with VT100 or VT220 terminals are now using various wholly new intermediate layers out a telnet connection to some random terminal emulation on some other operating system, and you can be assured that there are whole zoos filled with the errors that can now arise here.  &lt;BR /&gt;&lt;BR /&gt;Networking in particular offers a well-populated zoo.&lt;BR /&gt;&lt;BR /&gt;How much old COBOL code ever was ever implemented with the intermittent connectivity and with the requirements for reconnections and refreshes that can arise with mobile IP networking?&lt;BR /&gt;&lt;BR /&gt;There's no magic bullet here.  The PC is just the jumping off point.  You're just going to have to isolate and debug your code.&lt;BR /&gt;&lt;BR /&gt;And yes, this could well be an error in lower-level code, but (in most any non-trivial application environment) you're generally left to have to prove that is the case.&lt;BR /&gt;</description>
      <pubDate>Tue, 29 Jun 2010 14:44:39 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653701#M40579</guid>
      <dc:creator>Hoff</dc:creator>
      <dc:date>2010-06-29T14:44:39Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653702#M40580</link>
      <description>The only problem using SDA for this is that the process is executing at the time. Fix that with SUSPEND:&lt;BR /&gt;&lt;BR /&gt;When a process is looping:&lt;BR /&gt;&lt;BR /&gt;$ SET PROCESS/SUSPEND/ID=looping-process&lt;BR /&gt;&lt;BR /&gt;Now use SDA to examine the process. See SHOW CALL, SHOW CALL/NEXT etc... to walk up the call stack. The first few frames will be related to the SUSPEND, but you should get some idea of where the process is looping. Sanity check with:&lt;BR /&gt;&lt;BR /&gt;$ SET PROCESS/RESUME&lt;BR /&gt;$ SET PROCESS/SUSPEND&lt;BR /&gt;&lt;BR /&gt;and repeat the traceback. Do this a few times to find the deepest routine common to all. That will contain the loop. &lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 29 Jun 2010 22:21:13 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653702#M40580</guid>
      <dc:creator>John Gillings</dc:creator>
      <dc:date>2010-06-29T22:21:13Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653703#M40581</link>
      <description>&lt;!--!*#--&gt;All,&lt;BR /&gt;&lt;BR /&gt;I got another looping process last night for the same image - I have done as suggested and got out a PC file and have attached.  I have also attached a link map BUT note that the map was generated recently on development and does not relate to the image on production.&lt;BR /&gt;&lt;BR /&gt;Hein,&lt;BR /&gt;&lt;BR /&gt;Tried to use command 'SHOW STAC/USER/SUMM' but it wants a range of memory locations?&lt;BR /&gt;John,&lt;BR /&gt;I'll give your suggested investigation route a try but I noticed from 'HELP' that command required a 'starting-address'?&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;Mick</description>
      <pubDate>Wed, 30 Jun 2010 06:53:17 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653703#M40581</guid>
      <dc:creator>Mick O'Brien</dc:creator>
      <dc:date>2010-06-30T06:53:17Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653704#M40582</link>
      <description>Mick,&lt;BR /&gt;&lt;BR /&gt;  That set of PC samples doesn't tell you much. The process is obviously doing stuff, but you're really not interested in anything below your own code, so ignore any of the samples in RTLs. The exceptions may be interesting. Perhaps something is failing, but your code doesn't notice?&lt;BR /&gt;&lt;BR /&gt;  Try SUSPENDing and examining the call stack. That should help localise the issue into your own code, somewhere you can check. Also see SDA&amp;gt; SHOW IMAGE to determine base addresses, and if you don't have a current MAP file, please get one! &lt;BR /&gt;&lt;BR /&gt;&amp;gt;but I noticed from 'HELP' that command &lt;BR /&gt;&amp;gt;required a 'starting-address'?&lt;BR /&gt;&lt;BR /&gt;  The starting address parameter is optional.&lt;BR /&gt;&lt;BR /&gt;From SDA use SHOW CALL:&lt;BR /&gt;&lt;BR /&gt;SDA&amp;gt; SHOW CALL&lt;BR /&gt;&lt;BR /&gt;this will show you the current call frame. To step to the frame of the caller:&lt;BR /&gt;&lt;BR /&gt;SDA&amp;gt; SHOW CALL/NEXT&lt;BR /&gt;&lt;BR /&gt;repeat until you reach the top.&lt;BR /&gt;&lt;BR /&gt;Minimally, take note of the return PC. There may be other things of interest. You can examine the instruction sequence leading up to the call with:&lt;BR /&gt;&lt;BR /&gt;SDA&amp;gt; EXAMINE/INSTR &lt;RETURN-PC&gt;-20;20&lt;BR /&gt;&lt;BR /&gt;SDA&amp;gt; SHOW CALL/SUMMARY&lt;BR /&gt;&lt;BR /&gt;gives one line per call frame, a bit like a traceback.&lt;/RETURN-PC&gt;</description>
      <pubDate>Wed, 30 Jun 2010 20:22:20 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653704#M40582</guid>
      <dc:creator>John Gillings</dc:creator>
      <dc:date>2010-06-30T20:22:20Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653705#M40583</link>
      <description>&amp;gt;&amp;gt;&amp;gt; SHOW STAC/USER/SUMM&lt;BR /&gt;&lt;BR /&gt;My bad. I mixed up SHOW CALL and SHOW STACK. Sorry.&lt;BR /&gt;&lt;BR /&gt;btw... Are you also using the full ACMS tasks with their (TDMS?) terminal request and procedure calls into server procedure images?&lt;BR /&gt;&lt;BR /&gt;The system you describe is Alpha right. Any Itaniums? Shoot me an Email ?&lt;BR /&gt;&lt;BR /&gt;Anyway...&lt;BR /&gt;&lt;BR /&gt;Thanks for the trace. &lt;BR /&gt;You gave the raw one, not the statistics.&lt;BR /&gt;&lt;BR /&gt;Here is the 'top 20' + user image top 10...&lt;BR /&gt;&lt;BR /&gt;$ perl -ne "$pc{$1}++ if / U [0-9A-F]+ (\S+)/; }{ for (sort {$pc{$b}&amp;lt;=&amp;gt;$pc{$a}} keys %pc){ print qq($pc&lt;BR /&gt;{$_}\t$_\n) if /COIN/ or $i++&amp;lt;20}" pc.dat&lt;BR /&gt;&lt;BR /&gt;592     FDVSHR+3DB04&lt;BR /&gt;568     FDVSHR+377C4&lt;BR /&gt;541     FDVSHR+37680&lt;BR /&gt;537     FDVSHR+378F4&lt;BR /&gt;503     FDVSHR+377FC&lt;BR /&gt;503     FDVSHR+376CC&lt;BR /&gt;486     FDVSHR+40098&lt;BR /&gt;451     LIBOTS+2417C&lt;BR /&gt;435     MMG_STD$SWAP_PTBR_C+00838&lt;BR /&gt;394     FDVSHR+3DBD4&lt;BR /&gt;392     FDVSHR+3D9C4&lt;BR /&gt;371     MMG_STD$SWAP_PTBR_C+00840&lt;BR /&gt;342     FDVSHR+40070&lt;BR /&gt;293     FDVSHR+3DA84&lt;BR /&gt;263     MMG_STD$SWAP_PTBR_C+00830&lt;BR /&gt;196     FDVSHR+36C00&lt;BR /&gt;185     EXE$SYNCH_LOOP_C+00DF4&lt;BR /&gt;175     LIBOTS+24170&lt;BR /&gt;152     LIBOTS+20290&lt;BR /&gt;145     LIBRTL+76A0C&lt;BR /&gt;10      COIN_DCL_OSIP+7E294&lt;BR /&gt;8       COIN_DCL_OSIP+7C150&lt;BR /&gt;8       COIN_DCL_OSIP+7E448&lt;BR /&gt;7       COIN_DCL_OSIP+869C4&lt;BR /&gt;6       COIN_DCL_OSIP+7E2EC&lt;BR /&gt;6       COIN_DCL_OSIP+86940&lt;BR /&gt;5       COIN_DCL_OSIP+7E460&lt;BR /&gt;5       COIN_DCL_OSIP+86A54&lt;BR /&gt;5       COIN_DCL_OSIP+7E344&lt;BR /&gt;5       COIN_DCL_OSIP+7E258&lt;BR /&gt;5       COIN_DCL_OSIP+7BED0&lt;BR /&gt;4       COIN_DCL_OSIP+7C1A4&lt;BR /&gt;4       COIN_DCL_OSIP+7E240&lt;BR /&gt;4       COIN_DCL_OSIP+7E1D8&lt;BR /&gt;4       COIN_DCL_OSIP+86960&lt;BR /&gt;3       COIN_DCL_OSIP+7E2C4&lt;BR /&gt;3       COIN_DCL_OSIP+7E4C0&lt;BR /&gt;3       COIN_DCL_OSIP+7E280&lt;BR /&gt;3       COIN_DCL_OSIP+869A4&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Hein&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 30 Jun 2010 20:43:54 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653705#M40583</guid>
      <dc:creator>Hein van den Heuvel</dc:creator>
      <dc:date>2010-06-30T20:43:54Z</dc:date>
    </item>
    <item>
      <title>Re: Looping installed image</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653706#M40584</link>
      <description>Biggus Mickus,&lt;BR /&gt;&lt;BR /&gt;Would it be fair to say this code has been running for over 25 years without the looping behaviour? What's changed? New VMS version, layered products, itanium test? When was the last change to OSIP?&lt;BR /&gt;&lt;BR /&gt;DDoes it only happen after running normally for X hours? Quotas? Leaks? &lt;BR /&gt;&lt;BR /&gt;Over time has an internal COBOL array grown past its limits and is overwriting memory below? Number or transactions, no. of customers etc? &lt;BR /&gt;&lt;BR /&gt;WHile your waiting for the correct oscilliscope settings it might be worth just inspecting the code for CALLs to services that don't check the return status at all or, as Hoff mentioned, the iosb.&lt;BR /&gt;&lt;BR /&gt;What username do the DCL servers run under? (Ooops! I IIRC don't answer that :-( )&lt;BR /&gt;&lt;BR /&gt;Anyway it seemed time for clutching at straws :-)&lt;BR /&gt;&lt;BR /&gt;Cheers Richard Maher&lt;BR /&gt;&lt;BR /&gt;PS. It's very cold here :-(</description>
      <pubDate>Wed, 30 Jun 2010 22:02:15 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/looping-installed-image/m-p/4653706#M40584</guid>
      <dc:creator>Richard J Maher</dc:creator>
      <dc:date>2010-06-30T22:02:15Z</dc:date>
    </item>
  </channel>
</rss>

