<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: system crash in Operating System - OpenVMS</title>
    <link>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580129#M69558</link>
    <description>Hi,&lt;BR /&gt;&lt;BR /&gt;please try to report the instruction at PC = 891EB (or 891EA)&lt;BR /&gt;&lt;BR /&gt;$ ANAL/SYS SYS$SYSTEM:SYSDUMP.DMP&lt;BR /&gt;SDA&amp;gt; EXA/INS 891EB&lt;BR /&gt;SDA&amp;gt; EXA/INS 891EA&lt;BR /&gt;&lt;BR /&gt;CLUE reported the failing instruction at PC=000C09D4, could you please also examine&lt;BR /&gt;&lt;BR /&gt;SDA&amp;gt; EXA/INS C09d4&lt;BR /&gt;SDA&amp;gt; EXA/INS C09D3&lt;BR /&gt;&lt;BR /&gt;Volker.&lt;BR /&gt;</description>
    <pubDate>Tue, 12 Jul 2005 13:00:13 GMT</pubDate>
    <dc:creator>Volker Halle</dc:creator>
    <dc:date>2005-07-12T13:00:13Z</dc:date>
    <item>
      <title>system crash</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580126#M69555</link>
      <description>One my VAX machine hot standby(all processes remain in HIB mode) went down without any error message on operator log or application log. It was a application + vms crash. As per error log and clue file it is showing that it has failed to see duty machine, so it tried to come as new duty machine, but when it again saw a duty is there, it crashed itself. But, there was no network problem also. I am unable to understand the reason for it. This is 2nd time this machine has  gone down in similar situation. I am attaching clue file for your reference.&lt;BR /&gt;&lt;BR /&gt;Pls suggest..</description>
      <pubDate>Tue, 12 Jul 2005 02:20:09 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580126#M69555</guid>
      <dc:creator>Sk Noorul  Hassan</dc:creator>
      <dc:date>2005-07-12T02:20:09Z</dc:date>
    </item>
    <item>
      <title>Re: system crash</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580127#M69556</link>
      <description>Good Morning Sk...&lt;BR /&gt;&lt;BR /&gt;It would appear that this Halt_Restart is very similar to the crash submitted by Rajarshi Gupta, back on 01-Jul-2005. In fact, in both Clue-Listings, the node name is the same. (TGEV01)&lt;BR /&gt;&lt;BR /&gt;The K-Stk footprint is similar (but not exactly the same) in both of these crashes. My suspicion, based on the same node-name, and your statement-- "This is the 2nd time this machine has gone down in similar situation" is that both you and Rajarshi are trying to troubleshoot and isolate this problem.&lt;BR /&gt;&lt;BR /&gt;If my previous two paragraphs are correct, and this "IS" the same system/vax-4100A, then we may have to lean towards a hardware failure. I say this because the first Halt that was reported by Rajarshi occurred in the SYSTSG image at appproximately PC=7E07 or 7E08 (updated Pc reflected?); while your second Halt occurred at PC=891EB or 891EA (again not sure if the Halt-Restart-Bugcheck displays the Failing-PC or the Updated-PC) in the SYSDSK image. &lt;BR /&gt;&lt;BR /&gt;In other words, I would find it hard to believe that you have two (2) different executable images with the similar code-threads, that execute Halt instructions while in Kernel-Mode. It would make more sense that if the "same" system has crashed more than once, in different code-streams, that it is likely to be an internal IC-Chip failure (ALU/Mux/Shift-Reg) on the Vax Processor module.&lt;BR /&gt;&lt;BR /&gt;But if the crashes are occurring on two systems, then it is likely to be a problem that is common, but independent of the actual system-boxes. For example you mention that this system checks to see if there is a "duty-machine", and if not, then this system tries to become the "new duty machine". If there are multiple systems that each check for "duty-machine" (via a keep-alive-broadcast over the network?) and the network-concentrator/switch/hub does not forward the broadcast/multicast, you may have a network-filtering problem... &lt;BR /&gt;&lt;BR /&gt;Just a couple of thoughts, not sure if they help or not...&lt;BR /&gt;&lt;BR /&gt;  Thanx,&lt;BR /&gt; whynot3k&lt;BR /&gt;</description>
      <pubDate>Tue, 12 Jul 2005 11:26:22 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580127#M69556</guid>
      <dc:creator>Richard White_5</dc:creator>
      <dc:date>2005-07-12T11:26:22Z</dc:date>
    </item>
    <item>
      <title>Re: system crash</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580128#M69557</link>
      <description>EXE$GL_MEMERRS -&amp;gt; 800044F4  = 00000001&lt;BR /&gt;&lt;BR /&gt;would this suggest that you had one memory error somewhere sometime prior the crash.&lt;BR /&gt;&lt;BR /&gt;at least worth checking.&lt;BR /&gt;&lt;BR /&gt;_veli</description>
      <pubDate>Tue, 12 Jul 2005 11:50:55 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580128#M69557</guid>
      <dc:creator>Veli Körkkö</dc:creator>
      <dc:date>2005-07-12T11:50:55Z</dc:date>
    </item>
    <item>
      <title>Re: system crash</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580129#M69558</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;please try to report the instruction at PC = 891EB (or 891EA)&lt;BR /&gt;&lt;BR /&gt;$ ANAL/SYS SYS$SYSTEM:SYSDUMP.DMP&lt;BR /&gt;SDA&amp;gt; EXA/INS 891EB&lt;BR /&gt;SDA&amp;gt; EXA/INS 891EA&lt;BR /&gt;&lt;BR /&gt;CLUE reported the failing instruction at PC=000C09D4, could you please also examine&lt;BR /&gt;&lt;BR /&gt;SDA&amp;gt; EXA/INS C09d4&lt;BR /&gt;SDA&amp;gt; EXA/INS C09D3&lt;BR /&gt;&lt;BR /&gt;Volker.&lt;BR /&gt;</description>
      <pubDate>Tue, 12 Jul 2005 13:00:13 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580129#M69558</guid>
      <dc:creator>Volker Halle</dc:creator>
      <dc:date>2005-07-12T13:00:13Z</dc:date>
    </item>
    <item>
      <title>Re: system crash</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580130#M69559</link>
      <description>This is the same machine as reported before. Last boot time from this crash is just a couple of minutes after the previously reported crash time.&lt;BR /&gt;&lt;BR /&gt;Thanks for providing the CLUE file, this allows at least some educated guesses on what might have happened.&lt;BR /&gt;&lt;BR /&gt;Please also provide the data from SDA, as this may help make a decision between a software or hardware problem...&lt;BR /&gt;&lt;BR /&gt;Volker.</description>
      <pubDate>Tue, 12 Jul 2005 13:08:24 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580130#M69559</guid>
      <dc:creator>Volker Halle</dc:creator>
      <dc:date>2005-07-12T13:08:24Z</dc:date>
    </item>
    <item>
      <title>Re: system crash</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580131#M69560</link>
      <description>Hi all, thanks for your suggestions. &lt;BR /&gt;&lt;BR /&gt;Volker,&lt;BR /&gt;could you please let me know how  to get SDA output which you require, so that I can attach that also.&lt;BR /&gt;&lt;BR /&gt;Richard,&lt;BR /&gt;you are right, it is the same machine crashing with two different image name in halt crash. Generally, when a system crashes, it gives some application error log pointing a probable reason for crash. But in the two crash, this machine is not giving any application reason.&lt;BR /&gt;&lt;BR /&gt;Pls suggest if you need any other log, which I can attach.</description>
      <pubDate>Wed, 13 Jul 2005 01:12:07 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580131#M69560</guid>
      <dc:creator>Sk Noorul  Hassan</dc:creator>
      <dc:date>2005-07-13T01:12:07Z</dc:date>
    </item>
    <item>
      <title>Re: system crash</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580132#M69561</link>
      <description>Just a thought, correct me if I'm wrong, but AFAIK a VAX _requires_ a connected, switched-on terminal as console. I ran into a VAX system that halted due to the fact the VT200 used as a terminal broke down. &lt;BR /&gt;&lt;BR /&gt;Could it be that the console-terminal is broke or has a failing connection? Can it be that this is switched off by the application (due to the crash) and therefore crashing VMS?&lt;BR /&gt;&lt;BR /&gt;Willem</description>
      <pubDate>Wed, 13 Jul 2005 02:08:12 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580132#M69561</guid>
      <dc:creator>Willem Grooters</dc:creator>
      <dc:date>2005-07-13T02:08:12Z</dc:date>
    </item>
    <item>
      <title>Re: system crash</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580133#M69562</link>
      <description>some VAXes halt if their VT consoles get switched off (especially VAXstations) but others don't mind at all.</description>
      <pubDate>Wed, 13 Jul 2005 04:28:53 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580133#M69562</guid>
      <dc:creator>Ian Miller.</dc:creator>
      <dc:date>2005-07-13T04:28:53Z</dc:date>
    </item>
    <item>
      <title>Re: system crash</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580134#M69563</link>
      <description>Volker, pls find the instructions as asked by you.&lt;BR /&gt;&lt;BR /&gt;SDA&amp;gt;EXA/INS 891EB &lt;BR /&gt;000891EB: XFC&lt;BR /&gt;SDA&amp;gt;EXA/INS 891EA &lt;BR /&gt;000891EA: NOP&lt;BR /&gt;SDA&amp;gt;EXA/INS C09D4 &lt;BR /&gt;000C09D4 : RET&lt;BR /&gt;SDA&amp;gt;EXA/INS C09D3 &lt;BR /&gt;000C09D4 : HALT&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Pls suggest..&lt;BR /&gt;</description>
      <pubDate>Wed, 13 Jul 2005 08:39:02 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580134#M69563</guid>
      <dc:creator>Sk Noorul  Hassan</dc:creator>
      <dc:date>2005-07-13T08:39:02Z</dc:date>
    </item>
    <item>
      <title>Re: system crash</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580135#M69564</link>
      <description>SDA&amp;gt;EXA/INS C09D3 &lt;BR /&gt;000C09D4 : HALT&lt;BR /&gt;^^^^^^^^ this should be 000C09D3, right ?!&lt;BR /&gt;SDA&amp;gt;EXA/INS C09D4 &lt;BR /&gt;000C09D4 : RET&lt;BR /&gt;&lt;BR /&gt;If this would be the real HALT-PC, it makes sense. The other 2 instructions could not have halted the system.&lt;BR /&gt;&lt;BR /&gt;Could you now also please try to examine the instruction stream leading to 000C09D3 and 000891EA ?&lt;BR /&gt;&lt;BR /&gt;Start with SDA&amp;gt; EXA/INS C09D4-10;10&lt;BR /&gt;If this provides a valid instruction stream up to address C09D4, please post it. Otherwise try -11;11 or -A;A - VAX instructions are variable length and you need to find the beginning of a valid instruction to be able to decode the whole instruction stream.&lt;BR /&gt;&lt;BR /&gt;Then please do the same with 891EA-10;10 and so on.&lt;BR /&gt;&lt;BR /&gt;Volker.</description>
      <pubDate>Wed, 13 Jul 2005 09:20:27 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580135#M69564</guid>
      <dc:creator>Volker Halle</dc:creator>
      <dc:date>2005-07-13T09:20:27Z</dc:date>
    </item>
    <item>
      <title>Re: system crash</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580136#M69565</link>
      <description>Willem &amp;amp; Ian: &lt;BR /&gt;&lt;BR /&gt;You've reminded me of a similar situation. The VAX (don't remember what kind, but it wasn't a VAXstation) would just sometimes crash for no reason. &lt;BR /&gt;&lt;BR /&gt;Finally realized that they had a PC as the console, and were using the console to run a data entry application. The Terminal Emulator had F5 mapped to send &lt;BREAK&gt;, and the operator would sometimes accidently hit the F5 key. &lt;BR /&gt;&lt;BR /&gt;I don't remember if there was also a console command that set the break condition, but I moved that operators PC off of the console and put in a dedicated console with break disabled. It never crashed like that again.&lt;BR /&gt;&lt;/BREAK&gt;</description>
      <pubDate>Wed, 13 Jul 2005 09:45:00 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580136#M69565</guid>
      <dc:creator>Doug Phillips</dc:creator>
      <dc:date>2005-07-13T09:45:00Z</dc:date>
    </item>
    <item>
      <title>Re: system crash</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580137#M69566</link>
      <description>re: console receiving &lt;BREAK&gt;&lt;BR /&gt;&lt;BR /&gt;I'm pretty sure that the system will just HALT and display the console prompt &amp;gt;&amp;gt;&amp;gt;&lt;BR /&gt;but it will not crash with a HALT restart bugcheck. The HALT restart crash should only happen, if the console detects, that the operating system has issued a HALT instruction in kernel mode (thus halting the CPU) and the HALT console parameter is set to RESTART. The console would then try to restart OpenVMS via the restart entry point.&lt;BR /&gt;&lt;BR /&gt;Volker.&lt;/BREAK&gt;</description>
      <pubDate>Wed, 13 Jul 2005 11:27:43 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580137#M69566</guid>
      <dc:creator>Volker Halle</dc:creator>
      <dc:date>2005-07-13T11:27:43Z</dc:date>
    </item>
    <item>
      <title>Re: system crash</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580138#M69567</link>
      <description>Thanks, Volker. It was just an old fuzzy memory from the distant past, and I think it did just halt to the &amp;gt;&amp;gt;&amp;gt;. &amp;lt;:-\&lt;BR /&gt;</description>
      <pubDate>Wed, 13 Jul 2005 12:02:50 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580138#M69567</guid>
      <dc:creator>Doug Phillips</dc:creator>
      <dc:date>2005-07-13T12:02:50Z</dc:date>
    </item>
    <item>
      <title>Re: system crash</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580139#M69568</link>
      <description>Thanks for the suggestion. &lt;BR /&gt;&lt;BR /&gt;Volker, &lt;BR /&gt;I will get back after trying your suggestions.</description>
      <pubDate>Thu, 14 Jul 2005 13:47:56 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580139#M69568</guid>
      <dc:creator>Sk Noorul  Hassan</dc:creator>
      <dc:date>2005-07-14T13:47:56Z</dc:date>
    </item>
    <item>
      <title>Re: system crash</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580140#M69569</link>
      <description>Too many years and too many versions of O/S ago, a company I worked for tried something like this.  It is important to know, at least in overview, how the standby machine learns that it needs to become the duty machine - and how it learns later that it should NOT have tried to become the duty machine.&lt;BR /&gt;&lt;BR /&gt;Can you perhaps give us a brief overview of the method you are using to trigger the change in system status?  As I recall, there is a possible situation in which you could run afoul of one of the Goedel theorems on computability that would prevent this from being a truly reliable process, depending on exactly how you approach it.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 18 Jul 2005 08:12:23 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580140#M69569</guid>
      <dc:creator>Richard W Hunt</dc:creator>
      <dc:date>2005-07-18T08:12:23Z</dc:date>
    </item>
    <item>
      <title>Re: system crash</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580141#M69570</link>
      <description>Please see my entry from today on the other thread:&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://forums2.itrc.hp.com/service/forums/questionanswer.do?threadId=929654" target="_blank"&gt;http://forums2.itrc.hp.com/service/forums/questionanswer.do?threadId=929654&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;It may not be possible to obtain the HALT PC from the dump, but ONLY from the halt message on the console.&lt;BR /&gt;&lt;BR /&gt;Volker.</description>
      <pubDate>Mon, 18 Jul 2005 08:31:27 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580141#M69570</guid>
      <dc:creator>Volker Halle</dc:creator>
      <dc:date>2005-07-18T08:31:27Z</dc:date>
    </item>
    <item>
      <title>Re: system crash</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580142#M69571</link>
      <description>The data in the CLUE file (read from the crash) is inconsistent:&lt;BR /&gt;&lt;BR /&gt;For a valid restart crash (see [SYS]POWERFAIL routine EXE$RESTART_ATT), the following input values are expected:&lt;BR /&gt;&lt;BR /&gt;AP - Halt reason code (a value between 3 and 31.)&lt;BR /&gt;R10 - HALT PC&lt;BR /&gt;R11 - HALT PSL&lt;BR /&gt;&lt;BR /&gt;These values do not match in this crash, which makes a hardware problem even more likely.&lt;BR /&gt;&lt;BR /&gt;Volker.</description>
      <pubDate>Wed, 20 Jul 2005 01:22:45 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/system-crash/m-p/3580142#M69571</guid>
      <dc:creator>Volker Halle</dc:creator>
      <dc:date>2005-07-20T01:22:45Z</dc:date>
    </item>
  </channel>
</rss>

