<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Crash Dump Reset in Operating System - OpenVMS</title>
    <link>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065760#M86208</link>
    <description>We have an AS4100 with OpenVMS 7.3-2.  Patches are up to date (AFAIK).  &lt;BR /&gt;&lt;BR /&gt;We are having crashes after a recent hardware upgrade so we have a pretty good idea on what changed.  We had no crashes for like a year or more before that, and the crash last year was when the swap/page file disk died a horrible death.  No problems in understanding there, and prior to that, the last crash was 2001.  Very stable system.  So it is clear that the new hardware is the culprit.&lt;BR /&gt;&lt;BR /&gt;The problem is that our crash dump won't update.  I can't give the tech guys details on the crash dump because I don't have them.  &lt;BR /&gt;&lt;BR /&gt;We are NOT set up to use the page file for crash dumps.  We have a separate SYS$SYSTEM:SYSDUMP.DMP file.  DUMPSTYLE is set to 9 (selective, compressed, use system disk)  We took a crash dump a couple of days ago but we have crashed at least twice since then without another crash dump being written.&lt;BR /&gt;&lt;BR /&gt;I have heard there is some sort of interlock file but I don't know where it is and wading through the manuals has (so far) produced no joy.  I've looked in System Manager's Manual, System Management Utilities Manual, and a few other miscellaneous places.  I see a lot about how to analyze the dump.  I see a lot about copying a dump from the page file via the CDA utility.  But if there is a free-standing crash dump file in a valid path, why won't a crash overwrite the previous contents?&lt;BR /&gt;&lt;BR /&gt;</description>
    <pubDate>Wed, 29 Aug 2007 08:07:39 GMT</pubDate>
    <dc:creator>Richard W Hunt</dc:creator>
    <dc:date>2007-08-29T08:07:39Z</dc:date>
    <item>
      <title>Crash Dump Reset</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065760#M86208</link>
      <description>We have an AS4100 with OpenVMS 7.3-2.  Patches are up to date (AFAIK).  &lt;BR /&gt;&lt;BR /&gt;We are having crashes after a recent hardware upgrade so we have a pretty good idea on what changed.  We had no crashes for like a year or more before that, and the crash last year was when the swap/page file disk died a horrible death.  No problems in understanding there, and prior to that, the last crash was 2001.  Very stable system.  So it is clear that the new hardware is the culprit.&lt;BR /&gt;&lt;BR /&gt;The problem is that our crash dump won't update.  I can't give the tech guys details on the crash dump because I don't have them.  &lt;BR /&gt;&lt;BR /&gt;We are NOT set up to use the page file for crash dumps.  We have a separate SYS$SYSTEM:SYSDUMP.DMP file.  DUMPSTYLE is set to 9 (selective, compressed, use system disk)  We took a crash dump a couple of days ago but we have crashed at least twice since then without another crash dump being written.&lt;BR /&gt;&lt;BR /&gt;I have heard there is some sort of interlock file but I don't know where it is and wading through the manuals has (so far) produced no joy.  I've looked in System Manager's Manual, System Management Utilities Manual, and a few other miscellaneous places.  I see a lot about how to analyze the dump.  I see a lot about copying a dump from the page file via the CDA utility.  But if there is a free-standing crash dump file in a valid path, why won't a crash overwrite the previous contents?&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 29 Aug 2007 08:07:39 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065760#M86208</guid>
      <dc:creator>Richard W Hunt</dc:creator>
      <dc:date>2007-08-29T08:07:39Z</dc:date>
    </item>
    <item>
      <title>Re: Crash Dump Reset</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065761#M86209</link>
      <description>Richard,&lt;BR /&gt;&lt;BR /&gt;What kind of hardware upgrade was done?  If it was something that might have an effect on the I/O bus (especially a drive controller or disk drive), it is quite possible that whatever is causing the crash is also preventing any further I/O to the system disk.  It could be a problem with the system disk itself - maybe even a bad spot within SYSDUMP.DMP.  If you have enough space, you might try creating a new dumpfile (rename the old one, create a new one, reboot) and see if the behavior changes.&lt;BR /&gt;&lt;BR /&gt;Allan in Atlanta&lt;BR /&gt;</description>
      <pubDate>Wed, 29 Aug 2007 09:20:52 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065761#M86209</guid>
      <dc:creator>Allan Bowman</dc:creator>
      <dc:date>2007-08-29T09:20:52Z</dc:date>
    </item>
    <item>
      <title>Re: Crash Dump Reset</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065762#M86210</link>
      <description>Richard,&lt;BR /&gt;&lt;BR /&gt;may be the system died a death without being able to write to the dumpfile, e.g. ane error on the main board etc.&lt;BR /&gt;&lt;BR /&gt;Are any messages on the console or in the error log (DIAGNOSE)?&lt;BR /&gt;&lt;BR /&gt;regards Kalle</description>
      <pubDate>Wed, 29 Aug 2007 09:21:17 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065762#M86210</guid>
      <dc:creator>Karl Rohwedder</dc:creator>
      <dc:date>2007-08-29T09:21:17Z</dc:date>
    </item>
    <item>
      <title>Re: Crash Dump Reset</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065763#M86211</link>
      <description>The memory was upgraded from 2 Gb to 8 Gb.&lt;BR /&gt;&lt;BR /&gt;We are looking at the IOD board, the chance of having a DOA memory card, and the fact that S3 cards with a big memory &amp;amp; a 64-bit interface can hang a system.  The KGPSA we use for our SAN fits that bill perfectly.  Our tech guy has found some memories to swap out and he is also bringing a new IOD board.&lt;BR /&gt;&lt;BR /&gt;The DIAGNOSE command reveals some correctable ECC on the original crash, but later crashes go straight from a TIMESTAMP entry to a reboot entry.  I can't check for more than that right now because the hardware tech has the system even as I type this.&lt;BR /&gt;&lt;BR /&gt;The crash dump from two days ago was calling out a machine check from KERNEL mode and the K-stack was indicating (a) no currently runnning process and (b) The ECC_CORRECTABLE routine was referenced in the bowels of the stack.&lt;BR /&gt;&lt;BR /&gt;We are confident that this problem is either a bad memory or a bad IOD board.  (For those who don't know:  IOD translates addresses from PCI bus to Alpha memory bus.  That's the short answer and don't whack me for simplifying.  I know it does more than that.)&lt;BR /&gt;&lt;BR /&gt;My issue remains that I'm disturbed about not getting a crash dump.  No, no bad spots reported on the disk.  I can do an analyze RMS telling it to verify the blocks of the file and it passes that test, so the SYSDUMP.DMP isn't bad that I know of.  I just am worried that I can't provide better info to my tech support guys when they ask me "What's up?"  (Though the correct answer is more often "Well, we're not.")&lt;BR /&gt;</description>
      <pubDate>Wed, 29 Aug 2007 09:35:44 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065763#M86211</guid>
      <dc:creator>Richard W Hunt</dc:creator>
      <dc:date>2007-08-29T09:35:44Z</dc:date>
    </item>
    <item>
      <title>Re: Crash Dump Reset</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065764#M86212</link>
      <description>It is possible that the dumpfile is not being written due to the setting of the SRM auto_action environmnet variable.&lt;BR /&gt;&lt;BR /&gt;For the following bugchecks:&lt;BR /&gt;&lt;BR /&gt;UNKRSTRT&lt;BR /&gt;KRNLSTKNV&lt;BR /&gt;INVSCBB&lt;BR /&gt;HALT&lt;BR /&gt;DBLERR&lt;BR /&gt;INVPTBR&lt;BR /&gt;MCHECKPAL&lt;BR /&gt;&lt;BR /&gt;the dumptile will not be written unless auto_action is set to "RESTART".&lt;BR /&gt;&lt;BR /&gt;Check with &lt;BR /&gt;&lt;BR /&gt;$ write sys$output f$getenv("AUTO_ACTION")&lt;BR /&gt;&lt;BR /&gt;or &lt;BR /&gt;&lt;BR /&gt;&amp;gt;&amp;gt;&amp;gt; show auto_action&lt;BR /&gt;&lt;BR /&gt;To change in a supported way:&lt;BR /&gt;&lt;BR /&gt;&amp;gt;&amp;gt;&amp;gt; set auto_action RESTART&lt;BR /&gt;&lt;BR /&gt;If that is already set to restart, then please provide the output of &lt;BR /&gt;&lt;BR /&gt;$ mcr sysgen show dumpbug ! should be 1&lt;BR /&gt;$ mcr sysgen show dumpstyle&lt;BR /&gt;$ mcr sysgen show savedump&lt;BR /&gt;&lt;BR /&gt;example:&lt;BR /&gt;&lt;BR /&gt;$ write sys$output f$getenv("AUTO_ACTION")&lt;BR /&gt;RESTART&lt;BR /&gt;$ mcr sysgen show dumpbug ! should be 1&lt;BR /&gt;Parameter Name           Current    Default     Min.      Max.     Unit  Dynamic&lt;BR /&gt;--------------           -------    -------    -------   -------   ----  -------&lt;BR /&gt;DUMPBUG                         1          1         0          1 Boolean    &lt;BR /&gt;$ mcr sysgen show dumpstyle            &lt;BR /&gt;Parameter Name           Current    Default     Min.      Max.     Unit  Dynamic&lt;BR /&gt;--------------           -------    -------    -------   -------   ----  -------&lt;BR /&gt;DUMPSTYLE                       9          0         0         -1 Bitmask    D&lt;BR /&gt;$ mcr sysgen show savedump &lt;BR /&gt;Parameter Name           Current    Default     Min.      Max.     Unit  Dynamic&lt;BR /&gt;--------------           -------    -------    -------   -------   ----  -------&lt;BR /&gt;SAVEDUMP                        0          0         0          1 Boolean    &lt;BR /&gt;$</description>
      <pubDate>Wed, 29 Aug 2007 09:55:57 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065764#M86212</guid>
      <dc:creator>Jon Pinkley</dc:creator>
      <dc:date>2007-08-29T09:55:57Z</dc:date>
    </item>
    <item>
      <title>Re: Crash Dump Reset</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065765#M86213</link>
      <description>You may want to connect something to the console to capture output (PC with terminal emulator that has record feature, or hard copy terminal).&lt;BR /&gt;&lt;BR /&gt;Then set bit 1 of DUMPSTYLE which will give verbose console output on crashdump.&lt;BR /&gt;&lt;BR /&gt;e.g. if current value of DUMPSTYLE is 9 (2^0 + 2^3) change to 11 (2^0 + 2^1 + 2^3)&lt;BR /&gt;&lt;BR /&gt;Good Luck,&lt;BR /&gt;&lt;BR /&gt;Jon</description>
      <pubDate>Wed, 29 Aug 2007 10:05:00 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065765#M86213</guid>
      <dc:creator>Jon Pinkley</dc:creator>
      <dc:date>2007-08-29T10:05:00Z</dc:date>
    </item>
    <item>
      <title>Re: Crash Dump Reset</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065766#M86214</link>
      <description>OK, follow-up:&lt;BR /&gt;&lt;BR /&gt;This might be due to having an S3 Trio board in the system.  Our tech tells me this can cause a HANG, not a crash, and therefore you don't get a crash dump.  Two days ago we had a crash dump that led to replacing one of our CPU cards.  That didn't bother be because that crash dump occurred.  My concern was never that we crashed... I sort of expected problems after a hardware upgrade.  It was the lack of a crash dump.&lt;BR /&gt;&lt;BR /&gt;Auto-Action is RESTART so that's why I rather expected a crash dump, but of course if this is a HANG and my ops guys just hit the INIT button, that would do it too.  I need to communicate with them the urgency of knowing when the INIT button is hit.&lt;BR /&gt;&lt;BR /&gt;Thanks for the concern, guys.  We are going to replace the console video board and see if that clears up the last nagging problem.  I won't close this right away, but I'll check in if our system stabilized.&lt;BR /&gt;</description>
      <pubDate>Wed, 29 Aug 2007 10:29:27 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065766#M86214</guid>
      <dc:creator>Richard W Hunt</dc:creator>
      <dc:date>2007-08-29T10:29:27Z</dc:date>
    </item>
    <item>
      <title>Re: Crash Dump Reset</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065767#M86215</link>
      <description>The S3 video card and fibre channel HBAs need to be installed on different PCI buses.  &lt;BR /&gt;&lt;BR /&gt;If I recall, correctly, you can violate this configuration rule if you have 2 GB or less of memory.  Since you have a recent memory upgrade, please check the location of HBAs and the video card.  &lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Andy Bustamante</description>
      <pubDate>Fri, 31 Aug 2007 21:28:14 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065767#M86215</guid>
      <dc:creator>Andy Bustamante</dc:creator>
      <dc:date>2007-08-31T21:28:14Z</dc:date>
    </item>
    <item>
      <title>Re: Crash Dump Reset</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065768#M86216</link>
      <description>You multiplied the momory by 4. Did you also increase the dump file size accordingly ?&lt;BR /&gt;&lt;BR /&gt;Wim</description>
      <pubDate>Mon, 03 Sep 2007 03:23:44 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065768#M86216</guid>
      <dc:creator>Wim Van den Wyngaert</dc:creator>
      <dc:date>2007-09-03T03:23:44Z</dc:date>
    </item>
    <item>
      <title>Re: Crash Dump Reset</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065769#M86217</link>
      <description>Didn't have another PCI bus, so we swapped the S3 card for an ELSA Gloria Synergy, which is known to not have the problem.&lt;BR /&gt;&lt;BR /&gt;As to the DUMPFILE - we are currently running a compressed selective dump until I can go to the remote site to issue some console commands.  I have a full-sized dump file ready to go but I have to use DOSD, hence the console operations.&lt;BR /&gt;&lt;BR /&gt;As of Saturday, we have had no more problems.  I think our issue is resolve.  Thanks to all who replied.&lt;BR /&gt;</description>
      <pubDate>Thu, 06 Sep 2007 11:20:56 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065769#M86217</guid>
      <dc:creator>Richard W Hunt</dc:creator>
      <dc:date>2007-09-06T11:20:56Z</dc:date>
    </item>
    <item>
      <title>Re: Crash Dump Reset</title>
      <link>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065770#M86218</link>
      <description>After three events, we have stabilized our system.&lt;BR /&gt;&lt;BR /&gt;1.  Swapped video cards (see in thread)&lt;BR /&gt;&lt;BR /&gt;2.  Swapped out one of our CPU boards, which was doing something vile but unknown to the new memories and busses.&lt;BR /&gt;&lt;BR /&gt;3.  Scheduled a visit so we can fix the dump file permanently via DOSD techniques.&lt;BR /&gt;</description>
      <pubDate>Thu, 06 Sep 2007 11:23:53 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-openvms/crash-dump-reset/m-p/5065770#M86218</guid>
      <dc:creator>Richard W Hunt</dc:creator>
      <dc:date>2007-09-06T11:23:53Z</dc:date>
    </item>
  </channel>
</rss>

