<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: memory single bit errors in Operating System - HP-UX</title>
    <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724143#M253496</link>
    <description>It's a big jump from saying the data is not what they expect to see to saying it must be a hardware problem; there are all sorts of software reasons that could cause this.&lt;BR /&gt;&lt;BR /&gt;If the STM memory tool is not showing any errors, then I think you are entitled to tell Oracle that you have checked the hardware and that the ball is back in their court to do some serious troubleshooting rather than passing the buck.&lt;BR /&gt;&lt;BR /&gt;Do you have the latest patches applied for Oracle?  &lt;BR /&gt;&lt;BR /&gt;What settings do you have for the Oracle parameters "db_block_checking" and "db_block_checksum"?   Setting these to 'true' has been seen to reduce the occurrence of these errors with Oracle 8 and Superdome systems in the past.&lt;BR /&gt;&lt;BR /&gt;Andrew</description>
    <pubDate>Mon, 06 Feb 2006 11:17:45 GMT</pubDate>
    <dc:creator>Andrew Merritt_2</dc:creator>
    <dc:date>2006-02-06T11:17:45Z</dc:date>
    <item>
      <title>memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724128#M253481</link>
      <description>Hi&lt;BR /&gt; We are running an Oracle 9 database on one of our HP servers (RP4440), and recently we have been getting many failures on DB operations. Please find the error trace below.&lt;BR /&gt;Errors in file /u01/app/oracle/admin/PSFFA/udump/psffa_ora_29571.trc:&lt;BR /&gt;&lt;BR /&gt;ORA-00600: internal error code, arguments: [17114], [0x800003FB800670D0], [], [], [], [], [], []&lt;BR /&gt;&lt;BR /&gt;Thu Feb  2 06:41:02 2006&lt;BR /&gt;&lt;BR /&gt;Errors in file /u01/app/oracle/admin/PSFFA/udump/psffa_ora_29571.trc:&lt;BR /&gt;&lt;BR /&gt;ORA-00600: internal error code, arguments: [17147], [0x800003FB800670D0], [], [], [], [], [], []&lt;BR /&gt;&lt;BR /&gt;Thu Feb  2 06:41:08 2006&lt;BR /&gt;&lt;BR /&gt;Errors in file /u01/app/oracle/admin/PSFFA/udump/psffa_ora_29571.trc:&lt;BR /&gt;&lt;BR /&gt;ORA-00600: internal error code, arguments: [17147], [0x800003FB800670D0], [], [], [], [], [], []&lt;BR /&gt;&lt;BR /&gt;ORA-00600: internal error code, arguments: [17147], [0x800003FB800670D0], [], [], [], [], [], []&lt;BR /&gt;&lt;BR /&gt;Thu Feb  2 06:41:09 2006&lt;BR /&gt;&lt;BR /&gt;Errors in file /u01/app/oracle/admin/PSFFA/udump/psffa_ora_29559.trc:&lt;BR /&gt;&lt;BR /&gt;ORA-00600: internal error code, arguments: [17147], [0x800003FB800673F8], [], [], [], [], [], []&lt;BR /&gt;&lt;BR /&gt;Thu Feb  2 06:41:09 2006&lt;BR /&gt;&lt;BR /&gt;Errors in file /u01/app/oracle/admin/PSFFA/udump/psffa_ora_29565.trc:&lt;BR /&gt;&lt;BR /&gt;ORA-00600: internal error code, arguments: [17147], [0x800003FB00010668], [], [], [], [], [], []&lt;BR /&gt;&lt;BR /&gt;The Oracle support team debugged these errors and told us that "single bit memory errors" are causing this problem. But our CSTM logs are not showing any memory errors. Is there any other way to find out about memory-related issues?&lt;BR /&gt;&lt;BR /&gt;Thanks in advance,&lt;BR /&gt;Chandra&lt;BR /&gt;</description>
      <pubDate>Fri, 03 Feb 2006 13:58:22 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724128#M253481</guid>
      <dc:creator>Chandra Sekhar_5</dc:creator>
      <dc:date>2006-02-03T13:58:22Z</dc:date>
    </item>
    <item>
      <title>Re: memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724129#M253482</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;Run the stm (cstm, mstm, xstm) info tool on the memory item. This will report information about single-bit errors. If you get no info about a single-bit error, you probably have no error!</description>
      <pubDate>Fri, 03 Feb 2006 14:03:40 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724129#M253482</guid>
      <dc:creator>Torsten.</dc:creator>
      <dc:date>2006-02-03T14:03:40Z</dc:date>
    </item>
    <item>
      <title>Re: memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724130#M253483</link>
      <description>You can also configure the Event Monitoring System (EMS) with monconfig.&lt;BR /&gt;&lt;BR /&gt;The hardware usually has a service processor that can show logs about hardware events.&lt;BR /&gt;&lt;BR /&gt;Single-bit errors can also be caused by inadequate cooling of the system.</description>
      <pubDate>Fri, 03 Feb 2006 14:13:48 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724130#M253483</guid>
      <dc:creator>Ivan Ferreira</dc:creator>
      <dc:date>2006-02-03T14:13:48Z</dc:date>
    </item>
    <item>
      <title>Re: memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724131#M253484</link>
      <description>Get a baseball bat (a Cricket bat will do in a pinch) and apply vigorously to the cranial areas of your Oracle support team; if CSTM is not reporting single-bit memory errors then you don't have them; moreover, single-bit errors with error-correcting memory should be invisible to an application --- that's why you have ECC memory; the application wouldn't have a clue that the parity bit even exists.&lt;BR /&gt;</description>
      <pubDate>Fri, 03 Feb 2006 14:16:47 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724131#M253484</guid>
      <dc:creator>A. Clay Stephenson</dc:creator>
      <dc:date>2006-02-03T14:16:47Z</dc:date>
    </item>
    <item>
      <title>Re: memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724132#M253485</link>
      <description>The good news is that the baseball bat will correct single-bit errors in the Oracle Support Team. Multiple treatments may be required.</description>
      <pubDate>Fri, 03 Feb 2006 14:20:25 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724132#M253485</guid>
      <dc:creator>A. Clay Stephenson</dc:creator>
      <dc:date>2006-02-03T14:20:25Z</dc:date>
    </item>
    <item>
      <title>Re: memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724133#M253486</link>
      <description>As always - trust A. Clay Stephenson and follow his recommendations!&lt;BR /&gt;;-)</description>
      <pubDate>Fri, 03 Feb 2006 14:24:25 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724133#M253486</guid>
      <dc:creator>Torsten.</dc:creator>
      <dc:date>2006-02-03T14:24:25Z</dc:date>
    </item>
    <item>
      <title>Re: memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724134#M253487</link>
      <description>&lt;BR /&gt;I agree with Stephenson. I don't think there is any memory issue, as we don't see any events from CSTM. But just to be on the safe side before discussing with the Oracle team, I want to confirm the CSTM version too. I am running CSTM A.47.00. Does this have any limitation in reporting memory errors? Do I need to upgrade?&lt;BR /&gt;</description>
      <pubDate>Fri, 03 Feb 2006 14:40:13 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724134#M253487</guid>
      <dc:creator>Chandra Sekhar_5</dc:creator>
      <dc:date>2006-02-03T14:40:13Z</dc:date>
    </item>
    <item>
      <title>Re: memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724135#M253488</link>
      <description>A.47.00 is the Dec 2004 version for HP-UX 11.11; A.49.10 (Sep 05) is current.&lt;BR /&gt;&lt;BR /&gt;See &lt;A href="http://docs.hp.com/en/diag/stm/stm_upd.htm" target="_blank"&gt;http://docs.hp.com/en/diag/stm/stm_upd.htm&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;To be sure, you should check your firmware (PDC) level! This is the only important point regarding single-bit errors.</description>
      <pubDate>Fri, 03 Feb 2006 15:05:00 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724135#M253488</guid>
      <dc:creator>Torsten.</dc:creator>
      <dc:date>2006-02-03T15:05:00Z</dc:date>
    </item>
    <item>
      <title>Re: memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724136#M253489</link>
      <description>Hi&lt;BR /&gt; My PDC version is 45.11. Is that fine, or does it require an upgrade?&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Fri, 03 Feb 2006 15:27:25 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724136#M253489</guid>
      <dc:creator>Chandra Sekhar_5</dc:creator>
      <dc:date>2006-02-03T15:27:25Z</dc:date>
    </item>
    <item>
      <title>Re: memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724137#M253490</link>
      <description>Chandra,&lt;BR /&gt;&lt;BR /&gt;Clay is not only very funny, but correct.  There is no issue with your machine's memory.  If you had any single bit memory errors they would be in your syslog (as well as reported by STM) such as:&lt;BR /&gt;&lt;BR /&gt;Feb  3 11:09:28 servername vmunix: LPMC type : SEDC (ECC-corrected single-bit error)&lt;BR /&gt;&lt;BR /&gt;Get a bat ;^)</description>
      <pubDate>Fri, 03 Feb 2006 15:42:24 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724137#M253490</guid>
      <dc:creator>Tom Danzig</dc:creator>
      <dc:date>2006-02-03T15:42:24Z</dc:date>
    </item>
    <item>
      <title>Re: memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724138#M253491</link>
      <description>Hi Chandra,&lt;BR /&gt;   Just to concur with the above comments, it is extremely unlikely that SBEs are causing the problem with Oracle.&lt;BR /&gt;&lt;BR /&gt;To expand on the OnlineDiags behaviour, and correct a couple of the comments above:&lt;BR /&gt;&lt;BR /&gt;A.47.00 is not the latest version of OnlineDiags, but it is supported, and there aren't any relevant known problems with it.  PHSS_33673 is the latest patch for it, and you should have that installed (you can tell if you run STM, the version will be shown as A.47.15 (I think)).&lt;BR /&gt;&lt;BR /&gt;If there have been SBEs detected, you should see these listed when you run the Memory Info Tool in STM.&lt;BR /&gt;&lt;BR /&gt;What you won't see is events in syslog, or in /var/opt/resmon/log/event.log for individual Single Bit Errors.  Since these are normal events, at a low frequency, and since the error correction takes care of them, they no longer generate EMS events.  This is because some customers have been unnecessarily alarmed when they see them.&lt;BR /&gt;&lt;BR /&gt;If there is a significant number of SBEs in a short time, then you will see EMS events being generated warning of this and indicating the hardware needs to be replaced.&lt;BR /&gt;&lt;BR /&gt;Andrew</description>
      <pubDate>Mon, 06 Feb 2006 06:27:30 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724138#M253491</guid>
      <dc:creator>Andrew Merritt_2</dc:creator>
      <dc:date>2006-02-06T06:27:30Z</dc:date>
    </item>
    <item>
      <title>Re: memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724139#M253492</link>
      <description>ORA-00600:&lt;BR /&gt;&lt;BR /&gt;Oracle knows something is wrong and is not going to work any more. Oracle has NO IDEA what is wrong.&lt;BR /&gt;&lt;BR /&gt;Don't blame the hardware; it's not causing this.&lt;BR /&gt;&lt;BR /&gt;SEP</description>
      <pubDate>Mon, 06 Feb 2006 06:38:32 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724139#M253492</guid>
      <dc:creator>Steven E. Protter</dc:creator>
      <dc:date>2006-02-06T06:38:32Z</dc:date>
    </item>
    <item>
      <title>Re: memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724140#M253493</link>
      <description>The whole point is that you were given a completely bogus answer by your Oracle guys and that you should communicate that back to them very strongly. Ask them very simple questions like "Where are your data to support this claim?" Single-bit errors under ECC -- which you have -- are corrected "on the fly" and are completely invisible to an application. A SBE would never make itself known to an application.</description>
      <pubDate>Mon, 06 Feb 2006 10:26:58 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724140#M253493</guid>
      <dc:creator>A. Clay Stephenson</dc:creator>
      <dc:date>2006-02-06T10:26:58Z</dc:date>
    </item>
    <item>
      <title>Re: memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724141#M253494</link>
      <description>&lt;BR /&gt;Please find the Oracle support response below.&lt;BR /&gt;&lt;BR /&gt;It appears the error is a single bit swap and does not line up with a known Oracle bug. Based on this the AR team is asking that hardware diagnostics be run on the server.&lt;BR /&gt; &lt;BR /&gt;- - - -&lt;BR /&gt;From the error in trace file psffa_ora_29567.trc&lt;BR /&gt; &lt;BR /&gt;Chunk 800003fb0003f938 sz=1074156928 ERROR, BAD MAGIC NUMBER (800003FAC0065580)&lt;BR /&gt;The lower 8 hex digits in (800003FAC0065580), C0065580, indicate the size of the chunk.&lt;BR /&gt; &lt;BR /&gt;The sz=1074156928 in hex is sz=0x40065580.&lt;BR /&gt; &lt;BR /&gt;Looking in the dump of addr=0x800003FB0003F938&lt;BR /&gt; &lt;BR /&gt;Dump of memory from 0x800003FB0003F8F8 to 0x800003FB0003FA38&lt;BR /&gt;800003FB0003F8F0 00000000 00000000 [........]&lt;BR /&gt;800003FB0003F900 00000000 00000000 00000000 00000000 [................]&lt;BR /&gt;Repeat 2 times&lt;BR /&gt;800003FB0003F930 00000000 00000000 800003FA C0065580 [..............U.] &amp;lt;---------Here &lt;BR /&gt;800003FB0003F940 800003FB 0003F480 40000000 0053C9F8 [........@....S..]&lt;BR /&gt;800003FB0003F950 00000FA0 00000000 00000000 00000001 [................]&lt;BR /&gt; &lt;BR /&gt;We see the value is C0065580 instead of 40065580 &lt;BR /&gt;C in binary is 1100. &lt;BR /&gt;4 in binary is 0100.&lt;BR /&gt; &lt;BR /&gt;As you can see, it seems there is a one-bit failure. Instead of 4 we have C.</description>
      <pubDate>Mon, 06 Feb 2006 10:55:41 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724141#M253494</guid>
      <dc:creator>Chandra Sekhar_5</dc:creator>
      <dc:date>2006-02-06T10:55:41Z</dc:date>
    </item>
    <item>
      <title>Re: memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724142#M253495</link>
      <description>I agree that the data differs by one bit. That does not imply that the hardware is responsible; it only means that the data have changed by one bit over some time interval --- probably by a software instruction --- even if unintentional. However, if this were memory induced, ECC would correct this SBE "on the fly" and the dump would never see it. Moreover, a message would be sent to syslog indicating that the error was detected and corrected.&lt;BR /&gt;&lt;BR /&gt;Run the diagnostics on your box and report the null results to the Oracle guys and then have them find the real problem.</description>
      <pubDate>Mon, 06 Feb 2006 11:07:42 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724142#M253495</guid>
      <dc:creator>A. Clay Stephenson</dc:creator>
      <dc:date>2006-02-06T11:07:42Z</dc:date>
    </item>
    <item>
      <title>Re: memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724143#M253496</link>
      <description>It's a big jump from saying the data is not what they expect to see to saying it must be a hardware problem; there are all sorts of software reasons that could cause this.&lt;BR /&gt;&lt;BR /&gt;If the STM memory tool is not showing any errors, then I think you are entitled to tell Oracle that you have checked the hardware and that the ball is back in their court to do some serious troubleshooting rather than passing the buck.&lt;BR /&gt;&lt;BR /&gt;Do you have the latest patches applied for Oracle?  &lt;BR /&gt;&lt;BR /&gt;What settings do you have for the Oracle parameters "db_block_checking" and "db_block_checksum"?   Setting these to 'true' has been seen to reduce the occurrence of these errors with Oracle 8 and Superdome systems in the past.&lt;BR /&gt;&lt;BR /&gt;Andrew</description>
      <pubDate>Mon, 06 Feb 2006 11:17:45 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724143#M253496</guid>
      <dc:creator>Andrew Merritt_2</dc:creator>
      <dc:date>2006-02-06T11:17:45Z</dc:date>
    </item>
    <item>
      <title>Re: memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724144#M253497</link>
      <description>&amp;gt; Moreover, a message would be sent to syslog&lt;BR /&gt;&amp;gt; indicating that the error was detected and&lt;BR /&gt;&amp;gt; corrected.&lt;BR /&gt;&lt;BR /&gt;Unless I'm getting confused with IA behaviour, I don't think that part is true.  Individual SBEs shouldn't now be logged to syslog, nor to event.log, though you should see them in the logtool output in STM, and the corresponding entries should be present in the PDT, viewable with the STM Memory Info tool.&lt;BR /&gt;&lt;BR /&gt;Andrew</description>
      <pubDate>Mon, 06 Feb 2006 11:30:22 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724144#M253497</guid>
      <dc:creator>Andrew Merritt_2</dc:creator>
      <dc:date>2006-02-06T11:30:22Z</dc:date>
    </item>
    <item>
      <title>Re: memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724145#M253498</link>
      <description>Hi&lt;BR /&gt;&lt;BR /&gt;With so many people helping on this issue, if this issue is resolved I think you should assign points to the people who have been spending their valuable time helping you out. You have not assigned a single point to anyone.&lt;BR /&gt;&lt;BR /&gt;Rgds&lt;BR /&gt;&lt;BR /&gt;HGN</description>
      <pubDate>Mon, 06 Feb 2006 11:34:16 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724145#M253498</guid>
      <dc:creator>HGN</dc:creator>
      <dc:date>2006-02-06T11:34:16Z</dc:date>
    </item>
    <item>
      <title>Re: memory single bit errors</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724146#M253499</link>
      <description>&lt;BR /&gt;Now the ball is in Oracle's court; they have eliminated a physical memory issue as the cause.&lt;BR /&gt;&lt;BR /&gt;Thanks a lot for your quick suggestions, and &lt;BR /&gt;sorry that I didn't give points on my last questions. This group is really amazing, and thanks to each individual who has provided input on this. I will definitely provide points to each responder.&lt;BR /&gt;&lt;BR /&gt;Thanks again.&lt;BR /&gt;-Chandra&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 07 Feb 2006 16:39:31 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/memory-single-bit-errors/m-p/3724146#M253499</guid>
      <dc:creator>Chandra Sekhar_5</dc:creator>
      <dc:date>2006-02-07T16:39:31Z</dc:date>
    </item>
  </channel>
</rss>

