<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Syslog Message. in Operating System - HP-UX</title>
    <link>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887019#M818962</link>
    <description>At this point, you're getting some errors in memory which are being self-corrected.  At this point, I would only continue to look to see how often this repeats itself.  &lt;BR /&gt;&lt;BR /&gt;You also may want to run stm and check run some information against your memory to check your PDT.  If you got a couple of entries in there, it may be a good thing to start placing a call for a possible problem on your memory board.</description>
    <pubDate>Tue, 21 Jan 2003 18:49:31 GMT</pubDate>
    <dc:creator>Marco Santerre</dc:creator>
    <dc:date>2003-01-21T18:49:31Z</dc:date>
    <item>
      <title>Syslog Message.</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887017#M818960</link>
      <description>Hi:&lt;BR /&gt;&lt;BR /&gt; I have attached the syslog message(11.0 n-class)and also the corresponding action i took. Please let me know what i should do further...&lt;BR /&gt;&lt;BR /&gt;Syslog Message:&lt;BR /&gt;Jan 20 12:49:05  EMS [1827]: ------ EMS Event Notification ------ &lt;BR /&gt;  Value: "MAJORWARNING (3)" for Resource: "/system/events/memory/192" &lt;BR /&gt;    (Threshold:  &amp;gt;= " 3")    Execute the following command to obtain&lt;BR /&gt; event details:   /opt/resmon/bin/resdata -R 119734274 -r&lt;BR /&gt; /system/events/memory/192 -n 119734277 -a &lt;BR /&gt;-----------------------------------------------&lt;BR /&gt;/opt/resmon/bin/resdata -R 119734274 -r /system/events/memory/192 &lt;BR /&gt;-n 119734277 -a OUTPUT BELOW.....&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;CURRENT MONITOR DATA:&lt;BR /&gt;&lt;BR /&gt;Event Time..........: Mon Jan 20 12:49:05 2003&lt;BR /&gt;Severity............: MAJORWARNING&lt;BR /&gt;Monitor.............: dm_memory&lt;BR /&gt;Event #.............: 4300                &lt;BR /&gt;System..............: &lt;BR /&gt;&lt;BR /&gt;Summary:&lt;BR /&gt;     Memory Event Type : Single bit error (SBE) event.  A correctable single&lt;BR /&gt;     bit error has been detected and logged.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Description of Error:&lt;BR /&gt;&lt;BR /&gt;    The memory component:&lt;BR /&gt;&lt;BR /&gt;          Cab/Cell or Node: 0&lt;BR /&gt;          MC/EXT: 1&lt;BR /&gt;          DIMM: 0b&lt;BR /&gt;&lt;BR /&gt;        is experiencing correctable single bit errors (SBE) on a single&lt;BR /&gt;        component.&lt;BR /&gt;&lt;BR /&gt;Probable Cause / Recommended Action:&lt;BR /&gt;&lt;BR /&gt;   Although the single bit errors are being corrected, it may be advisable to&lt;BR /&gt;   monitor the situation.  If an excessive rate of single bit errors occur, an&lt;BR /&gt;   event with higher severity will be generated.&lt;BR /&gt;&lt;BR /&gt;Additional Event Data: &lt;BR /&gt;     System IP Address...: &lt;BR /&gt;     Event Id............: 0x3e2c369100000000&lt;BR /&gt;     Monitor Version.....: B.01.00&lt;BR /&gt;     Event Class.........: I/O&lt;BR /&gt;     Client Configuration File...........:&lt;BR /&gt;     /var/stm/config/tools/monitor/default_dm_memory.clcfg &lt;BR /&gt;     Client Configuration File Version...: A.01.00 &lt;BR /&gt;          Qualification criteria met.&lt;BR /&gt;               Number of events..: 70 &lt;BR /&gt;     Associated OS error log entry id(s): &lt;BR /&gt;          None&lt;BR /&gt;     Additional System Data:&lt;BR /&gt;          System Model Number.............: 9000/800 &lt;BR /&gt;          EMS Version.....................: A.03.20 &lt;BR /&gt;          STM Version.....................: A.30.00 &lt;BR /&gt;     Latest information on this event:&lt;BR /&gt;          &lt;A href="http://docs.hp.com/hpux/content/hardware/ems/dm_memory.htm#4300" target="_blank"&gt;http://docs.hp.com/hpux/content/hardware/ems/dm_memory.htm#4300&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;v-v-v-v-v-v-v-v-v-v-v-v-v    D  E  T  A  I  L  S    v-v-v-v-v-v-v-v-v-v-v-v-v&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Component Data: &lt;BR /&gt;     Physical Device Path....: 192&lt;BR /&gt;     Tag 2...................: 20&lt;BR /&gt;&lt;BR /&gt;Thanks in advance,&lt;BR /&gt;&lt;BR /&gt;David.&lt;BR /&gt;</description>
      <pubDate>Tue, 21 Jan 2003 18:44:11 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887017#M818960</guid>
      <dc:creator>david_252</dc:creator>
      <dc:date>2003-01-21T18:44:11Z</dc:date>
    </item>
    <item>
      <title>Re: Syslog Message.</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887018#M818961</link>
      <description>This is reporting a single-bit error. Every once in awhile you could receive these errors. As long as the errors are not frequent and/or numerous you can monitor. If you receive numerous single-bit errors and/or they occur frequently, you most likely have memory issues. You can use stm to provide more - example, which bank is reporting the trouble.&lt;BR /&gt;&lt;BR /&gt;Do keep an eye on.</description>
      <pubDate>Tue, 21 Jan 2003 18:48:55 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887018#M818961</guid>
      <dc:creator>Rick Garland</dc:creator>
      <dc:date>2003-01-21T18:48:55Z</dc:date>
    </item>
    <item>
      <title>Re: Syslog Message.</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887019#M818962</link>
      <description>At this point, you're getting some errors in memory which are being self-corrected.  At this point, I would only continue to look to see how often this repeats itself.  &lt;BR /&gt;&lt;BR /&gt;You also may want to run stm and check run some information against your memory to check your PDT.  If you got a couple of entries in there, it may be a good thing to start placing a call for a possible problem on your memory board.</description>
      <pubDate>Tue, 21 Jan 2003 18:49:31 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887019#M818962</guid>
      <dc:creator>Marco Santerre</dc:creator>
      <dc:date>2003-01-21T18:49:31Z</dc:date>
    </item>
    <item>
      <title>Re: Syslog Message.</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887020#M818963</link>
      <description>Hi David,&lt;BR /&gt;&lt;BR /&gt;As Rick has mentioned a sole single-bit error or several over a long period is nothing to become alarmed about - that's what ECC memory is designed to handle.&lt;BR /&gt;What I would recommend is to go into stm &amp;amp; check the PDT (Page Deallocation Table) &amp;amp; verify that you don't have a bunch of pages deallocated. That would indicate a DIMM(s) is going bad.&lt;BR /&gt;&lt;BR /&gt;Rgds,&lt;BR /&gt;Jeff</description>
      <pubDate>Tue, 21 Jan 2003 18:52:09 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887020#M818963</guid>
      <dc:creator>Jeff Schussele</dc:creator>
      <dc:date>2003-01-21T18:52:09Z</dc:date>
    </item>
    <item>
      <title>Re: Syslog Message.</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887021#M818964</link>
      <description>David,&lt;BR /&gt;&lt;BR /&gt;Since the error was self corrected, there is nothing you can do now but keep monitoring the situation based on the following action recommended in your post:&lt;BR /&gt;&lt;BR /&gt;Probable Cause / Recommended Action: &lt;BR /&gt;&lt;BR /&gt;Although the single bit errors are being corrected, it may be advisable to &lt;BR /&gt;monitor the situation. If an excessive rate of single bit errors occur, an &lt;BR /&gt;event with higher severity will be generated. &lt;BR /&gt;&lt;BR /&gt;Hai</description>
      <pubDate>Tue, 21 Jan 2003 18:52:38 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887021#M818964</guid>
      <dc:creator>Hai Nguyen_1</dc:creator>
      <dc:date>2003-01-21T18:52:38Z</dc:date>
    </item>
    <item>
      <title>Re: Syslog Message.</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887022#M818965</link>
      <description>David&lt;BR /&gt;&lt;BR /&gt;We had started receiving this error on our V-2600 server and it started out similar about 1 every month or so, then without warning we lost a STICK of Memory and the server de-allocated the memory from usage. We had to schedule a downtime period to have the CE come in and replace the STICK.&lt;BR /&gt;&lt;BR /&gt;My advice to you is go ahead and have the STICK replaced as long as you have a Maintenance Agreement on the server. Don't wait for it to manifest itself into a bigger problem.&lt;BR /&gt;&lt;BR /&gt;Gl &lt;BR /&gt;&lt;BR /&gt;Frank G.</description>
      <pubDate>Tue, 21 Jan 2003 19:25:50 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887022#M818965</guid>
      <dc:creator>fg_1</dc:creator>
      <dc:date>2003-01-21T19:25:50Z</dc:date>
    </item>
    <item>
      <title>Re: Syslog Message.</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887023#M818966</link>
      <description>I strongly agree with everyone telling that you should replace this memory module. As soon as it reports single-bit errors (70 events), they can be corrected with CRC, but if double-bit error will occur server will likely to crash&lt;BR /&gt;Eugeny</description>
      <pubDate>Tue, 21 Jan 2003 19:39:03 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887023#M818966</guid>
      <dc:creator>Eugeny Brychkov</dc:creator>
      <dc:date>2003-01-21T19:39:03Z</dc:date>
    </item>
    <item>
      <title>Re: Syslog Message.</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887024#M818967</link>
      <description>Thanks a lot. Since we have a maintenance agreement i think i can try that. But meanwhile, if i have to run stm, Can i run it as non-root user? If so can it be run at any time of the day? (so as not to disturb the peak period) and pl. advise me what to look fo in that report.&lt;BR /&gt;&lt;BR /&gt;Thanks much for all the responses..&lt;BR /&gt;&lt;BR /&gt;Thanks&lt;BR /&gt;David.</description>
      <pubDate>Tue, 21 Jan 2003 19:50:21 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887024#M818967</guid>
      <dc:creator>david_252</dc:creator>
      <dc:date>2003-01-21T19:50:21Z</dc:date>
    </item>
    <item>
      <title>Re: Syslog Message.</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887025#M818968</link>
      <description>Yes you can run it as a non-root user.  &lt;BR /&gt;&lt;BR /&gt;What you have to do is basically, find the memory and highlight it, then click on Information, then click on Run.  This will generate a log which at the bottom of the log, you will find the PDT Table Entries.  It will tell you how many are Free, how many are Used, and how many are Available&lt;BR /&gt;</description>
      <pubDate>Tue, 21 Jan 2003 19:54:50 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887025#M818968</guid>
      <dc:creator>Marco Santerre</dc:creator>
      <dc:date>2003-01-21T19:54:50Z</dc:date>
    </item>
    <item>
      <title>Re: Syslog Message.</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887026#M818969</link>
      <description>Sorry, I forgot.  Yes you can run it at any time of the day.  It is non-disrtuptive, espceially when you only go and get information</description>
      <pubDate>Tue, 21 Jan 2003 19:56:10 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887026#M818969</guid>
      <dc:creator>Marco Santerre</dc:creator>
      <dc:date>2003-01-21T19:56:10Z</dc:date>
    </item>
    <item>
      <title>Re: Syslog Message.</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887027#M818970</link>
      <description>Hi Again:&lt;BR /&gt;&lt;BR /&gt;I have attached my stm report. Can someone suggest accordingly.&lt;BR /&gt;&lt;BR /&gt;Thanks&lt;BR /&gt;David.</description>
      <pubDate>Tue, 21 Jan 2003 20:18:34 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887027#M818970</guid>
      <dc:creator>david_252</dc:creator>
      <dc:date>2003-01-21T20:18:34Z</dc:date>
    </item>
    <item>
      <title>Re: Syslog Message.</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887028#M818971</link>
      <description>Hi David,&lt;BR /&gt;&lt;BR /&gt;Well...3 pages (12 Mb) deallocated is not that bad. But on the otherhand they're all on the same DIMM. You're not in any imminent danger but I would think that you should schedule downtime to have that particular DIMM replaced under warranty within the next couple of weeks...just to be safe....and to get that 12 MB of RAM back.&lt;BR /&gt;&lt;BR /&gt;Rgds,&lt;BR /&gt;Jeff</description>
      <pubDate>Tue, 21 Jan 2003 20:25:37 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887028#M818971</guid>
      <dc:creator>Jeff Schussele</dc:creator>
      <dc:date>2003-01-21T20:25:37Z</dc:date>
    </item>
    <item>
      <title>Re: Syslog Message.</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887029#M818972</link>
      <description>Hi:&lt;BR /&gt;&lt;BR /&gt; Thanks for all the answers. I just got couple of more questions to clarify/learn:&lt;BR /&gt;&lt;BR /&gt;1.How do we know from the reports (attached above)that it occurs on the same DIMM (Jeff pl.help_&lt;BR /&gt;&lt;BR /&gt;2. CRC - what is this&lt;BR /&gt;&lt;BR /&gt;3. Is STM the only way to find this error or is there any other alternate method?&lt;BR /&gt;&lt;BR /&gt;Please advise.&lt;BR /&gt;&lt;BR /&gt;Thanks&lt;BR /&gt;David.</description>
      <pubDate>Thu, 23 Jan 2003 19:45:52 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887029#M818972</guid>
      <dc:creator>david_252</dc:creator>
      <dc:date>2003-01-23T19:45:52Z</dc:date>
    </item>
    <item>
      <title>Re: Syslog Message.</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887030#M818973</link>
      <description>Hi (again) David,&lt;BR /&gt;&lt;BR /&gt;Ok, I'll tackle your questions.&lt;BR /&gt;&lt;BR /&gt;1) From the stm report you'll note:&lt;BR /&gt;A) on the err log summary, ALL errors occurred on DIMM EXT1 (mem carrier 1) / Ob (slot b - all memory comes in DIMM pairs. In this case 0/a &amp;amp; 0/b are a pair). So you see that ALL errors are on the 0b DIMM&lt;BR /&gt;B) on the PDT - you can match back the addr (0x03dbf38, b38 &amp;amp; fb8) to the summary to verify that these are all on EXT1/0b, even the the PDT references both 0a &amp;amp; 0b.&lt;BR /&gt;&lt;BR /&gt;2) CRC - Cyclical Redundancy Check. This is the err checking schema employed. Basically a checksum used to calculate whether the value has changed or not.&lt;BR /&gt;&lt;BR /&gt;3) Although STM is the best way, there are others. The system log (/var/adm/syslog/syslog.log) as well as the GSP error log are other places to find these errors denoted.&lt;BR /&gt;&lt;BR /&gt;HTH,&lt;BR /&gt;Jeff</description>
      <pubDate>Thu, 23 Jan 2003 20:06:58 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/syslog-message/m-p/2887030#M818973</guid>
      <dc:creator>Jeff Schussele</dc:creator>
      <dc:date>2003-01-23T20:06:58Z</dc:date>
    </item>
  </channel>
</rss>

