<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: OS vs Oracle on failing drive in Operating System - HP-UX</title>
    <link>https://community.hpe.com/t5/operating-system-hp-ux/os-vs-oracle-on-failing-drive/m-p/3407012#M201888</link>
    <description>Hi, &lt;BR /&gt;&lt;BR /&gt;In my openion OS shall be latest patched and also latest diagnostics, monitoring tool shall be installed....&lt;BR /&gt;&lt;BR /&gt;Besides this, redo logs shall have multiple copies on the server, 3-copies I believe ateast shall be there for latest logs..And if possible keep the same on contigency copy also...You shall be running some script to copy logs of some time back only...&lt;BR /&gt;&lt;BR /&gt;Hope this helps..&lt;BR /&gt;&lt;BR /&gt;Prashant</description>
    <pubDate>Mon, 25 Oct 2004 08:37:08 GMT</pubDate>
    <dc:creator>Prashant Zanwar_4</dc:creator>
    <dc:date>2004-10-25T08:37:08Z</dc:date>
    <item>
      <title>OS vs Oracle on failing drive</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/os-vs-oracle-on-failing-drive/m-p/3407010#M201886</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;I just want to share this story and get your comments.&lt;BR /&gt;&lt;BR /&gt;A couple days ago, a production Oracle DB halted twice with the following errors in alert.log:&lt;BR /&gt;ARC0: Beginning to archive log# 5 seq# 47&lt;BR /&gt;ARC0: Failed to archive log# 5 seq# 47&lt;BR /&gt;Thu Oct 21 12:50:24 2004&lt;BR /&gt;Log corruption near block 224 change  time &lt;BR /&gt;All Archive destinations made inactive &lt;BR /&gt;ARC1: Failed to archive log# 5 seq# 47&lt;BR /&gt;ARCH: Archival stopped, error occurred. Will continue retrying Thu Oct 21 12:50:24 2004 ORACLE Instance PLIN - Archival Error&lt;BR /&gt;ARCH: Connecting to console port...&lt;BR /&gt;Thu Oct 21 12:50:24 2004&lt;BR /&gt;ORA-16038: log 5 sequence# 47 cannot be archived&lt;BR /&gt;ORA-00354: corrupt redo log block header&lt;BR /&gt;ORA-00312: online log 5 thread 1: '/u03/data/oradata/PLIN/log5.log'&lt;BR /&gt;&lt;BR /&gt;Basically the redo logs became corrupt. This kind of error pointed to the HW, i.e bad disk drive and  DBA moved all redo logs off the local drives and put them onto SAN storage.&lt;BR /&gt;&lt;BR /&gt;The local drives was vg01, 8 drives, 4 drives mirrored over the other four. We ran diskinfo and  dd on all drives - no errors at all. We did not see any errors in syslog.log either. &lt;BR /&gt;&lt;BR /&gt;On a weekend, when rebooting this server (K460 , running HP-UX 11.00) I figured that one drive did fail and it was replaced.&lt;BR /&gt;&lt;BR /&gt;Now I feel kind of uneasy... It looks like Oracle figured a bad drive prior to OS started to report errors. Moreover, even though the drive was mirrored, it really did not help at all! For some reason one failing drive in a mirrored pair caused a production problem. Is there anything I can do about this other than move data off the local drives?&lt;BR /&gt;&lt;BR /&gt;Your opinions are appreciated.&lt;BR /&gt;Elena.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 25 Oct 2004 08:22:01 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/os-vs-oracle-on-failing-drive/m-p/3407010#M201886</guid>
      <dc:creator>Elena Leontieva</dc:creator>
      <dc:date>2004-10-25T08:22:01Z</dc:date>
    </item>
    <item>
      <title>Re: OS vs Oracle on failing drive</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/os-vs-oracle-on-failing-drive/m-p/3407011#M201887</link>
      <description>I've had this happen to me in the past.&lt;BR /&gt;&lt;BR /&gt;1) Shut the databaase and get a gold backup.&lt;BR /&gt;2) Use cstm or mstm or xstm (X wind) and run the excercize command on every disk in the system.&lt;BR /&gt;3) dmesg or vi /var/adm/syslog/syslog.log&lt;BR /&gt;&lt;BR /&gt;If you find a bad disk arrange replacement.&lt;BR /&gt;&lt;BR /&gt;These systems generally have larger numbers of small disks. Its unlikely though entirely possible that Oracle and the boot disk are the same disk.&lt;BR /&gt;&lt;BR /&gt;I hope you have been doing make_tape_recovery tapes handy.&lt;BR /&gt;&lt;BR /&gt;Its always a good idea to have vg00 seperate from your oracle data.&lt;BR /&gt;&lt;BR /&gt;SEP</description>
      <pubDate>Mon, 25 Oct 2004 08:32:10 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/os-vs-oracle-on-failing-drive/m-p/3407011#M201887</guid>
      <dc:creator>Steven E. Protter</dc:creator>
      <dc:date>2004-10-25T08:32:10Z</dc:date>
    </item>
    <item>
      <title>Re: OS vs Oracle on failing drive</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/os-vs-oracle-on-failing-drive/m-p/3407012#M201888</link>
      <description>Hi, &lt;BR /&gt;&lt;BR /&gt;In my openion OS shall be latest patched and also latest diagnostics, monitoring tool shall be installed....&lt;BR /&gt;&lt;BR /&gt;Besides this, redo logs shall have multiple copies on the server, 3-copies I believe ateast shall be there for latest logs..And if possible keep the same on contigency copy also...You shall be running some script to copy logs of some time back only...&lt;BR /&gt;&lt;BR /&gt;Hope this helps..&lt;BR /&gt;&lt;BR /&gt;Prashant</description>
      <pubDate>Mon, 25 Oct 2004 08:37:08 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/os-vs-oracle-on-failing-drive/m-p/3407012#M201888</guid>
      <dc:creator>Prashant Zanwar_4</dc:creator>
      <dc:date>2004-10-25T08:37:08Z</dc:date>
    </item>
    <item>
      <title>Re: OS vs Oracle on failing drive</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/os-vs-oracle-on-failing-drive/m-p/3407013#M201889</link>
      <description>&lt;BR /&gt;&lt;BR /&gt;&amp;gt;&amp;gt; even though the drive was mirrored, it really did not help at all! For some reason one failing drive in a mirrored pair caused a production problem&lt;BR /&gt;&lt;BR /&gt;Well, Oracle does do basic sanity checking on the data. Thus it can report data problems, without IO errors.&lt;BR /&gt;&lt;BR /&gt;The mirroring may actually hinder in finding a problem. Just imagine the HBA / cable injects a bad bits for the write to one of the members. Or one of the members does nto faithfully write through. Now it is going to be pot-luck ads to whether you see good data or bad data. You may be reading froma good disk most of the time, but under heavier load, you may get data from the other member, over a problem path.&lt;BR /&gt;&lt;BR /&gt;&amp;gt;&amp;gt;  Besides this, redo logs shall have multiple copies on the server, 3-copies I believe ateast shall be there for latest logs..And if possible keep the same on contigency copy also...You shall be running some script to copy logs of some time back only...&lt;BR /&gt;&lt;BR /&gt;So you have multiple redo groups in Oracle with multiple members within each group, each of those members being LVM mirrored (for 4+ data copies). Here I would have expected Oracle to do the rigth thing when one member deliver doubtfull data.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;fwiw,&lt;BR /&gt;Hein.&lt;BR /&gt;</description>
      <pubDate>Mon, 25 Oct 2004 09:02:58 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/os-vs-oracle-on-failing-drive/m-p/3407013#M201889</guid>
      <dc:creator>Hein van den Heuvel</dc:creator>
      <dc:date>2004-10-25T09:02:58Z</dc:date>
    </item>
  </channel>
</rss>

