<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to check the sanity of disks, controllers, and i/o subsystem? in Operating System - HP-UX</title>
    <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640018#M646543</link>
    <description>Yogeeraj&lt;BR /&gt;&lt;BR /&gt;Well, the measureware will NOT re-install! But as to your query regarding the emailing of problems, I have just set up SCM (Service Control Manager), which is free off the latest Support+ CD's, this with EMS and ODE enables me to send email notifications thru to me, or, if your Cellular service provider allows this -- to a mobile phone via SMS! It also integrates VERY nicely with HP TopTools v.5.5&lt;BR /&gt;&lt;BR /&gt;I am a little stumped on your problems with the L1000... will read thru all this again!&lt;BR /&gt;&lt;BR /&gt;MND&lt;BR /&gt;&lt;BR /&gt;PS: Think it is time I came and spent a week in MU!</description>
    <pubDate>Mon, 14 Jan 2002 12:29:32 GMT</pubDate>
    <dc:creator>Marc Dijkstra</dc:creator>
    <dc:date>2002-01-14T12:29:32Z</dc:date>
    <item>
      <title>How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2639999#M646524</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;I am currently investigating a serious problem that we encountered with the Oracle 8i database on our L1000 last Saturday (05/01/2002). &lt;BR /&gt;&lt;BR /&gt;The error occurred with one of our Oracle datafiles, and our users were unable to use our application for some hours. We feared the worst. Fortunately, there was no data loss. We were able to identify the problem and create another datafile to replace the defective one.&lt;BR /&gt;&lt;BR /&gt;The error we got when running applications was ORA-01115:&lt;BR /&gt; ______________________________________________________________&lt;BR /&gt;&amp;gt;&lt;BR /&gt;&amp;gt; ORA-01115: IO error reading block from file 13 (block #45276)&lt;BR /&gt;&amp;gt; ORA-01110: data file 13:&lt;BR /&gt;&amp;gt; '/d06/oradata/cmtdb/pfs_indx_kn01.dbf'&lt;BR /&gt;&amp;gt; ORA-27050: function called with invalid FIB/IOV structure&lt;BR /&gt;&amp;gt; Additional information: 10&lt;BR /&gt; ______________________________________________________________&lt;BR /&gt;&lt;BR /&gt;Since then we have been investigating all possible causes of the problem.&lt;BR /&gt;&lt;BR /&gt;One of our Oracle contacts mentioned possible problems with HARDWARE and recommended: "Run operating system level utilities and diagnostic tools that check for the sanity of disks, controllers, and the I/O subsystem" (Please find attached an excerpt of the report received from Oracle, and my log files.)&lt;BR /&gt;&lt;BR /&gt;How do I troubleshoot this?&lt;BR /&gt;&lt;BR /&gt;Thank you very much for a reply.&lt;BR /&gt;&lt;BR /&gt;Regards&lt;BR /&gt;Yogeeraj</description>
      <pubDate>Mon, 07 Jan 2002 13:32:45 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2639999#M646524</guid>
      <dc:creator>Yogeeraj</dc:creator>
      <dc:date>2002-01-07T13:32:45Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640000#M646525</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;Try STM.&lt;BR /&gt;&lt;BR /&gt;&lt;A href="https://software.hp.com/cgi-bin/swdepot_parser.cgi/cgi/try.pl?productNumber=B6191AAE&amp;amp;date=" target="_blank"&gt;https://software.hp.com/cgi-bin/swdepot_parser.cgi/cgi/try.pl?productNumber=B6191AAE&amp;amp;date=&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;you can get this from your support Plus CD.&lt;BR /&gt;&lt;BR /&gt;Here is the STM FAQ,&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://docs.hp.com/hpux/onlinedocs/diag/stm/stm_faq.htm" target="_blank"&gt;http://docs.hp.com/hpux/onlinedocs/diag/stm/stm_faq.htm&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Here is the link from hp docs site for more info,&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://docs.hp.com/hpux/diag/index.html#Online%20Diagnostics:%20Support%20Tools%20Manager%20(STM)" target="_blank"&gt;http://docs.hp.com/hpux/diag/index.html#Online%20Diagnostics:%20Support%20Tools%20Manager%20(STM)&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Hope this helps.&lt;BR /&gt;&lt;BR /&gt;Regds&lt;BR /&gt;</description>
      <pubDate>Mon, 07 Jan 2002 13:39:28 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640000#M646525</guid>
      <dc:creator>Sanjay_6</dc:creator>
      <dc:date>2002-01-07T13:39:28Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640001#M646526</link>
      <description>You can use the "stm" programs, I prefer xstm (graphical version).&lt;BR /&gt;&lt;BR /&gt;Also, did you get any errors in syslog?&lt;BR /&gt;&lt;BR /&gt;live free or die&lt;BR /&gt;harry</description>
      <pubDate>Mon, 07 Jan 2002 13:40:34 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640001#M646526</guid>
      <dc:creator>harry d brown jr</dc:creator>
      <dc:date>2002-01-07T13:40:34Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640002#M646527</link>
      <description>Hi:&lt;BR /&gt;&lt;BR /&gt;Install the EMS and Predictive Support diagnostic tools.  These are available on the SupportPlus CDROM as part of the DIAGNOSTICS bundle and/or from the link below.  These tools will give you early warning alerts of hardware problems.&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://www.software.hp.com/cgi-bin/swdepot_parser.cgi/cgi/displayProductInfo.pl?productNumber=B6191AAE" target="_blank"&gt;http://www.software.hp.com/cgi-bin/swdepot_parser.cgi/cgi/displayProductInfo.pl?productNumber=B6191AAE&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Regards!&lt;BR /&gt;&lt;BR /&gt;...JRF...</description>
      <pubDate>Mon, 07 Jan 2002 13:47:27 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640002#M646527</guid>
      <dc:creator>James R. Ferguson</dc:creator>
      <dc:date>2002-01-07T13:47:27Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640003#M646528</link>
      <description>Also check for file system errors.  Are there any vxfs errors in syslog or dmesg?  If so you may need to umount the file system and run a full fsck.  I would also recommend installing the latest vxfs patches if this is the case.&lt;BR /&gt;&lt;BR /&gt;Regards,&lt;BR /&gt;Steve</description>
      <pubDate>Mon, 07 Jan 2002 13:51:37 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640003#M646528</guid>
      <dc:creator>Steven Gillard_2</dc:creator>
      <dc:date>2002-01-07T13:51:37Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640004#M646529</link>
      <description>As the others have said, STM will be your best bet, as will looking at the system logs (dmesg and /var/adm/syslog/syslog.log).  &lt;BR /&gt;&lt;BR /&gt;If you just want to check individual disks, like the one that contained that data file, you can also do a dd of the disks.&lt;BR /&gt;&lt;BR /&gt;Do something like:&lt;BR /&gt;&lt;BR /&gt;# dd if=/dev/dsk/c#t#d# of=/dev/null bs=4k&lt;BR /&gt;&lt;BR /&gt;and if it completes successfully with no errors then the disk is probably OK.  If you do get a read error, then you have a disk problem and the disk should be replaced at the earliest possible opportunity.</description>
      <pubDate>Mon, 07 Jan 2002 14:37:06 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640004#M646529</guid>
      <dc:creator>Patrick Wallek</dc:creator>
      <dc:date>2002-01-07T14:37:06Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640005#M646530</link>
      <description>Hi:&lt;BR /&gt;&lt;BR /&gt;The others have given you the answer (stm or xstm), but the better question is "How can I set up my system so that disk/controller/cable failures don't harm my application?" The answer to that is either Mirror/UX or arrays with multiple paths. If done correctly, in most cases you can repair the equipment without ever taking the system down. You can take this to the next level with MC/ServiceGuard. For critical systems, you need to take the approach that stuff happens. When attacked correctly, you can then say 'so what' and your users never know anything has happened.&lt;BR /&gt;&lt;BR /&gt;Food for thought, Clay&lt;BR /&gt;</description>
      <pubDate>Mon, 07 Jan 2002 14:59:52 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640005#M646530</guid>
      <dc:creator>A. Clay Stephenson</dc:creator>
      <dc:date>2002-01-07T14:59:52Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640006#M646531</link>
      <description>Basically it looks like a disk failure.&lt;BR /&gt;.. check that the disks are shown as CLAIMED in the S/W State column of this command:&lt;BR /&gt;.. # ioscan -fnC disk &lt;BR /&gt;.. on each disk of that volume group run:&lt;BR /&gt;.. # diskinfo /dev/rdsk/cXtYdZ ; this should return correct manufacturer info and a non-zero size.&lt;BR /&gt;.. success of the above dd command would indicate that the disks are OK.&lt;BR /&gt;.. check syslog.log &amp;amp; OLDsyslog.log for LBOLT, SCSI timeout &amp;amp; Power Fail messages about a dev_t with a hex number, for example 0x1f006000 --&amp;gt; remove the last 2 zeros, &amp;amp; the disk is c0t6d0.   &lt;BR /&gt;.. check the IO Timeout (Seconds) with pvdisplay /dev/dsk/cXtYdZ; if it's default, verify with the DB vendor what the appropriate value is.&lt;BR /&gt;.. the sanity of filesystems is checked with:&lt;BR /&gt;# fsck -F fstype -y -o full,nolog /dev/vgXX/rlvolYY&lt;BR /&gt;to run fsck you need to unmount the filesystem of course.&lt;BR /&gt;&lt;BR /&gt;G'd luck&lt;BR /&gt;t++&lt;BR /&gt;</description>
      <pubDate>Mon, 07 Jan 2002 18:17:11 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640006#M646531</guid>
      <dc:creator>T. M. Louah</dc:creator>
      <dc:date>2002-01-07T18:17:11Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640007#M646532</link>
      <description>Hello everybody,&lt;BR /&gt;&lt;BR /&gt;Thanks for all these replies.&lt;BR /&gt;I have checked my /var/adm/syslog/syslog.log. No errors have been logged!!&lt;BR /&gt;&lt;BR /&gt;# ioscan -fnC disk&lt;BR /&gt;Class     I  H/W Path     Driver S/W State   H/W Type     Description&lt;BR /&gt;=====================================================================&lt;BR /&gt;disk     17  0/0/1/1.0.0  sdisk CLAIMED     DEVICE       SEAGATE ST318404LC&lt;BR /&gt;                         /dev/dsk/c1t0d0   /dev/rdsk/c1t0d0&lt;BR /&gt;disk      0  0/0/1/1.2.0  sdisk CLAIMED     DEVICE       SEAGATE ST318404LC&lt;BR /&gt;                         /dev/dsk/c1t2d0   /dev/rdsk/c1t2d0&lt;BR /&gt;disk     18  0/0/2/0.0.0  sdisk CLAIMED     DEVICE       SEAGATE ST318404LC&lt;BR /&gt;                         /dev/dsk/c2t0d0   /dev/rdsk/c2t0d0&lt;BR /&gt;disk      1  0/0/2/0.2.0  sdisk CLAIMED     DEVICE       SEAGATE ST318404LC&lt;BR /&gt;                         /dev/dsk/c2t2d0   /dev/rdsk/c2t2d0&lt;BR /&gt;disk      2  0/0/2/1.2.0  sdisk CLAIMED     DEVICE       HP      DVD-ROM 304&lt;BR /&gt;                         /dev/dsk/c3t2d0   /dev/dsk/cdrom    /dev/rdsk/c3t2d0&lt;BR /&gt;#&lt;BR /&gt;&lt;BR /&gt;I would also add that the problem datafile contained some indexes for my Oracle tables. Hence, I was fortunate not to suffer any data loss. The indexes could be rebuilt without any problem in another tablespace/datafile. &lt;BR /&gt;&lt;BR /&gt;The tablespace that used the problem datafile has been left intact. I would like to know what to do if this is not a case of disk failure.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;regards&lt;BR /&gt;yogeeraj</description>
      <pubDate>Tue, 08 Jan 2002 08:31:16 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640007#M646532</guid>
      <dc:creator>Yogeeraj</dc:creator>
      <dc:date>2002-01-08T08:31:16Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640008#M646533</link>
      <description>If I were you, I would do as many of the others suggest and use STM and LOGTOOL to check for evidence of a hardware problem.&lt;BR /&gt;&lt;BR /&gt;Under X Windows use&lt;BR /&gt;# xstm&lt;BR /&gt;go to Tools -&amp;gt; Utility -&amp;gt; Run&lt;BR /&gt;select LOGTOOL.&lt;BR /&gt;&lt;BR /&gt;select raw current log&lt;BR /&gt;format raw log&lt;BR /&gt;then view formatted log.&lt;BR /&gt;This GUI is very easy to use, and you can get a text file that reports the events for hardware on which it detected I/O errors.&lt;BR /&gt;Then check the I/O path that corresponds to your index file.</description>
      <pubDate>Tue, 08 Jan 2002 08:50:51 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640008#M646533</guid>
      <dc:creator>Printaporn_1</dc:creator>
      <dc:date>2002-01-08T08:50:51Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640009#M646534</link>
      <description>(In a later response,) You indicated that you did not see any errors in your syslog.log file, but:&lt;BR /&gt;&lt;BR /&gt;1) Do you still have the syslog.log file *of the time of the Oracle (ORA-01115) error*? I.e. the system may report no errors *now*, but may have reported errors *before*.&lt;BR /&gt;&lt;BR /&gt;2) Have you set up dmesg(1M) as per the example in the manual page (or root's crontab)? If so, does *that* log contain any errors?</description>
      <pubDate>Fri, 11 Jan 2002 10:34:29 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640009#M646534</guid>
      <dc:creator>Frank Slootweg</dc:creator>
      <dc:date>2002-01-11T10:34:29Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640010#M646535</link>
      <description>Yogeeraj&lt;BR /&gt;&lt;BR /&gt;We had a similar problem.  Unfortunately, going to HP &amp;amp; saying it all went wrong &amp;amp; there is no evidence in syslog.log or ?stm does not cut much ice!!!  Below is the way we convinced them to take the problem more seriously (&amp;amp; we got a fix) --&amp;gt;&lt;BR /&gt;&lt;BR /&gt;Do you run MeasureWare?  If so you can do a few things to prove when it went pear shaped.&lt;BR /&gt;&lt;BR /&gt;Make a reptall file, say rep.GLOBAL, with the following in it:&lt;BR /&gt;&lt;BR /&gt;REPORT "MWA Export !DATE !TIME Logfile: !LOGFILE !COLLECTOR !SYSTEM_ID"&lt;BR /&gt;FORMAT ASCII&lt;BR /&gt;HEADINGS ON&lt;BR /&gt;SEPARATOR="|"&lt;BR /&gt;SUMMARY=60&lt;BR /&gt;MISSING=0&lt;BR /&gt; &lt;BR /&gt;DATA TYPE DISK&lt;BR /&gt; DATE&lt;BR /&gt; TIME&lt;BR /&gt; BYDSK_DEVNAME&lt;BR /&gt; BYDSK_PHYS_READ_RATE&lt;BR /&gt; BYDSK_PHYS_WRITE_RATE&lt;BR /&gt; BYDSK_PHYS_IO_RATE&lt;BR /&gt; BYDSK_SYSTEM_IO_RATE&lt;BR /&gt; BYDSK_UTIL&lt;BR /&gt; BYDSK_REQUEST_QUEUE&lt;BR /&gt;** The below metrics are optional &amp;amp; may be useful&lt;BR /&gt; BYDSK_LOGL_READ_RATE&lt;BR /&gt; BYDSK_LOGL_WRITE_RATE&lt;BR /&gt; BYDSK_FS_READ_RATE&lt;BR /&gt; BYDSK_FS_WRITE_RATE&lt;BR /&gt; BYDSK_RAW_READ_RATE&lt;BR /&gt; BYDSK_RAW_WRITE_RATE&lt;BR /&gt; BYDSK_VM_IO_RATE&lt;BR /&gt;&lt;BR /&gt;Everything above the comment line I would use.&lt;BR /&gt;&lt;BR /&gt;Do the extract for the day that went haywire.  
Say it was 10 Jan 2002 from 10:00 to 11:00; I would add an hour either side, so 9:00 to 12:00.&lt;BR /&gt;&lt;BR /&gt;# extract -xp -v -d -r rep.GLOBAL -b 01/10/02 09:00 -e 01/10/02 12:00&lt;BR /&gt;&lt;BR /&gt;This will report on all disks, so you will need to extract the info from the xfrdDISK.asc file for each disk.&lt;BR /&gt;&lt;BR /&gt;# egrep "0/0/1/1.0.0|MWA|Dev|Nam" xfrdDISK.asc &amp;gt; disk1.asc&lt;BR /&gt;# egrep "0/0/1/1.2.0|MWA|Dev|Nam" ..etc..&lt;BR /&gt;&lt;BR /&gt;From this you will get the disk?.asc files.&lt;BR /&gt;Copy them over to a PC with Excel on it &amp;amp; import.&lt;BR /&gt;&lt;BR /&gt;For each disk also calculate the IO time (or a guesstimate of the avserv time in sar -d).  IO time is in ms (milliseconds):&lt;BR /&gt;IO Time = BYDSK_UTIL * 10 / BYDSK_PHYS_IO_RATE&lt;BR /&gt;&lt;BR /&gt;You can now draw some graphs for each disk. Here are some guidelines:&lt;BR /&gt; o Disk %: if this is high you may have a problem/bottleneck&lt;BR /&gt; o For the ST3?? disks an IO time of 8ms is OK (expected); any less &amp;amp; you are doing well; much more than 16-20ms &amp;amp; you have problems.  The ST3?? are 10,000 rpm, which gives about 3ms average rotational latency, plus about 5ms seek time; this gives about 8ms AVERAGE time spent looking for the data, so an IO time of 8ms is fine.  However it should be a bit lower than this if you use buffercache or are doing massive reads or writes etc.&lt;BR /&gt; o Check that the reads &amp;amp; writes seem OK (BYDSK_PHYS_READ_RATE &amp;amp; BYDSK_PHYS_WRITE_RATE)&lt;BR /&gt;If you do use buffercache you may see no reads but lots of system IO; this is OK as it goes via buffercache.&lt;BR /&gt; o Any queues on the disk are bad&lt;BR /&gt; o I'm told that a later version of MeasureWare also reports the amount of data transferred (kB/s); this may be useful to look at.&lt;BR /&gt;&lt;BR /&gt;If your controllers or disks were duff I would expect to see high disk utilisation &amp;amp; low IO throughput (i.e. IO time would be high).  
We had a similar problem with fc60 disks &amp;amp; it ended up being that the kernel parameter scsi_max_qdepth was too low.  &lt;BR /&gt;&lt;BR /&gt;** please bear in mind our system was Fibre Channel; yours seems to be SCSI, so the above kernel parameter may not be the problem **&lt;BR /&gt;&lt;BR /&gt;Tim&lt;BR /&gt;</description>
      <pubDate>Fri, 11 Jan 2002 11:22:57 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640010#M646535</guid>
      <dc:creator>Tim D Fulford</dc:creator>
      <dc:date>2002-01-11T11:22:57Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640011#M646536</link>
      <description>Thanks Frank.&lt;BR /&gt;My answers are NO and NO.&lt;BR /&gt;-------------------------------------------&lt;BR /&gt;In fact, yesterday I did some further tests to try to locate where exactly the problem might be.&lt;BR /&gt;&lt;BR /&gt;I did the following in sequence:&lt;BR /&gt;=========================================================&lt;BR /&gt; a. Restart the server&lt;BR /&gt;          Verify that the database is shutting down and starting up correctly&lt;BR /&gt; b. Shut down the database&lt;BR /&gt; c. Unmount all user file systems and run fsck&lt;BR /&gt; d. Check for any evident hardware problems using the HP-UX 11 OS utilities STM and logtool&lt;BR /&gt; e. Mount all file systems&lt;BR /&gt; f. Restart the database&lt;BR /&gt; g. Create a new table on the problem tablespace with an initial extent of the same size as the tablespace.&lt;BR /&gt; h. Populate the table with data that will fill up the initial extent.&lt;BR /&gt; i. Query or export the table and check for possible errors&lt;BR /&gt;(checking for errors at each step)&lt;BR /&gt;=========================================================&lt;BR /&gt;I detected no problems.&lt;BR /&gt;&lt;BR /&gt;I am still afraid to reuse the 700MB of space used by the datafile that got the data block errors.&lt;BR /&gt;&lt;BR /&gt;I am attaching my syslog.&lt;BR /&gt;&lt;BR /&gt;Thank you all for your replies.&lt;BR /&gt;&lt;BR /&gt;Any further help and recommendations will be most welcome.&lt;BR /&gt;&lt;BR /&gt;Best Regards&lt;BR /&gt;Yogeeraj</description>
      <pubDate>Fri, 11 Jan 2002 11:40:39 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640011#M646536</guid>
      <dc:creator>Yogeeraj</dc:creator>
      <dc:date>2002-01-11T11:40:39Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640012#M646537</link>
      <description>This is my STM report:&lt;BR /&gt;(The overtemp messages are when the last we had a power-cut. The IO error are they related to the test i had been doing sometime ago whereby if had filled up some files systems quite often?)&lt;BR /&gt;============================================================&lt;BR /&gt;.... L1000  :  132.147.160.9 .... &lt;BR /&gt;&lt;BR /&gt;-- Logtool Utility: View Formatted Summary --&lt;BR /&gt;&lt;BR /&gt;Summary of:           /var/stm/logs/os/log1.fmt1&lt;BR /&gt;Formatted from:       /var/stm/logs/os/log1.raw.cur&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;  Date/time of first entry:    Sun Aug 20 15:45:37 2000&lt;BR /&gt; &lt;BR /&gt;  Date/time of last  entry:    Sat Dec 22 09:16:44 2001&lt;BR /&gt; &lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;  Number of LPMC entries:               0 &lt;BR /&gt;  Number of System Overtemp entries:    15 &lt;BR /&gt;  Number of LVM entries:                0 &lt;BR /&gt;  Number of Logger Event entries:       0 &lt;BR /&gt;&lt;BR /&gt;  Number of I/O Error entries:          228 &lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;    Device paths for which entries exist: &lt;BR /&gt;&lt;BR /&gt;       (220)  0/0/2/1.2.0&lt;BR /&gt;       (4)  0/0/2/0.0.0&lt;BR /&gt;       (2)  0/0/2/0.2.0&lt;BR /&gt;       (2)  0/0/1/1.2.0&lt;BR /&gt;&lt;BR /&gt;    Products for which entries exist: &lt;BR /&gt;&lt;BR /&gt;       (228)  SCSI Disk &lt;BR /&gt;&lt;BR /&gt;    Product Qualifiers for which entries exist: &lt;BR /&gt;&lt;BR /&gt;       (220)  HPDVD-ROM &lt;BR /&gt;       (8)  SEAGATEST318404LC&lt;BR /&gt;&lt;BR /&gt;    Logger Events for which entries exist: &lt;BR /&gt;&lt;BR /&gt;       (228)  sdisk &lt;BR /&gt;&lt;BR /&gt;    Device Types for which entries exist: &lt;BR /&gt;&lt;BR /&gt;       (228)  Disk &lt;BR /&gt;&lt;BR /&gt;    Device Qualifiers for which entries exist: &lt;BR /&gt;&lt;BR /&gt;       (220)  DVDROM &lt;BR /&gt;       (8)  Hard&lt;BR /&gt;</description>
      <pubDate>Fri, 11 Jan 2002 11:46:12 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640012#M646537</guid>
      <dc:creator>Yogeeraj</dc:creator>
      <dc:date>2002-01-11T11:46:12Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640013#M646538</link>
      <description>Hi Yogeeraj!&lt;BR /&gt;&lt;BR /&gt;I see from your response that you have done the test discussed and that there were no errors on the L-class.&lt;BR /&gt;&lt;BR /&gt;Have you checked for any evidence of errors on the GSP?&lt;BR /&gt;(secure Web Console)&lt;BR /&gt;&lt;BR /&gt;Also, I think that Tim's suggestion with the Measureware is a good one. You can load the demo for Measureware and PerfView from your 11.0 application CD's.&lt;BR /&gt;&lt;BR /&gt;MND</description>
      <pubDate>Fri, 11 Jan 2002 12:22:36 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640013#M646538</guid>
      <dc:creator>Marc Dijkstra</dc:creator>
      <dc:date>2002-01-11T12:22:36Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640014#M646539</link>
      <description>&amp;gt; The IO error are they related to the test i had been doing sometime ago whereby if had filled up some files systems quite often?)&lt;BR /&gt;&lt;BR /&gt;*NO*! Full file systems do not give I/O errors. Since STM reported many I/O errors and the original Oracle error also said "IO error" (or some such), you will have to look at these I/O errors. Perhaps others can help with that (as I have (nearly) no experience with STM). &lt;BR /&gt;&lt;BR /&gt;STM mentions these addresses:&lt;BR /&gt;&lt;BR /&gt;&amp;gt; (220) 0/0/2/1.2.0&lt;BR /&gt;&amp;gt; (4) 0/0/2/0.0.0&lt;BR /&gt;&amp;gt; (2) 0/0/2/0.2.0&lt;BR /&gt;&amp;gt; (2) 0/0/1/1.2.0&lt;BR /&gt;&lt;BR /&gt;So it would be interesting to know whether /d06/oradata/cmtdb/pfs_indx_kn01.dbf&lt;BR /&gt;is on any of these addresses:&lt;BR /&gt;&lt;BR /&gt;bdf /d06/oradata/cmtdb/pfs_indx_kn01.dbf (gives LV name)&lt;BR /&gt;lvdisplay -v /dev/vg??/... (gives PV (/dev/dsk/...) name(s))&lt;BR /&gt;lssf /dev/dsk/... (gives hardware address(es) of PV(s)/disk(s))&lt;BR /&gt;</description>
      <pubDate>Fri, 11 Jan 2002 12:54:27 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640014#M646539</guid>
      <dc:creator>Frank Slootweg</dc:creator>
      <dc:date>2002-01-11T12:54:27Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640015#M646540</link>
      <description>This area that is giving problems, is it on the AutoRaid 12H? If I remember correctly there is a utility to look after arrays (arraymgr or some such) that will pop up errors if there is a problem with a LUN.&lt;BR /&gt;&lt;BR /&gt;Is the K-Class server also talking to this area at the same time, or is the mapping separate?&lt;BR /&gt;&lt;BR /&gt;MND</description>
      <pubDate>Fri, 11 Jan 2002 13:06:48 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640015#M646540</guid>
      <dc:creator>Marc Dijkstra</dc:creator>
      <dc:date>2002-01-11T13:06:48Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640016#M646541</link>
      <description>Attention: Mr. Frank Slootweg&lt;BR /&gt;Thanks for the reply and comments.&lt;BR /&gt;/d06/oradata/cmtdb/pfs_indx_kn01.dbf (mirrored) is on &lt;BR /&gt;0/0/1/1.0.0 /dev/dsk/c1t0d0 and&lt;BR /&gt;0/0/2/0.0.0 /dev/dsk/c2t0d0&lt;BR /&gt;&lt;BR /&gt;Hence, it is on one of the device paths for which the STM report shows errors, i.e. 0/0/2/0.0.0 (4 errors).&lt;BR /&gt;&lt;BR /&gt;I would also like to mention that I have a large file system in the same Volume Group (VG 01) that is not mirrored and spans the 2 disks. &lt;BR /&gt;LV Name                     /dev/vg01/lv_d05&lt;BR /&gt;   LV Status                   available/syncd           &lt;BR /&gt;   LV Size (Mbytes)            8192            &lt;BR /&gt;   Current LE                  2048      &lt;BR /&gt;   Allocated PE                2048        &lt;BR /&gt;   Used PV                     2&lt;BR /&gt;--- Distribution of logical volume ---&lt;BR /&gt;PV Name            LE on PV  PE on PV&lt;BR /&gt;/dev/dsk/c1t0d0    1722      1722&lt;BR /&gt;/dev/dsk/c2t0d0    326       326&lt;BR /&gt;&lt;BR /&gt;mounted on /d05 (Oracle 9iAS)&lt;BR /&gt;===================================================&lt;BR /&gt;L1000: home/deg&amp;gt; bdf /d06/oradata/cmtdb/pfs_indx_kn01.dbf&lt;BR /&gt;Filesystem          kbytes    used   avail %used Mounted on&lt;BR /&gt;/dev/vg01/lv_d06   4194304 3687617  475024   89% /d06&lt;BR /&gt;&lt;BR /&gt;L1000: home/deg&amp;gt; lvdisplay -v /dev/vg01/lv_d06&lt;BR /&gt;--- Logical volumes ---&lt;BR /&gt;LV Name                     /dev/vg01/lv_d06&lt;BR /&gt;VG Name                     /dev/vg01&lt;BR /&gt;LV Permission               read/write&lt;BR /&gt;LV Status                   available/syncd&lt;BR /&gt;Mirror copies               1&lt;BR /&gt;Consistency Recovery        MWC&lt;BR /&gt;Schedule                    parallel&lt;BR /&gt;LV Size (Mbytes)            4096&lt;BR /&gt;Current LE                  1024&lt;BR /&gt;Allocated PE                2048&lt;BR /&gt;Stripes                     
0&lt;BR /&gt;Stripe Size (Kbytes)        0&lt;BR /&gt;Bad block                   on&lt;BR /&gt;Allocation                  strict&lt;BR /&gt;IO Timeout (Seconds)        default&lt;BR /&gt;&lt;BR /&gt;   --- Distribution of logical volume ---&lt;BR /&gt;   PV Name            LE on PV  PE on PV&lt;BR /&gt;   /dev/dsk/c1t0d0    1024      1024&lt;BR /&gt;   /dev/dsk/c2t0d0    1024      1024&lt;BR /&gt;&lt;BR /&gt;   --- Logical extents ---&lt;BR /&gt;   LE   PV1                PE1  Status 1 PV2                PE2  Status 2&lt;BR /&gt;   0000 /dev/dsk/c1t0d0    0000 current  /dev/dsk/c2t0d0    0000 current&lt;BR /&gt;   0001 /dev/dsk/c1t0d0    0001 current  /dev/dsk/c2t0d0    0001 current&lt;BR /&gt;   0002 /dev/dsk/c1t0d0    0002 current  /dev/dsk/c2t0d0    0002 current&lt;BR /&gt;   0003 /dev/dsk/c1t0d0    0003 current  /dev/dsk/c2t0d0    0003 current&lt;BR /&gt;   0004 /dev/dsk/c1t0d0    0004 current  /dev/dsk/c2t0d0    0004 current&lt;BR /&gt;   0005 /dev/dsk/c1t0d0    0005 current  /dev/dsk/c2t0d0    0005 current&lt;BR /&gt;   ...&lt;BR /&gt;   ...&lt;BR /&gt;   1020 /dev/dsk/c1t0d0    1120 current  /dev/dsk/c2t0d0    2620 current&lt;BR /&gt;   1021 /dev/dsk/c1t0d0    1121 current  /dev/dsk/c2t0d0    2621 current&lt;BR /&gt;   1022 /dev/dsk/c1t0d0    1122 current  /dev/dsk/c2t0d0    2622 current&lt;BR /&gt;   1023 /dev/dsk/c1t0d0    1123 current  /dev/dsk/c2t0d0    2623 current&lt;BR /&gt;&lt;BR /&gt;L1000: home/deg&amp;gt;lssf /dev/dsk/c1t0d0&lt;BR /&gt;sdisk card instance 1 SCSI target 0 SCSI LUN 0 section 0 at address 0/0/1/1.0.0 /dev/dsk/c1t0d0&lt;BR /&gt;&lt;BR /&gt;L1000: home/deg&amp;gt;lssf /dev/dsk/c2t0d0&lt;BR /&gt;sdisk card instance 2 SCSI target 0 SCSI LUN 0 section 0 at address 0/0/2/0.0.0 /dev/dsk/c2t0d0&lt;BR /&gt;&lt;BR /&gt;______________________________________________________________</description>
      <pubDate>Sat, 12 Jan 2002 08:40:25 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640016#M646541</guid>
      <dc:creator>Yogeeraj</dc:creator>
      <dc:date>2002-01-12T08:40:25Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640017#M646542</link>
      <description>Hi Marc.&lt;BR /&gt;Nice to hear from you.&lt;BR /&gt;&lt;BR /&gt;1. GSP&lt;BR /&gt;As far as I remember, the GSP displayed an error the last time there was a power cut. It was about temperature. Since then, I have not seen the front panel ALARM LED blinking yellow. I will check it again on Monday and let you know.&lt;BR /&gt;&lt;BR /&gt;By the way, is it possible to direct the messages generated to an email address? (We have recently configured SMTP on the L1000 so that we can now send emails to our Exchange Server)&lt;BR /&gt;&lt;BR /&gt;NB. The secure web console is still not operational. Remember, we were told that we can have either the console or the web console (not both at the same time)!&lt;BR /&gt;&lt;BR /&gt;2. Measurement software&lt;BR /&gt;Well, it has already expired! I will try to uninstall and then reinstall it and do the tests. I hope it works.&lt;BR /&gt;&lt;BR /&gt;3. Problem area/AutoRAID 12H&lt;BR /&gt;No. The problem is not on the AutoRAID. It is still connected to the K250 and has not been connected to the L1000 yet. We are talking here about the internal disks of the L1000. Remember, we have 4x18 GB disks included in 2 volume groups.&lt;BR /&gt;&lt;BR /&gt;4. Problem area/K250 talking to that area.&lt;BR /&gt;Well, how to explain? Let's see...&lt;BR /&gt;We have 2 file systems from the L1000 that have been mounted on the K250 using NFS. 
&lt;BR /&gt;mount gigal:/BACKUP/ /tmp_mnt/&lt;BR /&gt;mount gigal:/users/ /users1/&lt;BR /&gt;_______________________________________________&lt;BR /&gt;LV Name /dev/vg00/lv_backup&lt;BR /&gt;   LV Status                   available/syncd           &lt;BR /&gt;   LV Size (Mbytes)            2700            &lt;BR /&gt;   Current LE                  675       &lt;BR /&gt;   Allocated PE                675         &lt;BR /&gt;   Used PV                     2&lt;BR /&gt;--- Distribution of logical volume ---&lt;BR /&gt;PV Name            LE on PV  PE on PV&lt;BR /&gt;/dev/dsk/c1t2d0    339       339&lt;BR /&gt;/dev/dsk/c2t2d0    336       336&lt;BR /&gt;&lt;BR /&gt;LV Name                     /dev/vg01/lv_users&lt;BR /&gt;   LV Status                   available/syncd           &lt;BR /&gt;   LV Size (Mbytes)            400             &lt;BR /&gt;   Current LE                  100       &lt;BR /&gt;   Allocated PE                200         &lt;BR /&gt;   Used PV                     2&lt;BR /&gt;--- Distribution of logical volume ---&lt;BR /&gt;PV Name            LE on PV  PE on PV&lt;BR /&gt;/dev/dsk/c1t0d0    100       100&lt;BR /&gt;/dev/dsk/c2t0d0    100       100&lt;BR /&gt;_______________________________________________&lt;BR /&gt;&lt;BR /&gt;Now, /tmp_mnt keeps our Database Export files that are created every night at 23:00 on the K250.&lt;BR /&gt;/users1 keeps user files that are periodically generated on the K250 and that are FTPed (from the L1000) every 5 mins to one of our remote servers, where they are used for a batch update.&lt;BR /&gt;(Hope that this is not too confusing!)&lt;BR /&gt;&lt;BR /&gt;NB. These file systems are not used for Oracle data files.&lt;BR /&gt;&lt;BR /&gt;The only possible case where the two servers might be accessing the same file is the FTP case (described above). The files are on average 200k to 400k each.&lt;BR /&gt;&lt;BR /&gt;Hope that all this answers your questions. 
I will be posting more info from our GSP and reports from MeasureWare on Monday.&lt;BR /&gt;&lt;BR /&gt;Thanks&lt;BR /&gt;Yogeeraj&lt;BR /&gt;</description>
      <pubDate>Sat, 12 Jan 2002 09:49:30 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640017#M646542</guid>
      <dc:creator>Yogeeraj</dc:creator>
      <dc:date>2002-01-12T09:49:30Z</dc:date>
    </item>
    <item>
      <title>Re: How to check the sanity of disks, controllers, and i/o subsystem?</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640018#M646543</link>
      <description>Yogeeraj&lt;BR /&gt;&lt;BR /&gt;Well, MeasureWare will NOT re-install! But as to your query regarding the emailing of problems, I have just set up SCM (Service Control Manager), which is free off the latest Support+ CDs. This, with EMS and ODE, enables me to have email notifications sent through to me or, if your cellular service provider allows it, to a mobile phone via SMS! It also integrates VERY nicely with HP TopTools v.5.5&lt;BR /&gt;&lt;BR /&gt;I am a little stumped on your problems with the L1000... will read through all this again!&lt;BR /&gt;&lt;BR /&gt;MND&lt;BR /&gt;&lt;BR /&gt;PS: Think it is time I came and spent a week in MU!</description>
      <pubDate>Mon, 14 Jan 2002 12:29:32 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/how-to-check-the-sanity-of-disks-controllers-and-i-o-subsystem/m-p/2640018#M646543</guid>
      <dc:creator>Marc Dijkstra</dc:creator>
      <dc:date>2002-01-14T12:29:32Z</dc:date>
    </item>
  </channel>
</rss>

