<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: HSG80 - problem determination in Disk Enclosures</title>
    <link>https://community.hpe.com/t5/disk-enclosures/hsg80-problem-determination/m-p/3982268#M23488</link>
    <description>Hi,&lt;BR /&gt;&lt;BR /&gt;Agree 100% withabove noter.&lt;BR /&gt;&lt;BR /&gt;It is also worth looking for unwritable_data. In your note it is more likely to be lost_data but if it is not then try this:&lt;BR /&gt;&lt;BR /&gt;RETRY_ERRORS UNWRITEABLE_DATA D1&lt;BR /&gt;if this works then your data should be OK. You should still check your data integrity once it has completed. You may have to wait for a little while for this to complete. If it dosn't work then use:&lt;BR /&gt;CLEAR_ERRORS D1 UNWRITEABLE_DATA &lt;BR /&gt;the problem with this is that you may loose data but if the retry command does not work then this is what you will have to do to make the unit presentable again.&lt;BR /&gt;NOTE: make sure if you have unwritable data you use the RETRY command first.&lt;BR /&gt;&lt;BR /&gt;Either way as the above noter explained not all / any of the units will not be presented till:&lt;BR /&gt;1: you have first sorted out the cache issue [one or both controllers]&lt;BR /&gt;2: you have cleared lost_data or unwritable_data on each unit concerned&lt;BR /&gt;3: check your files from the os for data integrity&lt;BR /&gt;&lt;BR /&gt;You should look into replacing your batteries ASAP...&lt;BR /&gt;&lt;BR /&gt;Mark...</description>
    <pubDate>Wed, 18 Apr 2007 03:00:00 GMT</pubDate>
    <dc:creator>Mark...</dc:creator>
    <dc:date>2007-04-18T03:00:00Z</dc:date>
    <item>
      <title>HSG80 - problem determination</title>
      <link>https://community.hpe.com/t5/disk-enclosures/hsg80-problem-determination/m-p/3982266#M23486</link>
      <description>Hello,&lt;BR /&gt;There is a problem with Compaq HSG80 controller.&lt;BR /&gt;Can somebody tell me, what i have to do with that situation?&lt;BR /&gt;&lt;BR /&gt;########### LOGS: ###################&lt;BR /&gt;HSG80_B&amp;gt; show this_controller&lt;BR /&gt;%CER--HSG80_B&amp;gt; --16-APR-2007 15:38:39-- Invalid cache -- CLI command set-&lt;BR /&gt;reduced.  Type SHOW THIS_CONTROLLER. Please see product documentation to-&lt;BR /&gt;determine corrective action&lt;BR /&gt;HSG80_B&amp;gt; show this_con&lt;BR /&gt;Controller:&lt;BR /&gt;        HSG80 ZG10707640 Software V86F-13, Hardware  E12&lt;BR /&gt;        NODE_ID          = 5000-1FE1-0014-1A30&lt;BR /&gt;        ALLOCATION_CLASS = 0&lt;BR /&gt;        SCSI_VERSION     = SCSI-2&lt;BR /&gt;        Configured for MULTIBUS_FAILOVER with ZG13802078&lt;BR /&gt;            In dual-redundant configuration&lt;BR /&gt;        Device Port SCSI address 6&lt;BR /&gt;        Time: 16-APR-2007 15:38:40&lt;BR /&gt;        Command Console LUN is disabled&lt;BR /&gt;Host PORT_1:&lt;BR /&gt;        Reported PORT_ID = 5000-1FE1-0014-1A31&lt;BR /&gt;        PORT_1_TOPOLOGY  = FABRIC (standby)&lt;BR /&gt;Host PORT_2:&lt;BR /&gt;        Reported PORT_ID = 5000-1FE1-0014-1A32&lt;BR /&gt;        PORT_2_TOPOLOGY  = FABRIC (standby)&lt;BR /&gt;        NOREMOTE_COPY&lt;BR /&gt;Cache:&lt;BR /&gt;        256 megabyte write cache, version 0022&lt;BR /&gt;        Cache is INVALID.  Cache containing unflushed data&lt;BR /&gt;         has been removed from this controller&lt;BR /&gt;        Unknown unflushed data in cache&lt;BR /&gt;        CACHE_FLUSH_TIMER = DEFAULT (10 seconds)&lt;BR /&gt;Mirrored Cache:&lt;BR /&gt;        256 megabyte write cache, version 0022&lt;BR /&gt;        Cache is INVALID.  Cache containing unflushed data&lt;BR /&gt;         has been removed from this controller&lt;BR /&gt;        No unflushed data in cache&lt;BR /&gt;Battery:&lt;BR /&gt;        NOUPS&lt;BR /&gt;        DANGER: BATTERY LIFETIME HAS EXPIRED, REPLACE BATTERY NOW!&lt;BR /&gt;This controller has an invalid cache module&lt;BR /&gt;Cache battery is near its end of life, it should be replaced SOON.  Run frutil-&lt;BR /&gt;to replace.&lt;BR /&gt;Cache battery charge is low&lt;BR /&gt;Mirror cache battery charge is low&lt;BR /&gt;Invalid cache -- CLI command set reduced.  Type SHOW THIS_CONTROLLER. Please-&lt;BR /&gt;see product documentation to determine corrective action&lt;BR /&gt;HSG80_B&amp;gt; show this_controller&lt;BR /&gt;%CER--HSG80_B&amp;gt; --16-APR-2007 15:38:49-- Invalid cache -- CLI command set-&lt;BR /&gt;reduced.  Type SHOW THIS_CONTROLLER. Please see product documentation to-&lt;BR /&gt;determine corrective action&lt;BR /&gt;HSG80_B&amp;gt; show this_con&lt;BR /&gt;Controller:&lt;BR /&gt;        HSG80 ZG10707640 Software V86F-13, Hardware  E12&lt;BR /&gt;        NODE_ID          = 5000-1FE1-0014-1A30&lt;BR /&gt;        ALLOCATION_CLASS = 0&lt;BR /&gt;        SCSI_VERSION     = SCSI-2&lt;BR /&gt;        Configured for MULTIBUS_FAILOVER with ZG13802078&lt;BR /&gt;            In dual-redundant configuration&lt;BR /&gt;        Device Port SCSI address 6&lt;BR /&gt;        Time: 16-APR-2007 15:38:50&lt;BR /&gt;        Command Console LUN is disabled&lt;BR /&gt;Host PORT_1:&lt;BR /&gt;        Reported PORT_ID = 5000-1FE1-0014-1A31&lt;BR /&gt;        PORT_1_TOPOLOGY  = FABRIC (standby)&lt;BR /&gt;Host PORT_2:&lt;BR /&gt;        Reported PORT_ID = 5000-1FE1-0014-1A32&lt;BR /&gt;        PORT_2_TOPOLOGY  = FABRIC (standby)&lt;BR /&gt;        NOREMOTE_COPY&lt;BR /&gt;Cache:&lt;BR /&gt;        256 megabyte write cache, version 0022&lt;BR /&gt;        Cache is INVALID.  Cache containing unflushed data&lt;BR /&gt;         has been removed from this controller&lt;BR /&gt;        Unknown unflushed data in cache&lt;BR /&gt;        CACHE_FLUSH_TIMER = DEFAULT (10 seconds)&lt;BR /&gt;Mirrored Cache:&lt;BR /&gt;        256 megabyte write cache, version 0022&lt;BR /&gt;        Cache is INVALID.  Cache containing unflushed data&lt;BR /&gt;         has been removed from this controller&lt;BR /&gt;        No unflushed data in cache&lt;BR /&gt;Battery:&lt;BR /&gt;        NOUPS&lt;BR /&gt;        DANGER: BATTERY LIFETIME HAS EXPIRED, REPLACE BATTERY NOW!&lt;BR /&gt;This controller has an invalid cache module&lt;BR /&gt;Cache battery is near its end of life, it should be replaced SOON.  Run frutil-&lt;BR /&gt;to replace.&lt;BR /&gt;Cache battery charge is low&lt;BR /&gt;Mirror cache battery charge is low&lt;BR /&gt;Invalid cache -- CLI command set reduced.  Type SHOW THIS_CONTROLLER. Please-&lt;BR /&gt;see product documentation to determine corrective action&lt;BR /&gt;################################################&lt;BR /&gt;Best Regards&lt;BR /&gt;Rafal N.</description>
      <pubDate>Mon, 16 Apr 2007 11:15:03 GMT</pubDate>
      <guid>https://community.hpe.com/t5/disk-enclosures/hsg80-problem-determination/m-p/3982266#M23486</guid>
      <dc:creator>Rafal Niesiobedzki</dc:creator>
      <dc:date>2007-04-16T11:15:03Z</dc:date>
    </item>
    <item>
      <title>Re: HSG80 - problem determination</title>
      <link>https://community.hpe.com/t5/disk-enclosures/hsg80-problem-determination/m-p/3982267#M23487</link>
      <description>OK - looking at the output, you have only shown one controller (HSG_B) - if this is a dual-controller configuration - check the status of the other controller to see if that also has "invalid cache" - this condition is often caused when the HSG is switched off without performing a controlled shutdown (HSG&amp;gt; SHUTDOWN THIS) - but in your case, it is suggesting the controller was removed with unflushed data in cache. Either way, there is some possibility of data corruption on the attached disk unit(s).&lt;BR /&gt;&lt;BR /&gt;To get rid of the fault condition type in:&lt;BR /&gt;&lt;BR /&gt;HSG&amp;gt; clear this invalid_cache destroy_unflushed_data&lt;BR /&gt;&lt;BR /&gt;This will clear the cache (you may need to do this on both controllers). You also need to check each of the units for "lost data" which often accompanies the "invalid cache" condition. For each unit with lost data - type in :&lt;BR /&gt;&lt;BR /&gt;HSG&amp;gt;Clear D1 lost_data   (then D2, D3, D4 etc)&lt;BR /&gt;&lt;BR /&gt;Like I said - if write operations were in progress when the error occured - you could have some data corruption. The only way to find out is to run a filesystem or database check on the units (you can't do this from the HSG itself)&lt;BR /&gt;&lt;BR /&gt;A couple of other thing - your firmware (ACS) version is low - it should really be at V8.7 or 8.8, and the batteries are shown as needing replacement.&lt;BR /&gt;&lt;BR /&gt;Rich&lt;BR /&gt;</description>
      <pubDate>Mon, 16 Apr 2007 20:14:41 GMT</pubDate>
      <guid>https://community.hpe.com/t5/disk-enclosures/hsg80-problem-determination/m-p/3982267#M23487</guid>
      <dc:creator>rich pattison</dc:creator>
      <dc:date>2007-04-16T20:14:41Z</dc:date>
    </item>
    <item>
      <title>Re: HSG80 - problem determination</title>
      <link>https://community.hpe.com/t5/disk-enclosures/hsg80-problem-determination/m-p/3982268#M23488</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;Agree 100% withabove noter.&lt;BR /&gt;&lt;BR /&gt;It is also worth looking for unwritable_data. In your note it is more likely to be lost_data but if it is not then try this:&lt;BR /&gt;&lt;BR /&gt;RETRY_ERRORS UNWRITEABLE_DATA D1&lt;BR /&gt;if this works then your data should be OK. You should still check your data integrity once it has completed. You may have to wait for a little while for this to complete. If it dosn't work then use:&lt;BR /&gt;CLEAR_ERRORS D1 UNWRITEABLE_DATA &lt;BR /&gt;the problem with this is that you may loose data but if the retry command does not work then this is what you will have to do to make the unit presentable again.&lt;BR /&gt;NOTE: make sure if you have unwritable data you use the RETRY command first.&lt;BR /&gt;&lt;BR /&gt;Either way as the above noter explained not all / any of the units will not be presented till:&lt;BR /&gt;1: you have first sorted out the cache issue [one or both controllers]&lt;BR /&gt;2: you have cleared lost_data or unwritable_data on each unit concerned&lt;BR /&gt;3: check your files from the os for data integrity&lt;BR /&gt;&lt;BR /&gt;You should look into replacing your batteries ASAP...&lt;BR /&gt;&lt;BR /&gt;Mark...</description>
      <pubDate>Wed, 18 Apr 2007 03:00:00 GMT</pubDate>
      <guid>https://community.hpe.com/t5/disk-enclosures/hsg80-problem-determination/m-p/3982268#M23488</guid>
      <dc:creator>Mark...</dc:creator>
      <dc:date>2007-04-18T03:00:00Z</dc:date>
    </item>
    <item>
      <title>Re: HSG80 - problem determination</title>
      <link>https://community.hpe.com/t5/disk-enclosures/hsg80-problem-determination/m-p/3982269#M23489</link>
      <description>Hi Mark&lt;BR /&gt;I think you'll find that If a storageset or disk drive fails before its data has been written to it, the controller reports an unwriteable data error, but in this case we know the cache is invalid, so this can't be the case - unless cache and unit both failed at the same time.&lt;BR /&gt;&lt;BR /&gt;Unwriteable data would still be held in cache and a retry (flush) might be possible if the unit is back online, and the cache is still valid.&lt;BR /&gt;&lt;BR /&gt;Bottom line is LOST_DATA is a cache problem,&lt;BR /&gt;UNWRITEABLE_DATA is a unit/storageset problem.&lt;BR /&gt;&lt;BR /&gt;Rich</description>
      <pubDate>Wed, 18 Apr 2007 16:59:12 GMT</pubDate>
      <guid>https://community.hpe.com/t5/disk-enclosures/hsg80-problem-determination/m-p/3982269#M23489</guid>
      <dc:creator>rich pattison</dc:creator>
      <dc:date>2007-04-18T16:59:12Z</dc:date>
    </item>
  </channel>
</rss>

