Disk Arrays
cancel
Showing results for 
Search instead for 
Did you mean: 

HSZ50 - Unit has lost data

Andreas Holz_3
Occasional Advisor

HSZ50 - Unit has lost data

Hello all,

we had a power outage last week which resulted in one dead disk within one raidset of our RaidArray 450.

The disk in question has not been taken into the failedset by the controller. The unit containig the raid with the defective disk has the "Unit has lost data" state.

A try to clear the lost data flag via:

CLEAR_ERRORS D300 LOST_DATA is returning

Error 7120: CLEAR LOST_DATA attempt failed on D300.

The replacement policy of the raidset is BEST_FIT and there is an appropriate disk in the spareset available.

What can I do to clear the lost_data flag?

10 REPLIES
Martin Schweizer
Occasional Visitor

Re: HSZ50 - Unit has lost data

Hello Andreas,
the poweroutage was probably to long for your batteries and you may have "invalid cache". This must be fixed first before you start to clear the lost_data on the lun. Veryfy this by show this and show other.
Good luck

Martin
Andreas Holz_3
Occasional Advisor

Re: HSZ50 - Unit has lost data

Hello Martin

the cache error flag has been cleared already, two (from three) raidsets are up and running.

Only the third has the lost data flag still set, which I'm not able to reset.
WilliamSmith11
Super Advisor

Re: HSZ50 - Unit has lost data

Send the following output
HSZ50>show this_controller
HSZ50>show other_controller
HSZ50>show unit DXXX

Where DXXX is the affected unit number.

HSZ50> Show raidset XYZ

where XYZ is the name of the affected raidset.

Williams
Andreas Holz_3
Occasional Advisor

Re: HSZ50 - Unit has lost data

Sorry for the delay in posting the following informations!

--------------------------------------------

Show THIS
Controller:
HSZ50-AX ZG82443697 Firmware V52Z-0, Hardware A04
Not configured for dual-redundancy
SCSI address 7
Time: NOT SET
Host port:
SCSI target(s) (1, 2, 3, 4), Preferred target(s) (1, 2, 3, 4)
TRANSFER_RATE_REQUESTED = 10MHZ
Cache:
32 megabyte write cache, version 3
Cache is GOOD
Unflushed data in cache
CACHE_FLUSH_TIMER = DEFAULT (10 seconds)
CACHE_UPS
Host Functionality Mode = A

---------------------------------------
We onyl habe one controller running


show d300
LUN Uses
--------------------------------------------------------------

D300 R3 (partition)
Switches:
RUN NOWRITE_PROTECT READ_CACHE
WRITEBACK_CACHE
MAXIMUM_CACHED_TRANSFER_SIZE = 32
State:
INOPERATIVE
Unit has lost data
PREFERRED_PATH = THIS_CONTROLLER
WRITE_PROTECT - DATA SAFETY
Size: 71057335 blocks

------------------------------------------

show r3
Name Storageset Uses Used by
------------------------------------------------------------------------------

R3 raidset DISK110 D300
DISK210
DISK310
DISK410
DISK510
Switches:
POLICY (for replacement) = BEST_FIT
RECONSTRUCT (priority) = NORMAL
CHUNKSIZE = 256 blocks
State:
NORMAL
DISK110 (member 0) is NORMAL
DISK210 (member 1) is NORMAL
DISK310 (member 2) is NORMAL
DISK410 (member 3) is NORMAL
DISK510 (member 4) is NORMAL
Size: 71057340 blocks
Partitions:
Partition number Size Starting Block Used by
--------------------------------------------------------------------
1 71057335 ( 36381.35 MB) 0 D300


---------------------------------------------
Rob Leadbeater
Honored Contributor

Re: HSZ50 - Unit has lost data

Hi,

You could try clearing the unflushed data on the controller, that may then allow you to clear the error on the D300 unit.

CLEAR_ERRORS THIS_CONTROLLER INVALID_CACHE DESTROY_UNFLUSHED_DATA

Hope this helps,

Regards,

Rob
Andreas Holz_3
Occasional Advisor

Re: HSZ50 - Unit has lost data

I had no success!
articles
Occasional Visitor

Re: HSZ50 - Unit has lost data


You left with only one option, use data recovery software and services to get back your lost data.Kernel for windows is the utility to get back lost data due to various physical and logical reasons

Visit
http://www.nucleustechnologies.com/FAT-NTFS-Data-Recovery.html
to download the software and recover your data..
Brad Emrich
Occasional Visitor

Re: HSZ50 - Unit has lost data


---------------------------------------------

Hello. I am in a similar situation. However, I do not need to recover any lost data. I just need to re-initialize and wipe the drive. However no commands work, and I am unable to free up any of the spindles.

Is there any way to wipe this RAID array without wiping the whole controller?


HSJ014>delete d1443
Error 9410: Cannot delete unit -- LOST_DATA error exists on unit that
must be cleared first. To clear error type:
CLEAR_ERRORS D1443 LOST_DATA
HSJ014>CLEAR_ERRORS D1443 LOST_DATA
Error 7120: CLEAR LOST_DATA attempt failed on D1443.
HSJ014>delete d1443
Error 9410: Cannot delete unit -- LOST_DATA error exists on unit that
must be cleared first. To clear error type:
CLEAR_ERRORS D1443 LOST_DATA
HSJ014>CLEAR_ERRORS D1443 LOST_DATA
Error 7120: CLEAR LOST_DATA attempt failed on D1443.
HSJ014>
Brad Emrich
Occasional Visitor

Re: HSZ50 - Unit has lost data

Here's more info as requested earlier in this discussion:

HSJ014>
HSJ014>show this
Controller:
HSJ40 (C) DEC ZG54612319 Firmware V37J-1, Hardware H09
Configured for dual-redundancy with ZG54412018
In dual-redundant configuration
SCSI address 7
Time: 13-APR-2010 14:44:25
Host port:
Node name: HSJ014, valid CI node 3, 32 max nodes
System ID 420010031922
Path A is ON
Path B is ON
MSCP allocation class 5
TMSCP allocation class 5
CI_ARBITRATION = SYNCHRONOUS
MAXIMUM_HOSTS = 15
Cache:
32 megabyte write cache, version 2
Cache is GOOD
Battery is FAILED
No unflushed data in cache
CACHE_FLUSH_TIMER = DEFAULT (10 seconds)
CACHE_UPS
HSJ014>show other
Controller:
HSJ40 (C) DEC ZG54412018 Firmware V37J-1, Hardware H07
Configured for dual-redundancy with ZG54612319
In dual-redundant configuration
SCSI address 6
Time: 13-APR-2010 14:44:30
Host port:
Node name: HSJ14B, valid CI node 4, 32 max nodes
System ID 42001004F128
Path A is ON
Path B is ON
MSCP allocation class 5
TMSCP allocation class 5
CI_ARBITRATION = SYNCHRONOUS
MAXIMUM_HOSTS = 15
Cache:
32 megabyte write cache, version 2
Cache is GOOD
Battery is FAILED
Unflushed data in cache
CACHE_FLUSH_TIMER = DEFAULT (10 seconds)
CACHE_UPS
HSJ014>show d1443
MSCP unit Uses
--------------------------------------------------------------

D1443 R7
Switches:
NORUN NOWRITE_PROTECT READ_CACHE
WRITEBACK_CACHE
MAXIMUM_CACHED_TRANSFER_SIZE = 32
State:
NO VOLUME MOUNTED
Unit has lost data
PREFERRED_PATH = OTHER_CONTROLLER
WRITE_PROTECT - DATA SAFETY
Size: 53294505 blocks
HSJ014>set d1443 run
HSJ014>show d1443
MSCP unit Uses
--------------------------------------------------------------

D1443 R7
Switches:
RUN NOWRITE_PROTECT READ_CACHE
WRITEBACK_CACHE
MAXIMUM_CACHED_TRANSFER_SIZE = 32
State:
INOPERATIVE
Unit has lost data
PREFERRED_PATH = OTHER_CONTROLLER
WRITE_PROTECT - DATA SAFETY
Size: 53294505 blocks
HSJ014>show r7
Name Storageset Uses Used by
------------------------------------------------------------------------------

R7 raidset DISK100 D1443
DISK430
DISK540
DISK650
Switches:
NOPOLICY (for replacement)
RECONSTRUCT (priority) = NORMAL
CHUNKSIZE = 256 blocks
State:
NORMAL
DISK100 (member 0) is NORMAL
DISK430 (member 1) is NORMAL
DISK540 (member 2) is NORMAL
DISK650 (member 3) is NORMAL
Size: 53294505 blocks
HSJ014>
HSJ014>

I have run the command "clear_errors other_controller invalid_cache destroy_unflushed_data" many many times but the results do not seem to change.
cnb
Honored Contributor

Re: HSZ50 - Unit has lost data

Brad,

Start/Manage your own thread, then...

Post:
SHOW THIS FULL
SHOW OTHER FULL.
SHOW STORAGE FULL
SHOW UNIT FULL

Which controller owns the drive?

Rgds,