04-26-2011 12:16 AM
Re: MSL2024 1 Drive 1840 backup failure
I made the registry change and this fixed the issue. I had several weeks to a month's worth of successful backups.
The backups have recently started to fail again.
The only change has been the installation of PSP 8.70.
This is the reg key:
[HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Enum\SCSI\Sequential&Ven_HP&Prod_Ultrium_4-SCSI\8&2f5f9346&0&000400\Device Parameters\Storport]
"BusyRetryCount"=dword:000000fa
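The data in a .reg export is written in hexadecimal, so `dword:000000fa` is 250 decimal. A minimal Python sketch for converting between the two forms follows; the helper names are illustrative only and are not part of any HP or Microsoft tool.

```python
def parse_reg_dword(text: str) -> int:
    """Convert a .reg-style value such as 'dword:000000fa' to an integer."""
    prefix, _, digits = text.strip().partition(":")
    if prefix.lower() != "dword" or not digits:
        raise ValueError(f"not a dword value: {text!r}")
    value = int(digits, 16)
    if not 0 <= value <= 0xFFFFFFFF:
        raise ValueError("a DWORD must fit in 32 bits")
    return value


def format_reg_dword(value: int) -> str:
    """Convert an integer to the 8-digit hex form used in .reg files."""
    if not 0 <= value <= 0xFFFFFFFF:
        raise ValueError("a DWORD must fit in 32 bits")
    return f"dword:{value:08x}"


print(parse_reg_dword("dword:000000fa"))  # 250
print(format_reg_dword(0xFFFF))           # dword:0000ffff
```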
04-26-2011 02:07 AM
Re: MSL2024 1 Drive 1840 backup failure
04-26-2011 11:13 AM
Re: MSL2024 1 Drive 1840 backup failure
If you want to turn the storage agents back on, check whether installing the PSP caused Windows to create a new registry entry for that drive. Depending on what the PSP installs, Windows can create new registry entries during the installation, and a new entry would not carry the BusyRetryCount setting.
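One way to check for a newly created entry is to list every instance under the drive's Enum\SCSI key and see which ones lack the value. The sketch below assumes the device key name quoted earlier in this thread and uses Python's standard `winreg` module, so the registry read only works on Windows and may need administrator rights. The function names are hypothetical.

```python
import sys

ENUM_SCSI = r"SYSTEM\CurrentControlSet\Enum\SCSI"
DEVICE = "Sequential&Ven_HP&Prod_Ultrium_4-SCSI"  # from the key quoted in this thread
VALUE = "BusyRetryCount"


def read_instances():
    """Return {instance_id: BusyRetryCount or None} for the tape device (Windows only)."""
    import winreg  # imported here so the file still loads on other platforms

    found = {}
    with winreg.OpenKey(winreg.HKEY_LOCAL_MACHINE, ENUM_SCSI + "\\" + DEVICE) as dev:
        subkey_count = winreg.QueryInfoKey(dev)[0]
        for i in range(subkey_count):
            instance = winreg.EnumKey(dev, i)
            try:
                with winreg.OpenKey(dev, instance + r"\Device Parameters\Storport") as sp:
                    found[instance] = winreg.QueryValueEx(sp, VALUE)[0]
            except OSError:
                # The Storport subkey or the value itself does not exist.
                found[instance] = None
    return found


def missing_setting(instances):
    """List instance IDs that have no BusyRetryCount value."""
    return sorted(name for name, value in instances.items() if value is None)


if __name__ == "__main__" and sys.platform == "win32":
    for name in missing_setting(read_instances()):
        print("no BusyRetryCount under instance", name)
```

If more than one instance is listed and only the older one has the value, the setting would need to be added to the new instance as well.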
04-27-2011 03:28 AM
Re: MSL2024 1 Drive 1840 backup failure
I've attached the two reg locations.
There is only one entry for the actual tape drive, but there is another entry for the SCSI HBA.
04-27-2011 06:34 AM
Solution
That is an unusual configuration, as there is an MSA1000 on the same controller as the tape drive. It isn't a recommended configuration and would probably be considered unsupported, but it can usually be made to work.
Since the array is on the same controller as the tape drive, it is likely that additional delays are occurring somewhere.
I would start by raising BusyRetryCount considerably. The 0xfa (250) value was determined to be a good setting for a single tape drive on the controller, but other devices on the same controller could easily require a much higher value. I would recommend changing it to 0xffff (65535).
All that parameter does is set how long to wait before giving up when a device is busy. Before Microsoft created the registry entry, the wait time was infinite; that was the default when this card was first made. That worked fine most of the time, but some devices could get stuck reporting Busy and cause a hang. To fix that, a timeout was added to the lower-layer drivers, but whoever picked it chose a value that is frequently too low.
There are a couple of other registry entries that were created at the same time that you might try:
Value - BusyPauseTime
Type - DWORD
Data - 250 Decimal (default)
Range - number of milliseconds
If you change the pause time, I would recommend trying 500 or 1000.
Value - QueueFullWaitIoPercentage
Type - DWORD
Data - 25 Decimal (default)
Range - 1 to 100 percentage of time
For tape, a value of 50 to 75 would be better, but be careful with an array attached: setting this number too high could hurt performance on a heavily loaded array.
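For testing, the three values can be written out as a .reg fragment. The Python sketch below is only an illustration: it reuses the device key quoted earlier in this thread (the instance ID will differ on other systems), picks 500 and 50 from the suggested ranges, and assumes all three values live under the same Storport key. The function name is hypothetical.

```python
KEY_PATH = (
    r"HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Enum\SCSI"
    r"\Sequential&Ven_HP&Prod_Ultrium_4-SCSI\8&2f5f9346&0&000400"
    r"\Device Parameters\Storport"
)

SUGGESTED = {
    "BusyRetryCount": 0xFFFF,         # suggested above for a shared controller
    "BusyPauseTime": 500,             # milliseconds; 500 or 1000 were suggested
    "QueueFullWaitIoPercentage": 50,  # 50 to 75 was suggested for tape
}


def reg_fragment(key_path, values):
    """Render DWORD values as .reg text after basic range checks."""
    pct = values.get("QueueFullWaitIoPercentage")
    if pct is not None and not 1 <= pct <= 100:
        raise ValueError("QueueFullWaitIoPercentage must be 1 to 100")
    lines = ["Windows Registry Editor Version 5.00", "", f"[{key_path}]"]
    for name, value in values.items():
        if not 0 <= value <= 0xFFFFFFFF:
            raise ValueError(f"{name} does not fit in a DWORD")
        lines.append(f'"{name}"=dword:{value:08x}')
    return "\n".join(lines) + "\n"


print(reg_fragment(KEY_PATH, SUGGESTED))
```

Export the existing key before importing a fragment like this, so the previous values can be restored.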
Adjusting these registry entries is a real pain, especially in a production environment, but there are too many potential interactions to calculate precisely what you need. For the retry count, too high a value does nothing except slightly delay error reporting on a fatal, permanent busy condition (really rare). For the pause time, a higher value can add a few milliseconds of delay in detecting the end of a busy condition, which normally means nothing but can add up if the system is heavily loaded and busy conditions occur frequently. The queue-full parameter won't affect tape performance at all but can affect disk performance.
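A rough way to compare settings is to estimate the longest time a permanently busy device could hold things up before an error is reported. The Python sketch below assumes, without confirmation from this thread, that the driver waits one full BusyPauseTime per retry and that nothing else contributes to the delay, so treat the result as an upper bound under that assumption rather than a measured figure. The function name is hypothetical.

```python
def worst_case_busy_wait_seconds(retry_count: int, pause_ms: int) -> float:
    """Upper bound on the busy wait, ASSUMING one full pause per retry."""
    return retry_count * pause_ms / 1000


# (BusyRetryCount, BusyPauseTime in ms) pairs discussed in this thread
for count, pause in [(0xFA, 250), (0xFFFF, 250), (0xFFFF, 1000)]:
    seconds = worst_case_busy_wait_seconds(count, pause)
    print(f"retries={count:>5} pause={pause:>4} ms -> {seconds} s")
```

The bound only matters for a device that never leaves the busy state; a device that recovers after a few retries is unaffected by a larger retry count.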
04-27-2011 06:50 AM
Re: MSL2024 1 Drive 1840 backup failure
The MSA1000 is on an FC1242SR 4Gb PCI-e DC FC HBA.
The tape drive is part of the MSL2024 library, which is attached via the SC11Xe, which is parallel SCSI.
Perhaps there is something wrong with the registry keys.
The MSA1000 is due to be replaced with a spare MSA70 soon.
I'll have a look at making those changes you recommend. I have an exact hardware copy at another site that I can do testing on.
04-27-2011 06:59 AM
Re: MSL2024 1 Drive 1840 backup failure
If you are willing to experiment a bit, hearing how it goes for you would be very helpful. I have requested quite a bit of testing of this specific configuration, trying to reproduce problems like the ones you have seen with the BusyRetryCount registry entry set to 250, but none of the lab tests have experienced any failures after setting that entry. You obviously have it set, so we might be able to learn something new.
Since you indicate that you have a mirror system outside of production where you can run tests, I'll mention that there is a software SCSI analyzer HP uses, with a client you can download and run to take low-level traces and possibly capture a SCSI bus trace of a failure. The tool is called busTRACE, and the busTRACE capture client on the following page can take traces that we can analyze back at the lab.
http://bustrace.com/downloads/free_utilities.php
If the failure happens at the physical level (HBA or on the wire), that tool won't capture it and we have to use a hardware analyzer, but frequently it captures everything we need.
04-27-2011 07:43 AM
Re: MSL2024 1 Drive 1840 backup failure
Are there any specific instructions for using it?
04-28-2011 02:36 AM
Re: MSL2024 1 Drive 1840 backup failure
I filtered on only the LSI adapter and MSL G3 and 1840 tape drive.
04-29-2011 06:26 AM