StoreEver Tape Storage
1752280 Members
4299 Online
108786 Solutions
New Discussion юеВ

Re: LSI_SAS event ID 11 with SC44Ge and StorageWorks 1760 1U Rackmount

 
SOLVED
Go to solution
CLEB
Valued Contributor

LSI_SAS event ID 11 with SC44Ge and StorageWorks 1760 1U Rackmount

I have two DL380G5 servers with SC44Ge HBA cards connected to StorageWorks 1760 1U rackmount tape drives.

Both servers fail to backup using NT Backup or ARCserve 12.0 SP2.

The event log shows Error Event Source: Lsi_sas
Event Category: None
Event ID: 11
Description:
The driver detected a controller error on \Device\RaidPort1.

I have upgraded the HBA driver to v1.28.2.1 (B) and the firmware to 06.18.05.00 (1.23.43.00A)


Tape drive firmware is U52D

I have disabled all insight agent services too.
30 REPLIES 30
CLEB
Valued Contributor

Re: LSI_SAS event ID 11 with SC44Ge and StorageWorks 1760 1U Rackmount

Tape drive has been replaced.

Still get lsi_sas errors in event log and LTT read/write test fails.
CLEB
Valued Contributor

Re: LSI_SAS event ID 11 with SC44Ge and StorageWorks 1760 1U Rackmount

The HBA has been replaced also.
GustavoT
Valued Contributor

Re: LSI_SAS event ID 11 with SC44Ge and StorageWorks 1760 1U Rackmount

What version of OS are you running? See if you are also getting event IDs 129 along with the Event ID 11. That and if you are running Windows 2003 Server SP2 you may want to check the storport driver. All sp2 systems are required to run storport KB945119. Upgrading the storport driver could help you fix this problem.
CLEB
Valued Contributor

Re: LSI_SAS event ID 11 with SC44Ge and StorageWorks 1760 1U Rackmount

OS is Windows 2003 R2 SP2.

I am only getting EventID 11.

I have tried every hotfix for storport.sys bar this one!

I have another server which is running the same hardware but the firmware version and scsi driver have never been updated since install. The storport.sys is SP2 version though.

Unfortunately I cannot downgrade the firmware on the hba as it does not allow that.

I will try this storport.sys, thank you for your help.

Curtis Ballard
Honored Contributor

Re: LSI_SAS event ID 11 with SC44Ge and StorageWorks 1760 1U Rackmount

Post the full DWord format binary details from the event log.

The 1U rack mount enclosure has an active interface card internal to the box. Has that been replaced or the cables?
CLEB
Valued Contributor

Re: LSI_SAS event ID 11 with SC44Ge and StorageWorks 1760 1U Rackmount

The cables were replaced and the HBA. I believe just the actual drive was shipped. This server is the other side of the world to me.

If the internal interface card is faulty then this would make sense as my local technician swapped the complete rackmount unit with another at the DR site. Perhaps there is an issue with this model?

I have since tried all newer firmware and driver revisions.

Event Type: Error
Event Source: Lsi_sas
Event Category: None
Event ID: 11
Date: 25/09/2009
Time: 8:50:55 AM
User: N/A
Computer: CANFP1
Description:
The driver detected a controller error on \Device\RaidPort1.

0000: 0018000f 00680001 00000000 c004000b
0010: 31130000 00000000 00000000 00000000
0020: 00000000 00000000 00000000 00000000
0030: 00000000 c004000b 00000000 00000000

Event Type: Error
Event Source: Lsi_sas
Event Category: None
Event ID: 11
Date: 15/05/2009
Time: 9:04:01 PM
User: N/A
Computer: CANFP1
Description:
The driver detected a controller error on \Device\RaidPort1.

Data:
0000: 0010000f 00680001 00000000 c004000b
0010: ad0f1600 00000000 00000000 00000000
0020: 00000000 00000000 00000000 00000000
0030: 00000000 c004000b

First message was before the tape drive was swapped.

I see now that HP have closed my support case even though I told them I still cannot perform a backup successfully.
Curtis Ballard
Honored Contributor
Solution

Re: LSI_SAS event ID 11 with SC44Ge and StorageWorks 1760 1U Rackmount

This error record dump looks familiar. I think you may have posted it on another thread where we were looking at a different issue.

The issues that we have identified and fixed in the various firmware was for an abort after a command was sent to the device. In generic terms a "hang" condition.

This is a different failure and the error data is reporting that the host issued an abort before the HBA had even processed the command to send on to the target. I'm not certain what might cause that.

You mention that the CE replaced the rack enclosure at your DR site. Is that the site that is having problems or was that another site that previously had problems but is now working?

You may have mentioned it before but I can't see most of your comments during a reply. What length cables are you using? I assume you are using the standard skinny "tape" cable. If you have a normal full width SAS cable it might be worth trying that just to eliminate the possibility that somehow you got multiple bad cables. There is nothing unique about the special tape cable except that it is a little cheaper than the standard cables because it doesn't need all the extra SAS pairs.
CLEB
Valued Contributor

Re: LSI_SAS event ID 11 with SC44Ge and StorageWorks 1760 1U Rackmount

The tape drive and HBA have definitely been replaced. I'm told it was the actual drive that was swapped. The 1U enclosure is still the same.

I have an exact same setup at DR site and the SAS cable was taken from there to try. The cable being used is the one that came with the tape unit.

Both sites still have issues, with my primary site getting progressively worse. This morning there is lots of these events:

Event Type: Error
Event Source: PlugPlayManager
Event Category: None
Event ID: 12
Date: 28/09/2009
Time: 7:21:02 AM
User: N/A
Computer: CANFP1
Description:
The device 'Hewlett Packard LTO Ultrium-4 drive' (SCSI\Sequential&Ven_HP&Prod_Ultrium_4-SCSI&Rev_U52D\5&2f1e44f4&0&000500) disappeared from the system without first being prepared for removal.

For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
Data:
0000: 00 00 00 00 ....

I have just checked and the tape drive is showing in device manager again.

I will run a test and post the results.

Unfortunately this site is in Canada, I am in the UK and my local support person is away for the next few weeks. I trying to get hold of an alternative technician.
CLEB
Valued Contributor

Re: LSI_SAS event ID 11 with SC44Ge and StorageWorks 1760 1U Rackmount

What storport version is required? Someone suggested 5.2.3790.4189. I have tried this but have since reverted to 5.2.3790.3959.