
RAID controller - HPE SmartArray P408i-a hangs and freezes OS

 
Frank_DA
Visitor


Good day,

 

According to the HPE Servers Support & OS Certification Matrices, Canonical Ubuntu 20.04 LTS with kernel 5.4.0 appears to be supported and certified for the DL360 Gen10 with the in-distro drivers.

 

Has anyone tested Canonical Ubuntu 20.04 LTS with kernel 5.4.0 on the DL360 Gen10 with the HPE SmartArray P408i-a RAID controller?

 

Which in-distro driver (hpdsa or smartpqi) is recommended?

 

The page https://techlibrary.hpe.com/us/en/enterprise/servers/supportmatrix/exceptions/ubuntu_exceptions.aspx mentions that Canonical Ubuntu 18.04 with kernel 4.15 was tested. (HPE tested using in-distro driver with these adapters: - P408i Smart Array Controller)

 

 

The problem:

 

Initially, the RAID controller did not catch or detect hard disk errors, so the application stopped because of I/O errors.

 

 

# Kernel Logs 

 

kernel: [10250982.149112] sd 1:1:0:0: [sda] tag#174 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE

 kernel: [10250982.149114] sd 1:1:0:0: [sda] tag#174 Sense Key : Medium Error [current]

 kernel: [10250982.149116] sd 1:1:0:0: [sda] tag#174 Add. Sense: Unrecovered read error

 kernel: [10250982.149118] sd 1:1:0:0: [sda] tag#174 CDB: Read(10) 28 00 19 d4 9a c0 00 00 20 00

 kernel: [10250982.149120] print_req_error: I/O error, dev sda, sector 433363648

 kernel: [10251220.552923] sd 1:1:0:0: [sda] tag#483 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE

 kernel: [10251220.552927] sd 1:1:0:0: [sda] tag#483 Sense Key : Medium Error [current]

 kernel: [10251220.552929] sd 1:1:0:0: [sda] tag#483 Add. Sense: Unrecovered read error

 kernel: [10251220.552931] sd 1:1:0:0: [sda] tag#483 CDB: Read(10) 28 00 19 c8 17 00 00 00 20 00

 kernel: [10251220.552933] print_req_error: I/O error, dev sda, sector 432543488

 kernel: [10251809.636162] sd 1:1:0:0: [sda] tag#53 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE

 kernel: [10251809.636171] sd 1:1:0:0: [sda] tag#53 Sense Key : Medium Error [current]

 kernel: [10251809.636176] sd 1:1:0:0: [sda] tag#53 Add. Sense: Unrecovered read error

 kernel: [10251809.636183] sd 1:1:0:0: [sda] tag#53 CDB: Read(10) 28 00 1a 29 39 20 00 00 20 00

 kernel: [10251809.636187] print_req_error: I/O error, dev sda, sector 438909216

 kernel: [10252238.798677] sd 1:1:0:0: [sda] tag#917 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE

 kernel: [10252238.798680] sd 1:1:0:0: [sda] tag#917 Sense Key : Medium Error [current]

 kernel: [10252238.798682] sd 1:1:0:0: [sda] tag#917 Add. Sense: Unrecovered read error

 kernel: [10252238.798685] sd 1:1:0:0: [sda] tag#917 CDB: Read(10) 28 00 19 d2 73 c0 00 00 20 00

 kernel: [10252238.798687] print_req_error: I/O error, dev sda, sector 433222592

 kernel: [10252361.296318] sd 1:1:0:0: [sda] tag#784 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE

 kernel: [10252361.296322] sd 1:1:0:0: [sda] tag#784 Sense Key : Medium Error [current]

 kernel: [10252361.296324] sd 1:1:0:0: [sda] tag#784 Add. Sense: Unrecovered read error

 kernel: [10252361.296326] sd 1:1:0:0: [sda] tag#784 CDB: Read(10) 28 00 19 d3 37 e0 00 00 20 00

 kernel: [10252361.296328] print_req_error: I/O error, dev sda, sector 433272800

 kernel: [10252459.738921] sd 1:1:0:0: [sda] tag#167 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE

 kernel: [10252459.738924] sd 1:1:0:0: [sda] tag#167 Sense Key : Medium Error [current]

 kernel: [10252459.738926] sd 1:1:0:0: [sda] tag#167 Add. Sense: Unrecovered read error

 kernel: [10252459.738928] sd 1:1:0:0: [sda] tag#167 CDB: Read(10) 28 00 1a 29 39 20 00 00 20 00

 kernel: [10252459.738930] print_req_error: I/O error, dev sda, sector 438909216

 kernel: [10252850.102638] sd 1:1:0:0: [sda] tag#405 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE

 kernel: [10252850.102640] sd 1:1:0:0: [sda] tag#405 Sense Key : Medium Error [current]

 kernel: [10252850.102642] sd 1:1:0:0: [sda] tag#405 Add. Sense: Unrecovered read error

 kernel: [10252850.102644] sd 1:1:0:0: [sda] tag#405 CDB: Read(10) 28 00 19 b9 96 e0 00 00 20 00

 kernel: [10252850.102646] print_req_error: I/O error, dev sda, sector 431593184

 kernel: [10253365.783826] sd 1:1:0:0: [sda] tag#817 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE

 kernel: [10253365.783830] sd 1:1:0:0: [sda] tag#817 Sense Key : Medium Error [current]

 kernel: [10253365.783832] sd 1:1:0:0: [sda] tag#817 Add. Sense: Unrecovered read error

 kernel: [10253365.783834] sd 1:1:0:0: [sda] tag#817 CDB: Read(10) 28 00 19 bf 1f 60 00 00 20 00

 kernel: [10253365.783836] print_req_error: I/O error, dev sda, sector 431955808

 kernel: [10253515.907023] sd 1:1:0:0: [sda] tag#970 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE

 kernel: [10253515.907036] sd 1:1:0:0: [sda] tag#970 Sense Key : Medium Error [current]

 kernel: [10253515.907039] sd 1:1:0:0: [sda] tag#970 Add. Sense: Unrecovered read error

 kernel: [10253515.907041] sd 1:1:0:0: [sda] tag#970 CDB: Read(10) 28 00 1a 29 39 20 00 00 20 00

 kernel: [10253515.907043] print_req_error: I/O error, dev sda, sector 438909216
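
To make the log above easier to read, here is a minimal Python sketch that counts how often each sector fails and decodes the Read(10) CDB bytes. The function names (`decode_read10`, `failing_sectors`) are hypothetical helpers, not part of any HPE or kernel tool, and the regular expression only assumes the line format shown in this post. The CDB layout used is the standard SCSI Read(10) one: opcode 0x28, a 4-byte big-endian LBA in bytes 2-5, and a 2-byte transfer length in bytes 7-8.

```python
import re
from collections import Counter

# Matches lines such as:
#   print_req_error: I/O error, dev sda, sector 433363648
ERR_RE = re.compile(r"I/O error, dev (\w+), sector (\d+)")


def decode_read10(cdb_hex):
    """Decode a Read(10) CDB given as space-separated hex bytes.

    Returns (lba, transfer_length_in_blocks).
    """
    cdb = bytes.fromhex(cdb_hex)
    if len(cdb) != 10 or cdb[0] != 0x28:
        raise ValueError("not a 10-byte Read(10) CDB")
    lba = int.from_bytes(cdb[2:6], "big")      # bytes 2..5: logical block address
    blocks = int.from_bytes(cdb[7:9], "big")   # bytes 7..8: transfer length
    return lba, blocks


def failing_sectors(log_text):
    """Count I/O errors per (device, sector) in a kernel log excerpt."""
    return Counter(
        (dev, int(sector)) for dev, sector in ERR_RE.findall(log_text)
    )


if __name__ == "__main__":
    # First CDB from the log above.
    print(decode_read10("28 00 19 d4 9a c0 00 00 20 00"))  # (433363648, 32)

    sample = (
        "kernel: [10251809.636187] print_req_error: I/O error, dev sda, sector 438909216\n"
        "kernel: [10252459.738930] print_req_error: I/O error, dev sda, sector 438909216\n"
    )
    for (dev, sector), count in failing_sectors(sample).most_common():
        print(dev, sector, count)
```

In the excerpt above, the decoded LBA of each CDB equals the sector number on the following `print_req_error` line, each read covers 32 blocks, and sector 438909216 fails three times, so the errors are repeated reads of the same few regions of sda.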

 

 

After updating the RAID controller software to version 3.00, I/O errors are detected again. However, the RAID controller now stops working completely from time to time and needs to be replaced.

 

# dmesg

719-Slot 0 Drive Array - A controller failure event occurred prior to this power-up. (Previous lock up code = 0x6100C) Action: Install the latest controller firmware. If the problem persists, replace the controller.

 

Sebasbin
HPE Pro

Re: RAID controller - HPE SmartArray P408i-a hangs and freezes OS


Hi,

According to the OS Support and Certification Matrix, Ubuntu 20.04 LTS (5.4.0) is tested for the DL360 Gen10 server.
Please refer to the link below:
https://techlibrary.hpe.com/us/en/enterprise/servers/supportmatrix/ubuntu.aspx

 

For the Smart Array controller issue, you need to log a support case with HPE, as it needs further investigation, including log analysis.


Note: While I am an HPE employee, all of my comments (whether noted or not) are my own and are not any official representation of the company.
