ProLiant Servers (ML,DL,SL)
1752272 Members
4588 Online
108786 Solutions
New Discussion юеВ

Re: Excessive Failed Hard Drives

 
SOLVED
Go to solution
cnb
Honored Contributor

Re: Excessive Failed Hard Drives

Click on Save Report, which will save the file in ZIP format. Make sure to rename the file extension to .ZIP or .Zip (not .zip) so that the forum members can open it. Then use the attachment BROWSE button in the reply window to attach the file.

Rgds,
Rancher
Honored Contributor

Re: Excessive Failed Hard Drives

I finally found the hard drive matrix and my firmware is the latest. My last failed hard drive showed the following when I ran diagnostics:

Failed
Error: 640001: Controller has reported a SMART error on this drive
Error: 640006: The Read and/or Write HARD error rate is above threshold
This drive has experienced/recorded error conditions reported by diagnosis and requires replacement
cnb
Honored Contributor

Re: Excessive Failed Hard Drives

Hmmm...

Which Version of Windows 2008 Server are you using exactly?
What Version of Firmware is the MSA60 at?
Is this MSA60 in a dual domain configuration?
What version of Firmware is the P800 controller that you just updated?
What are the drive Model & P/N's?
What version of ACU and ADU are you using (yes it makes a difference ;-))?
IMHO, ADU would be the better application to check and post errors, rather than the diagnostics.

Firmware CD 8.70 Support Guide Matrix:

ftp://ftp.hp.com/pub/c-products/servers/management/smartstart/FWServerSupportGuide8.70.pdf


If you can't post the ADU GUI report, then please try posting the CLI report:

Run the ADU CLI application:
ADU can run from the command-line to create a report text file.
The hpaducli executable is located in the directory where the ADU component was
installed, by default C:\Program Files\Compaq\hpadu\Bin.
"hpaducli -f [filename]" (filename being the file name the adu report
text file will be written to.)
For other command-line options just type:
"hpaducli -h"

Rgds,



Rancher
Honored Contributor

Re: Excessive Failed Hard Drives

Here is some of the information you requested:
Which Version of Windows 2008 Server are you using exactly? Enterprise SP1
What Version of Firmware is the MSA60 at? 2.18
Is this MSA60 in a dual domain configuration? No
What version of Firmware is the P800 controller that you just updated? 7.08

What are the drive Model & P/N's?
146 G SAS, 418367-B21: 450 G SAS, 454232-B21

What version of ACU and ADU are you using (yes it makes a difference ;-))? ACU 8.28
I was under the impression that the ADU is not part of the Online Diagnostic Utility.
I do have a seperate ADU on my 2003 servers, but not the 2008 boxes.
cnb
Honored Contributor

Re: Excessive Failed Hard Drives

the ADU cli report will contain the internal error logs of the drives. Can you post the report?


Rgds,
Rancher
Honored Contributor

Re: Excessive Failed Hard Drives

I do not have this on my server: :\Program Files\Compaq\hpadu\
I do not see the ADU at all, only the ACU. I thought the ADU is not part of the ACU or the new diagnostics.
cnb
Honored Contributor

Re: Excessive Failed Hard Drives

Yep I just saw that.

According to the Release Notes it was integrated with ACU in 8.28:

Changes for ACU 8.28.X.X:

Diagnostics (ADU - Array Diagnostic Utility) is now integrated with ACU (Array Configuration Utility)
GUI interface and icon updates
Tabs control for major task categories...Configuration, Diagnostics, and Wizards
Controller/Device Dropdown control for selecting controllers and devices


Will check out 8.28 and let you know.


Rgds,
cnb
Honored Contributor

Re: Excessive Failed Hard Drives

Maybe you need to update your ACU version?

ACU (8.35) shows there should be three tabs at the top, one is DIAGNOSTICS and there you will have the controllers on the left-side to check which one(s) to check out. Once you've selected the controller(s), you should have two options View and Generate Diagnostic Report. Click on Generate and SAVE the report as a zip file and post it with the ZIP or Zip extension.

Rgds,
Lazarix
Occasional Advisor

Re: Excessive Failed Hard Drives

It is also possible that the backplane may need replacing on the MSA that is failing the hard drives. I have had a MSA-60 that would randomly fail 2x SATA 750GB drives in an array, yet pulling them out and putting them back in would fix it until a few months later when they would 'fail' again.
A call to HP recommended that the backplane needs replacing on the MSA-60
Live by the sword
Rancher
Honored Contributor

Re: Excessive Failed Hard Drives

I am running ACU 8.28 on my servers and do have the diagnose tab. And, it appears that I have another drive failing. I ran Online diagnostics and this was the result:
Physical Hard Drive 61, Serial Number: D2A2P9601GPH0927, Controller Serial Number: PAFGF0N9SXK04A
Failed
Error: 640006: The Read and/or Write HARD error rate is above threshold

However, when you looke at the summary for storage, it shows that everything is fine.

I also ran diagnose thorugh the ACU and have attached the report.