ProLiant Servers (ML,DL,SL)
cancel
Showing results for 
Search instead for 
Did you mean: 

ml350 G5 random reboots every 2-3 days

tom98_1
Advisor

ml350 G5 random reboots every 2-3 days

Hi,

i have an ml350 G5 with win2003 on it that reboots randomely

it was connected on a apc ups smartups 1500, but i read here that it might cause reboot so i plug the server right in the wall and it's hasn'T reboot in over a week but now it start rebooting frequently

sometime the server can go into windows and the internal health status led is red showing the processor 1 problem. but it's happens maybe one time in 20 reboots

any help?? there's no bluescreen, nothing in the log except : the previous shutdown was unexpected

power supply has been changed also and it's still rebooting.

HELP
20 REPLIES
LuckyP_1
Frequent Advisor

Re: ml350 G5 random reboots every 2-3 days

Please check for the power supply revision number. If no error is reported on any of the logs, such as IML, ILO and OS event logs, then power supplies or Power supply backplane are the component that may cause this issue.

Check the power supply rev-number. it should be on 06M, or later.

Also it's recommended to run offline diagnostics to diagnose CPU/Memory. These components also cause silent reboots sometime and nothing is logged in the logs.

What is your BIOS and ILo FW version? Make sure you are running on latest BIOS as HP as recently released a critical update for system BIOS for many of G5 servers.
PZel
Valued Contributor

Re: ml350 G5 random reboots every 2-3 days

I don't know if your F/W and PSP is up to date, but:
when you're ILO firmware level is greater then 1.78, then your ILO 2 Management Controller driver must be minimal 1.12.0.0(or 1.11.2.1)(on PSP 8.30), otherwise you get also ASR's. This is also true for DL380G5. We use here F/W 8.70 +PSP 8.30
for G5. For G7 we use F/W 9.20 (Interactive) + PSP 8.60.
(see:
 http://h30499.www3.hp.com/t5/ITRC-remote-lights-out-mgmt-iLO/ILO2-FW-1-79-mays-cause-server-restarts/m-p/4494257#M4623
and
http://h30499.www3.hp.com/t5/ProLiant-Servers-ML-DL-SL/Strange-events-from-hpqilo2/m-p/4400209#M89589

PZ
Michael A. McKenney
Respected Contributor

Re: ml350 G5 random reboots every 2-3 days

I would upgrade all the firmware and drivers to the latest revision. I would remove the SmartUPS 1500 and get a better UPS. It could be the smart software for the UPS thinks their is a problem. Did you try uninstalling the smart software?

Uncheck the box for automatic shutdown in
system - advanced setup - startup and recovery in Windows.

Run a diagnostics from the Smart Start disk.
tom98_1
Advisor

Re: ml350 G5 random reboots every 2-3 days

my ILO firmware is 2.05

where can i see which version is ILO 2 Management Controller driver
tom98_1
Advisor

Re: ml350 G5 random reboots every 2-3 days

i found it

version 1.13.0.0
gregersenj
HPE Pro

Re: ml350 G5 random reboots every 2-3 days

ILo FW = 2.05.

From wich version did you upgrade?
If it came frome 1.81 or older. You must upgrade the driver.

Failing to upgrade the drive do caurse random reboots.

http://h20000.www2.hp.com/bizsupport/TechSupport/Document.jsp?locale=en_US&objectID=c01802766

BR
/jag
tom98_1
Advisor

Re: ml350 G5 random reboots every 2-3 days

i don't know from which version i upgraded because i'm the new it manager for this server and it was like that when i arrive

i have the latest driver.. 1.13
gregersenj
HPE Pro

Re: ml350 G5 random reboots every 2-3 days

I you're on latest fw and driver.
Then that should be ok.

As suggestet by others.
Check the IML, and the ILo event log.
IML check for any hints.
ILo event log check for power loss.

BR
/jag
tom98_1
Advisor

Re: ml350 G5 random reboots every 2-3 days

there's nothing in IML log

where can i see the ILO event log
PZel
Valued Contributor

Re: ml350 G5 random reboots every 2-3 days

Be sure that you are looking for the "ILO2 Management Controller Driver" and NOT the
"ILO2 Management Interface Driver" !!!
(because you can easily overlook that)
PZ
tom98_1
Advisor

Re: ml350 G5 random reboots every 2-3 days

i check out the version of this file :

hpqilo2.sys

that's the ILO2 Management Controller Driver
tom98_1
Advisor

Re: ml350 G5 random reboots every 2-3 days

any help??

the power supply has been changed

there's no error in the log,

can it be ram?
sometimes it reboots 2-3 times a day, sometimes it's ok for 30h.
PZel
Valued Contributor

Re: ml350 G5 random reboots every 2-3 days

hpilo2.sys must be correct, so probably no firmware/HP issue. Next steps:
1) Uninstall the UPS software
2) Make an memtest+ bootable CD and let it run overnight (when possible). Check walltime
(so you know if ít's reboot OR eject CD after loading memtest+)
3) Check Windows Event Logging
4) Check out the hardware, via:
http://bizsupport1.austin.hp.com/bc/docs/support/SupportManual/c00708990/c00708990.pdf
(check CPU in the socket)

Maybe, the best thing to do is contact HP for replacing the system brd ??
PZ
tom98_1
Advisor

Re: ml350 G5 random reboots every 2-3 days

i uninstalled the ups software last week and it's still rebooting.

what's strange is that sometime the server can go into windows and the internal health status led is red showing the processor 1 problem

can it be the cpu the problem??

Re: ml350 G5 random reboots every 2-3 days

For what it is worth, I have 3 that were doing the same thing but I do not have time to sit and try to figure it out. what has worked is leaving the console session logged on(can be locked). I know it sounds absurd by mine were rebooting a couple times a week and this stopped it all together.
Michael A. McKenney
Respected Contributor

Re: ml350 G5 random reboots every 2-3 days

Did you install the latest firmware on all the hardware? Boot Smart Start and run a diagnostics.
tom98_1
Advisor

Re: ml350 G5 random reboots every 2-3 days

i'll try the smart scan diag.

how long does it takes?
gregersenj
HPE Pro

Re: ml350 G5 random reboots every 2-3 days

Could indeed be the CPU.

BR
/jag
tom98_1
Advisor

Re: ml350 G5 random reboots every 2-3 days

i check the power supply and it's revision 03m

also i ran a smartscan diag and everything's ok except this :

Hard Drive 1 Storage Controller in Slot 0 S.M.A.R.T Error Test

Hard Drive 2 Storage Controller in Slot 0 S.M.A.R.T Error Test

Hard Drive 3 Storage Controller in Slot 0 S.M.A.R.T Error Test

what does it means?

i remove 3 out of 4 ram and i'll see if it help
PZel
Valued Contributor

Re: ml350 G5 random reboots every 2-3 days

PSU SPN 403781-001 (model: 379123-001)has a potential problem with the 03 Version (unless with a green dot on it).

This revision PSU will sometimes gives a general failure. This could be a major problem when you're only having 1 PSU in it. With 2 PSU's you see that one of the PSU's is not working anymore (amber LED in front).

I don't think you must concentrate on the harddisks.
PZ