ProLiant Servers (ML,DL,SL)

Re: Random hang-up of ML-350 server

 
Slawomir Szyszlo
Occasional Advisor

Random hang-up of ML-350 server

I have a Proliant ML-350 server with SuSE Linux Enterprise Server 7 installed.

For the past few days my server has been hanging randomly. There are no alerts in the log files and no error messages. The screen is black and the server does not respond: I cannot log in via SSH or at the console.

I suspect overheating or a hardware problem. BTW, the second fan does not run during normal operation. Will it turn on automatically (depending on the current temperature), or is monitoring software needed?
8 REPLIES
Martin Breidenbach
Honored Contributor

Re: Random hang-up of ML-350 server

I'm not sure about the fan. I believe it should be running unless the system health driver is loaded (which will only be loaded if you installed it).

Did you check the hardware event log? There's a downloadable file on the HP web site that contains a DOS-bootable disk with a hardware event log viewer program.

If you installed the Insight Manager agents, you should be able to view the hardware event log via Insight Manager or the web agent.
louis gonzales
Occasional Advisor

Re: Random hang-up of ML-350 server

Where can I find that downloadable disk?
-Louis

louis.gonzales@edag-us.com
Martin Breidenbach
Honored Contributor

Re: Random hang-up of ML-350 server

I searched the HP web site for the bootable disk version of the integrated management log viewer but didn't find it. I know that it exists because I've used it.

But there are versions for almost every operating system.

If you go to

http://h18007.www1.hp.com/support/files/server/us/index.html

and search for 'integrated management log viewer' you'll get a list.
Martin Breidenbach
Honored Contributor

Re: Random hang-up of ML-350 server

I checked an older SmartStart CD (V4.90) and it contains the IML management utility.
Slawomir Szyszlo
Occasional Advisor

Re: Random hang-up of ML-350 server

This is my log:

ID    Severity     Initial Time      Update Time       Count
--------------------------------------------------------------
0001  Information  10:42 02/28/2003  15:13 06/11/2003  0007
LOG: Unknown Event (Class 15, Code 255)

0003  Information  14:11 02/28/2003  04:09 07/17/2003  0035
LOG: Unknown Event (Class 15, Code 80)

0004  Information  14:11 02/28/2003  14:11 02/28/2003  0002
LOG: Unknown Event (Class 15, Code 0)

0005  Information  14:11 02/28/2003  04:27 07/17/2003  0002
LOG: Unknown Event (Class 15, Code 255)

0006  Information  15:25 07/16/2003  04:10 07/17/2003  0033
LOG: Unknown Event (Class 15, Code 6)

0007  Information  15:27 07/16/2003  04:11 07/17/2003  0033
LOG: Unknown Event (Class 15, Code 6)


There is no other information. But the problems began in July, not February.
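
Since the problems started in July, it may help to separate the entries that were updated in July from the older ones. The following is only a rough sketch in Python, assuming the viewer output has exactly the layout pasted above (times as HH:MM, dates as MM/DD/YYYY). The names parse_iml and updated_since are made up for this example and are not part of any HP tool.

```python
import re
from datetime import datetime

# Entries copied from the log pasted above.
IML_TEXT = """\
0001 Information 10:42 02/28/2003 15:13 06/11/2003 0007
LOG: Unknown Event (Class 15, Code 255)

0003 Information 14:11 02/28/2003 04:09 07/17/2003 0035
LOG: Unknown Event (Class 15, Code 80)

0004 Information 14:11 02/28/2003 14:11 02/28/2003 0002
LOG: Unknown Event (Class 15, Code 0)

0005 Information 14:11 02/28/2003 04:27 07/17/2003 0002
LOG: Unknown Event (Class 15, Code 255)

0006 Information 15:25 07/16/2003 04:10 07/17/2003 0033
LOG: Unknown Event (Class 15, Code 6)

0007 Information 15:27 07/16/2003 04:11 07/17/2003 0033
LOG: Unknown Event (Class 15, Code 6)
"""

# One entry = a summary line followed by a "LOG:" line.
# This pattern is an assumption based only on the sample above.
ENTRY = re.compile(
    r"^(?P<id>\d{4}) (?P<severity>\w+) "
    r"(?P<initial>\d{2}:\d{2} \d{2}/\d{2}/\d{4}) "
    r"(?P<update>\d{2}:\d{2} \d{2}/\d{2}/\d{4}) "
    r"(?P<count>\d+)\s*\n"
    r"LOG: (?P<message>.*)$",
    re.MULTILINE,
)

TIME_FORMAT = "%H:%M %m/%d/%Y"  # e.g. "10:42 02/28/2003"


def parse_iml(text):
    """Return a list of dicts, one per log entry found in text."""
    entries = []
    for m in ENTRY.finditer(text):
        entries.append({
            "id": m["id"],
            "severity": m["severity"],
            "initial": datetime.strptime(m["initial"], TIME_FORMAT),
            "update": datetime.strptime(m["update"], TIME_FORMAT),
            "count": int(m["count"]),
            "message": m["message"].strip(),
        })
    return entries


def updated_since(entries, cutoff):
    """Keep only entries whose last update is at or after cutoff."""
    return [e for e in entries if e["update"] >= cutoff]


if __name__ == "__main__":
    for e in updated_since(parse_iml(IML_TEXT), datetime(2003, 7, 1)):
        print(e["id"], e["update"], e["count"], e["message"])
```

This only sorts out which entries changed recently. It does not say what the Class 15 events actually mean.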
Martin Breidenbach
Honored Contributor

Re: Random hang-up of ML-350 server

Unknown events... I don't know what they mean. I had hoped there might be something useful, like memory errors or a fan failure.

Maybe try HP support?
Slawomir Szyszlo
Occasional Advisor

Re: Random hang-up of ML-350 server

I read about similar problems on the "suse-oracle" mailing list. It is probably a kernel or storage driver bug, or something like that.
Slawomir Szyszlo
Occasional Advisor

Re: Random hang-up of ML-350 server

I upgraded the system from SLES 7 to SLES 8 two weeks ago. Since then there have been no hang-ups. I hope the new system will stay stable.