Operating System - Tru64 Unix

GS160 crashed spontaneously and cannot be brought online again

 
Telia BackOffice
Valued Contributor


Hi

I have a GS160 that died spontaneously today. My ICMP probe mechanism alerted me that it had gone down. When I got to the premises, my partitions were down. I had no luck booting either partition 0 or partition 1. I even cut the main power to the rack to switch everything off, but it didn't help.

I can see the errors in the output, but as I am not a hardware specialist on AlphaServers, I would like to hear your opinion as to what the root cause could be. I should also mention that the server is without support, so I cannot just call HP :-)
Rob Leadbeater
Honored Contributor

Re: GS160 crashed spontaneously and cannot be brought online again

Hi,

From a quick scan through that capture file, this line stands out:

~E~ PBM11 Error:
~E~ FAN2 FAIL - POWEROFF IN 40 SECONDS

I've no experience myself on GS160s so can't tell you where this fan is...
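
For anyone working through a long capture like this, the error lines can be pulled out with a small script. This is only a rough sketch: it assumes every error line starts with the `~E~` marker seen in the two lines quoted above, and the function name `extract_errors` is made up for illustration.

```python
import re

# Assumption: error lines begin with the "~E~" marker, as in the two
# lines quoted from the capture above. Other markers are not handled.
ERROR_MARKER = re.compile(r"^\s*~E~\s*(.*)$")


def extract_errors(lines):
    """Return (line_number, message) for every line carrying the ~E~ marker."""
    hits = []
    for number, line in enumerate(lines, start=1):
        match = ERROR_MARKER.match(line)
        if match:
            hits.append((number, match.group(1).rstrip()))
    return hits


# Demo using the two lines quoted in this post plus an unrelated line.
sample = [
    "~E~ PBM11 Error:",
    "~E~ FAN2 FAIL - POWEROFF IN 40 SECONDS",
    "some other console output",
]
for number, message in extract_errors(sample):
    print(f"{number}: {message}")
```

For a real capture, an open file handle can be passed in place of the sample list, since the function only iterates over lines.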

Cheers,

Rob
Telia BackOffice
Valued Contributor

Re: GS160 crashed spontaneously and cannot be brought online again

Yeah, I noticed that, and the hswitch errors as well. But ...

1. The system does not power off after 40 seconds as it says it would.
2. Is the hswitch error a consequence of the first error?

And, as you say:

3. Which fan, and where? ;-)

Thomas
Khairy
Esteemed Contributor

Re: GS160 crashed spontaneously and cannot be brought online again

Hi,

Your partition_0 consists of QBB0, QBB1 and QBB3.

My guess is that the server is behaving normally: it halts the formation of partition_0 because of a failed fan in PCI1. My bet is that the fan is in PCI cage 1, which connects to QBB0.

I also noticed that QBB3 complains about PSx_FAN_FAILURE.

My advice is to call HP and let them check the machine.

Rgds