Operating System - Tru64 Unix

Telia BackOffice
Valued Contributor

GS160 crashed spontaneously and cannot be brought online again

Hi

I have a GS160 that died spontaneously today. I noticed it had gone down thanks to my ICMP probe mechanism. When I got to the premises, my partitions were down. I had no luck booting either partition 0 or 1. I even cut the main power to the rack to switch everything off, but it didn't help.

I can see the errors in the output, but not being a hardware specialist on the AlphaServers, I would like to hear your opinion as to what the root cause could be. I should also mention that the server is without support, so I cannot just call HP :-)
Rob Leadbeater
Honored Contributor

Re: GS160 crashed spontaneously and cannot be brought online again

Hi,

From a quick scan through that capture file, this line stands out:

~E~ PBM11 Error:
~E~ FAN2 FAIL - POWEROFF IN 40 SECONDS

I have no experience with GS160s myself, so I can't tell you where this fan is...
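
If it helps with scanning a long capture, here is a minimal Python sketch that pulls out every line starting with the `~E~` marker seen above. The function name `error_lines` and the two non-error lines in the sample are made up for illustration. Only the two `~E~` lines come from the actual capture, and I'm assuming every error line in the console output uses that same prefix.

```python
def error_lines(capture_text, marker="~E~"):
    """Return (line_number, text) for every line that starts with the marker."""
    hits = []
    for number, line in enumerate(capture_text.splitlines(), start=1):
        stripped = line.strip()
        if stripped.startswith(marker):
            hits.append((number, stripped))
    return hits


# Hypothetical sample: only the two ~E~ lines are taken from the thread.
sample = """\
(placeholder for ordinary console output)
~E~ PBM11 Error:
~E~ FAN2 FAIL - POWEROFF IN 40 SECONDS
(placeholder for ordinary console output)
"""

for number, text in error_lines(sample):
    print(number, text)
```

With a real capture you would read the file contents into a string and pass that in instead of `sample`.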

Cheers,

Rob
Telia BackOffice
Valued Contributor

Re: GS160 crashed spontaneously and cannot be brought online again

Yeah, I noticed that. And also the hswitch errors. But ...

1. the system does not power off after 40 seconds as it says it would
2. is the hswitch error a consequence of the first error?

and as you say

3. which fan, and where? ;-)

Thomas
Khairy
Esteemed Contributor

Re: GS160 crashed spontaneously and cannot be brought online again

Hi,

Your partition_0 consists of QBB0, QBB1 and QBB3.

My guess is that the server is responding normally by halting the formation of partition_0 because of a failed fan in PCI1. My bet is that it's in PCI cage 1, which connects to QBB0.

I also noticed that QBB3 complains about PSx_FAN_FAILURE.

My advice is to call HP and let them check the machine.

Rgds