HPE 9000 and HPE e3000 Servers
1752424 Members
4685 Online
108788 Solutions
New Discussion юеВ

Re: A Processor Field Replaceable Unit (FRU) has failed.

 
Eric Antunes
Honored Contributor

A Processor Field Replaceable Unit (FRU) has failed.

I all ITRC addicted, ;)

I'm having serious issues with both "my" HP9000 servers: they crashes almost every day simultaneously!

I ran the logtool on STM and found the following error:

"A Processor Field Replaceable Unit (FRU) has failed.

Recommended Action:

From the Boot Console Handler Main Menu, execute the INfo (in) command.
This brings up the Info Menu. Now execute the PRocessor (pr) command. This
will report what processor is not functioning."

But after doing the recommended action, the PRocessor command shows that the CPU state is OK...

Any idea??

Best Regards,

Eric Antunes
Each and every day is a good day to learn.
14 REPLIES 14
Eric Antunes
Honored Contributor

Re: A Processor Field Replaceable Unit (FRU) has failed.

One more thing:

Since they crash simultaneously, is there a way to check the UPS state/condition besides GSP->PS command?

Best Regards,

Eric Antunes
Each and every day is a good day to learn.
Torsten.
Acclaimed Contributor

Re: A Processor Field Replaceable Unit (FRU) has failed.

Eric,

if both servers are crashing at the same time, it could be a serious power problem. Have a look into the GSP log for power related messages.
Anyway, most servers have a redundant power supply to prevent this. What model do you have?

Regarding the UPS - it depends on the model. Some are network connected and providing an own interface to see the details.

Hope this helps!
Regards
Torsten.

__________________________________________________
There are only 10 types of people in the world -
those who understand binary, and those who don't.

__________________________________________________
No support by private messages. Please ask the forum!

If you feel this was helpful please click the KUDOS! thumb below!   
Michael Steele_2
Honored Contributor

Re: A Processor Field Replaceable Unit (FRU) has failed.

Are these new or old servers?

I'd place a HW call. Gather this information:

Model number:

# model

Check your PDC version in /var/tombstones/ts99

HW serial number which is on your delivery paper work or the outside of the machine.

Here's a thread like yours: http://forums1.itrc.hp.com/service/forums/questionanswer.do?threadId=805951
Support Fatherhood - Stop Family Law
Eric Antunes
Honored Contributor

Re: A Processor Field Replaceable Unit (FRU) has failed.

Hi Torsten and Michael,

Thanks for the replies.

In the GSP log there is nothing besides an old 1970 message...

The UPS is a A3589B (5.5 kVA rackmt HP PowerTrust UPS). Is there a way to gather more information about this UPS?

They are 9000/800/L2000-44 and 9000/800/L2000-36 servers.

I don't think it is related with the PDC version since, being connected to the same Power Supply, they both crashes at the same time.

The servers have different PDC versions:

9000/800/L2000-36 has PDC Version: 41.38

9000/800/L2000-44 has PDC Version: 44.28

Best Regards,

Eric Antunes
Each and every day is a good day to learn.
Torsten.
Acclaimed Contributor

Re: A Processor Field Replaceable Unit (FRU) has failed.

How many power supplies are installed?

GSP> ps


Hope this helps!
Regards
Torsten.

__________________________________________________
There are only 10 types of people in the world -
those who understand binary, and those who don't.

__________________________________________________
No support by private messages. Please ask the forum!

If you feel this was helpful please click the KUDOS! thumb below!   
Torsten.
Acclaimed Contributor

Re: A Processor Field Replaceable Unit (FRU) has failed.

unfortunately your UPS doesn't talk too much. There is a possibility to read the data, but this is very cryptical and I don't know all commands.

About the "old" GSP messages: In case of a power loss, the GSP has no actual time. For this reason the timestamp is 1.1.1970. Once the OS is running, it passes to current time to the GSP. So read the "old" messages!

Are there power related entries?

If you have a redundant power supply, try to connect it to a wall outlet and not to the USV.

The L2000 needs 2 PS if the box is fully loaded, but perhaps it is able to run with only 1. The third PS is for redundancy.

Hope this helps!
Regards
Torsten.

__________________________________________________
There are only 10 types of people in the world -
those who understand binary, and those who don't.

__________________________________________________
No support by private messages. Please ask the forum!

If you feel this was helpful please click the KUDOS! thumb below!   
Torsten.
Acclaimed Contributor

Re: A Processor Field Replaceable Unit (FRU) has failed.

And BTW, you should bring the other box to PDC 44.28 too - this is not related to this problem, but will prevent others.

Hope this helps!
Regards
Torsten.

__________________________________________________
There are only 10 types of people in the world -
those who understand binary, and those who don't.

__________________________________________________
No support by private messages. Please ask the forum!

If you feel this was helpful please click the KUDOS! thumb below!   
Eric Antunes
Honored Contributor

Re: A Processor Field Replaceable Unit (FRU) has failed.

Hi Torsten,

Thanks for the replies.

They are 3 power supplies, all in "Normal" states.

Here's the same "old" message in both server GSP's (they don't seem power related):

"DATE: 01/01/1970 TIME: 00:00:00
ALERT LEVEL: 2 = Non-Urgent operator attention required

SOURCE: 3 = PDH
SOURCE DETAIL: 6 = interconnect medium SOURCE ID: 0
PROBLEM DETAIL: 3 = non-responding, may need GSP reset.

CALLER ACTIVITY: 2 = operation STATUS: 0
CALLER SUBACTIVITY: 02 = platform internal interconnect
REPORTING ENTITY TYPE: 1 = service processor REPORTING ENTITY ID: 00

0x5810082336002020 00004600 01000000 type 11 = Timestamp 01/01/1970 00:00:00
Type CR for next entry, - CR for previous entry, Q CR to quit."

I didn't understood this one: "If you have a redundant power supply, try to connect it to a wall outlet and not to the USV.". What is a wall outlet??

Best Regards,

Eric Antunes
Each and every day is a good day to learn.
Torsten.
Acclaimed Contributor

Re: A Processor Field Replaceable Unit (FRU) has failed.

A power outlet/socket in the wall?

Just a direct source for power - not the UPS.

Regarding the logs:

I expect something like

"SOURCE DETAIL: 4 = high voltage DC power SOURCE ID: 2
PROBLEM DETAIL: A = failed or disconnected"

Hope this helps!
Regards
Torsten.

__________________________________________________
There are only 10 types of people in the world -
those who understand binary, and those who don't.

__________________________________________________
No support by private messages. Please ask the forum!

If you feel this was helpful please click the KUDOS! thumb below!