HPE 9000 and HPE e3000 Servers
cancel
Showing results for 
Search instead for 
Did you mean: 

rp7420 stopped reboot and MP can't access

 
SOLVED
Go to solution

rp7420 stopped reboot and MP can't access

I have rp7420 with only one partition and before it finished rebooting, it hang and MP can't access either. I switch the slave MP and from the log it shows "Warning
Keyword: MASTER_MP_FAILURE
The mp in the master slot of the specified cabinet is reporting failure"; now even I can access MP but it can't detect any boot devices. My question is : how can I make the server work? (MP>CM>PS shows all hardware OK.

Thanks for your input.

Shirley
13 REPLIES 13
Torsten.
Acclaimed Contributor

Re: rp7420 stopped reboot and MP can't access

Hi Shirley,

first of all, unplug the box for some minutes and try again. Second: be sure to have a current firmware on all components.
After the power cycle, do a "sysrev" on GSP level and post it or give HP a call.

Hope this helps!
Regards
Torsten.

__________________________________________________
There are only 10 types of people in the world -
those who understand binary, and those who don't.

__________________________________________________
No support by private messages. Please ask the forum!

If you feel this was helpful please click the KUDOS! thumb below!   
Adisuria Wangsadinata_1
Honored Contributor

Re: rp7420 stopped reboot and MP can't access

Hi,

You can use DF from MP prompt to check which part is having a problem.

Check this document (docID : KBQA00000153) for your reference, about Superdome/Keystone BCH/MP command menu :

http://www2.itrc.hp.com/service/cki/docDisplay.do?docLocale=en_US&docId=200000080074590

Call HP for this issue.

Hope this information can help you.

Cheers,
AW
now working, next not working ... that's unix

Re: rp7420 stopped reboot and MP can't access

Hi,

My big question is if one core I/O board fails, can server work with the redundant core I/O? My case is that when server boots, VFP shows Ok(cell joins the partition), but boot stops at the BCH mode because running seach returns :
No Devices Found, even I insert an CDROM it still returns " No Devices Found".

Thanks,

Shirley
Silvino
Occasional Advisor

Re: rp7420 stopped reboot and MP can't access

Hi,
Core I/O are not redundant on RP7420.
You have 1 core i/o for each cell board.
core i/o for cell 0 has hw path 0/0/8/1 (slot8 right)
Core i/o for cell 1 has hw path 1/0/1/1. (slot1 left)

The 2 bottom disk assigned to MP0(ids 5 and 6)
Upper left is assigned to MP1
Upper roght is assigned to core i/o for cell 1.

If you can not "see" the disks with "SEA", you may have a failed MP (or core i/o) and you should call HP to repair/replace the MP.
Rgrds,
Silvino

Re: rp7420 stopped reboot and MP can't access

Thanks Silivino,

I have one MP failed for sure, but the other MP works, can I only use one MP to reboot rp7420 or I must have two MPs?

Shirley
Sameer_Nirmal
Honored Contributor

Re: rp7420 stopped reboot and MP can't access

rp7420 provides redundancy at MP level if it has two MPs installed in single npar configuration. Two MPs work in primary ( master ) and secondary ( slave ) modes.
If the master fails the secodary should take over through failover process.

By default MP1 which belongs to chassis 1 acts as primary MP. I believe it is failed in your case. You can check the status of the failover from seconday MP's menu
MP->CM->PS and M->CM->DE options.
It seems the failover has not taken place in the server. You can force the failover to the secodary MP using MP->CM->XD 9 option.
Mohanasundaram_1
Honored Contributor

Re: rp7420 stopped reboot and MP can't access

HI Shirley,

Usually the configuration is in such a way that the root disk and its mirror on different Core I/O SCSI.

This will enable to boot the system with
hpux -lq
in case of one disk or SCSI controller failure.

It will be important to know how your is system configured. If there is no mirror disk then the system is not going to boot on this failure.

Otherwise, try to boot in the alternate boot path without checking the LVM quorum.

With regards,
Mohan.

Attitude, Not aptitude, determines your altitude
Solution

Re: rp7420 stopped reboot and MP can't access

Ummm I'm afraid that MP failover is *not* supported any more - see the following:

http://h20000.www2.hp.com/bizsupport/TechSupport/Document.jsp?objectID=c00462190

Sorry to be the bearer of bad tidings

HTH

Duncan

HTH

Duncan
Stefan Stechemesser
Honored Contributor

Re: rp7420 stopped reboot and MP can't access

Hi,

Duncan is right, MP failover is no longer supported.

Anyway, in your situation it has already happened.

Regarding the boot problem:

The "sea" command only search for bootable devices on the "CoreCell", the cell with the Core I/O.

Try "sea all" to search on both cells.

In your case, it may be that the complete Core I/O 1 has failed (not only the MP part) and therfore, your Core Cell is now 0 and no longer 1.
You can check this with the "corecell" command in the Configuration Menu of BCH.
With the "io" command in the information menu, you can see if the Master core I/O is still be seen by the partition (hw path 1/0/0/3). If not, then the disks in the upper left slot (1/0/0/3/0.6) will also no longer be accessible.
The disks in the other slots are accessed by the Procorium Lan/SCSI card and the other Core I/O.

best regards

Stefan
Thayanidhi
Honored Contributor

Re: rp7420 stopped reboot and MP can't access

Hi Shirly,
To answer your question, you can have one MP when system has only one partition. Second MP is mandatory if you have two partitions. You can have two cell boards with single MP/core. In your current situation, you may move the MP card and try. Better to involve hardware vendor (or HP).

Regds
TT
Attitude (not aptitude) determines altitude.
Sameer_Nirmal
Honored Contributor

Re: rp7420 stopped reboot and MP can't access

Duncan,

Good piece of information for sharing. It is being for a while since I worked on rp7410 and 7420 and I didn't know about this HP document. Do you know when this document and first such annoncement made by HP for those servers ( date )?

Technically, I would say this is a bad as people can't have reduandacy at MP level.

I wonder if such announcements or business documents are published at ITRC with a link so ITRC users would know about it.

In this case , it is required to replace the failed MP card with the secondary MP.
Thereafter the firmware upgradat would be required to disable the MP redundancy.
It is better to invlove HP.

Re: rp7420 stopped reboot and MP can't access

Hi Stefen,

"sea all" still returned nothing. Again I check the HW status and I found cell 1 is claimed to be invalid:

HW status for Cell 1 : FAILURE DETECTED

Power status : on, no fault
Boot is blocked
PDH memory is shared
Processor Compatibility : OK
RIO cable status : connected
RIO cable connection physical location : PCI Domain 1
Core cell is INVALID
Attention Led is off

But cell 0 is OK and core cell is cell0. I have one partition and two boot disks, why does "sea all" return no device?

Thanks,

Shirley

Re: rp7420 stopped reboot and MP can't access

Sameer,

I found out about this by being subscribed to HP's 'Subscriber Choice' e-mails here:

https://h30046.www3.hp.com/subSignIn.php

Once you have an account and are signed in you can setup driver and support alerts which will let you know most of these little snippets.

HTH

Duncan

HTH

Duncan