Operating System - Tru64 Unix
1839270 Members
2571 Online
110137 Solutions
New Discussion

Re: DS-25 Cluster, member 1 fails to boot after restart

 
SOLVED
Go to solution
Martin Wolff
Frequent Advisor

DS-25 Cluster, member 1 fails to boot after restart

After reboot of a DS-25 node running TRU-64 5.1-B PK4, the following error message appears:

boot dkc100.1.0.1.2 -flags A
failed to open dkc100.1.0.1.2

After a

>>show devices dkc

We saw there was no dkc100, now there was dkc300!!!! It changed aparently alone!!!
boot dkc300 worked, but i am confused.

Does anyone know what could have happened?
9 REPLIES 9
Ivan Ferreira
Honored Contributor

Re: DS-25 Cluster, member 1 fails to boot after restart

¿What kind of storage are you using? Was there a recent change in device configuration or LUN presentations?
Por que hacerlo dificil si es posible hacerlo facil? - Why do it the hard way, when you can do it the easy way?
Martin Wolff
Frequent Advisor

Re: DS-25 Cluster, member 1 fails to boot after restart

We have two local disks, and a 14 disks array managed by LSM. Local disks are:

87: /dev/disk/dsk0c COMPAQ BD03686223 ro
bus-2-targ-0-lun-0
88: /dev/disk/dsk1c COMPAQ BD03686223 ro
bus-2-targ-3-lun-0

No changes were done to the system regarding to devices.
Ivan Ferreira
Honored Contributor

Re: DS-25 Cluster, member 1 fails to boot after restart

As I cannot see the output of SRM>>> show devs, I don't know if you booted with the shared disk (member boot disk) or your local disk.

Check if you booted the system as part of the lcuster with clu_get_info.

You should ensure that you still have access to the shared disks. I don't even know if it's a SAN storage.
Por que hacerlo dificil si es posible hacerlo facil? - Why do it the hard way, when you can do it the easy way?
Kapil Jha
Honored Contributor

Re: DS-25 Cluster, member 1 fails to boot after restart

Are you sure if was dkc100 before u started to work.May be someone has changed disk location.

Is this a new build??

BR,
Kapil+
I am in this small bowl, I wane see the real world......
Martin Wolff
Frequent Advisor

Re: DS-25 Cluster, member 1 fails to boot after restart

Ivan,
Yes the system is up in the cluster, and yes we have access to the SAN storageworks 4354R.

Kapil,
The system is old, it was installed in 2004, and is very un-probable that someone has make any changes.
Can that kind of conf be changed from the operating system?
Rob Leadbeater
Honored Contributor
Solution

Re: DS-25 Cluster, member 1 fails to boot after restart

Hi,

Can you post the output these two SRM commands...

>>> show dev
>>> show boot*

Cheers,

Rob
Vladimir Fabecic
Honored Contributor

Re: DS-25 Cluster, member 1 fails to boot after restart

Details that could help (so please post output of):
- # cat /etc/fstab
- # hwmgr -show scsi
- # ls -lR /etc/fdmns
In vino veritas, in VMS cluster
Martin Wolff
Frequent Advisor

Re: DS-25 Cluster, member 1 fails to boot after restart

Hi, i have attached the data Vladimir told that could help.

Regarding what Rob asked, i am arranging a maintenance window in order to get the data.
I hope i will have it before the end of the week.

Thank you all again for your help.
Martin Wolff
Frequent Advisor

Re: DS-25 Cluster, member 1 fails to boot after restart

There is a conflict between Position 2 and 4 of the local storage.

This conflict became visible when we put a disk on position 4 and the system did not boot.

There is something wrong physically that is making position 2 and 4 be consider as the same slot (maybe a short-circuit).

That´s why the position 2 is read as position 4 and that´s why the System board software is getting a wrong identifier for disk 2.

HP representatives may need to replace the local storage backpanel in order to correct this issue.

The customer will not use slot 4 so the correction done on the bootdef will be enough, it´s no need to correct the HW issue.

Slot 3 was tested and no problem was found, so in case the customer need a new disk on the local storage they can use position3.