Operating System - Tru64 Unix
1752590 Members
2955 Online
108788 Solutions
New Discussion юеВ

System crashes whenever we cleanup unused devices...!

 
Nagarajan Balakrishnan_1
Frequent Advisor

System crashes whenever we cleanup unused devices...!

Hi,

We have a two node cluster. Whenever, we run the following commands to cleanup unused and unconnected devices...!
-------------------------------
hwmgr delete component -id 224
dsfmgr -R hwid 224
-------------------------------
We did a "hwmgr -scan scsi" and the subsequent "hwmgr -show scsi" was showing the device 224 as not having any path.

After the cleanup completed successfully, everything was ok and within seconds one node crashed. When we trying to bring that up and when it was booting, the other node also crashed. We used to ignore this as "SCSI bus getting reset". But I feel somethig else could well be wrong. Please see the attachments for details.

Any help is appreciated.

Baalki
14 REPLIES 14
Ravi_8
Honored Contributor

Re: System crashes whenever we cleanup unused devices...!

Hi,

can u disable the SMNP traps

(set snmpEnableAuthenTraps flag to 2 in /etc/snmpd.conf file)
never give up
Srivathsan
Frequent Advisor

Re: System crashes whenever we cleanup unused devices...!

Hello Ravi,

Could you please explain that a bit further.

Is it specific to this cluster setup or is it a bug of some sort ?

Thanks in advance.

Srivathsan
Michael Schulte zur Sur
Honored Contributor

Re: System crashes whenever we cleanup unused devices...!

Hi,

could you please give us os version and patch level?

panic (cpu 0): Unaligned kernel space access from kernel mode

I wonder, if this not a patch related matter.

hth,

Michael
Nagarajan Balakrishnan_1
Frequent Advisor

Re: System crashes whenever we cleanup unused devices...!

Hi,

It is running on Tru64 5.1A with PK4.

According to my knowledge, this was a bug resolved in PK3.

The explanation given was the devices database changes in the memory of one of the nodes only before writing into the common file and hence there is an inconsistency detected and hence it crashes.

However, we are having this problem. Any additional help.

Regards
Baalki
Greg Yates
Valued Contributor

Re: System crashes whenever we cleanup unused devices...!

Hi,

There is at least one hwmgr-related panic fixed in a later patch kit (later than PK4) that you may be experiencing. I didn't see a stack trace in any of your attachments. That's always good info to have when looking at a crash. You can find this in the crash-data file in /var/adm/crash (default location).

Greg
Michael Schulte zur Sur
Honored Contributor

Re: System crashes whenever we cleanup unused devices...!

Hi,

we also had a problem with 5.1A pk4 when there was a tape not attached and we did a mt status, the machine would crash.

Michael
Orrin
Valued Contributor

Re: System crashes whenever we cleanup unused devices...!

Hi Baalki,

We had the same problem, only ours was worse as the device database was corrupted and we had to restore the device dataabse, lucky that we did not loose any data and there were no changes to the device database.

On further investigation of the problem, The HP engineer advised that the problem is witht the dsfmgr command, that causes the crash.

Since we don't have a system to play with you can understand our reluctance to try the command again.

The only difference was we used the delete option with the scsi id.

If it is a production box, might be a good idea to raise a service call. Our call just resolved the corrupted database, we haven't made any changes to the devices after that and haven't had the chance to look at the problem.

Hope you have more luck resolving the issue.
Will keep an eye on this thread, maybe it will solve our problem as well.

regards,
Orrin.
Nagarajan Balakrishnan_1
Frequent Advisor

Re: System crashes whenever we cleanup unused devices...!

Gents,

Thanks for the overwhelming responses.

Sorry, got busy with some urgent implementation requests.

I have attached the trace files. Hope that will throw some light on the problem and the cause.

Thanks.
Michael Schulte zur Sur
Honored Contributor

Re: System crashes whenever we cleanup unused devices...!

Hi,

could you post
hwmgr -show comp
and give an example, what you would delete?

thanks,

Michael