Re: REBOOT AFTER PANIC

Andrew Luis Arruza · ‎08-19-2003

Can anyone give me some direction on this message that is in the /etc/shutdownlog after the panic reboot:

11:45 Tue Aug 19 2003. Reboot after panic: , isr.ior = 0'240012.0'20c82218

Any/all help greatly appreciated and points will be assigned.
Andy

It is, after all, a matter of survival!!

John Bolene · ‎08-19-2003

isr is interrupt service routine

looks like trouble with an I/O device to me

anything else disk related in the log like a scsi reset?

It would help to know which OS, what disk devices are attached, and if you are current on patch levels.

It is always a good day when you are launching rockets! http://tripolioklahoma.org, Mostly Missiles http://mostlymissiles.com

A. Clay Stephenson · ‎08-19-2003

You really don't give us much to work with. At a minimum: Model & OS Version.

The first place to start is in /var/tombstones. Look for a file tsNN with a timestamp near your time of crash. This is probably a hardware problem so search the tombstone file for "HPMC" (High-Priority Machine Check) and that will give you a clue.

You really need to use the q4 utility (or have HP do it) to examine the crash dump.

One other thought: Is this box part of an MC/SG cluster? If so, you might be the victim of a ServiceGuard induced TOC (Transfer of Control).

If it ain't broke, I can fix that.

GK_5 · ‎08-19-2003

Here is similar forum article. This may help.

http://forums.itrc.hp.com/cm/QuestionAnswer/0,,0x3dccd7d96cbad711900a0090279cd0f9,00.html

IT is great!

Kent Ostby · ‎08-19-2003

Andy --

An isr.ior "panic" is caused by one of three things.

#1) HPMC -- As mentioned above, check the file /var/tombstones/ts99 to see if the date INSIDE the file is valid for around the time of the failure.

#2) If you run MC/Serviceguard, it could be caused by MC/Servicguard losing connection between the nodes. Look at the OLDsyslog.log on the node that crashed or the syslog.log on the node that didn't crash for that time frame and see if you see messages related to losing contact with each other.

#3) If someone forced a TOC either via the TOC button or the cntl-b> TC entry at the console, you would get a line like this as well.

Check #1 first. If the date INSIDE the file is from the timeframe then you have a HW problem and probably need to low a HW call with HP or whoever your vendor is.

If #1 doesnt show a valid timestamp (shows an older date or no valid timestamp) then if you have MC/SG and think that noone did #3 then you are probably looking at a Serviceguard TOC.

There are many causes of a SG TOC.

Generally it is either that there was a problem that disrupted the LAN and prevented SG from talking between the two nodes or that the system was in some sort of hang or "mini-hang" that prevented the two nodes from talking.

Post us what you find and we may be able to help further here although some SG TOC issues generally require more detailed analysis.

Best regards,

Kent M. Ostby

"Well, actually, she is a rocket scientist" -- Steve Martin in "Roxanne"

Categories

Company

Local Language

Forums

Discussions

Forums

Discussions

Discussions

Forums

Discussions

Forums

Discussions

Forums

Forums

Discussions

Forums

Discussions

Forums

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Discussion Boards

Community

Resources

Other HPE Sites

Discussions

Forums

Blogs

Re: REBOOT AFTER PANIC

REBOOT AFTER PANIC