cancel
Showing results for 
Search instead for 
Did you mean: 

kernel panic error

dawn_jose85
Frequent Advisor

kernel panic error

Hello ,

Can anyone help me troubleshoot this issue. My server got hung in this state . I'm using rhel4.2 OS

Code: 4c 8b 42 08 8b 55 00 49 d1 e8 e8 ff 19 f4 ff ff c3 eb a1 48
RIP {show_partition+209} RSP <000001047cc07e68>
CR2: 0000000000000008
<0>Kernel panic - not syncing: Oops
11 REPLIES
Murali_V
HPE Pro

Re: kernel panic error

Hi,
what is the server model and generation? i suspect this is something related to BIOS settings or with the OS support on this model server.
Refer Red Hat Enterprise Linux Certification and Support HP ProLiant & BladeSystem Server and ensure that this OS is supported(Click on Archive if you dont find your server in list)
:
http://h18004.www1.hp.com/products/servers/linux/hplinuxcert.html
Was this post useful? - You may click the KUDOS! thumb below to say thank you...
dawn_jose85
Frequent Advisor

Re: kernel panic error

OS is Linux 2.6.9-78
Server Model is HP Proliant DL 380 G5

How to trouble shoot this Kernel panic issue and resolve this ?
How we can prevent this error in future?
Murali_V
HPE Pro

Re: kernel panic error

As per the matrix, I see that the minimum supported RHEL 4 update version is 5.
Update the Linux version and install latest HP ProLiant support pack.
Was this post useful? - You may click the KUDOS! thumb below to say thank you...
dawn_jose85
Frequent Advisor

Re: kernel panic error

I hardbooted the server and now the server is working fine.
Is this issue is caused because of any interrupt handler?
Normally two typer of kernel panic errors are there, hard panic and Soft panic.
How can i confirm regarding this?
Murali_V
HPE Pro

Re: kernel panic error

From the information available it is difficult to understand what exactly caused this issue, This seems to be hard panic and to prevent this I suggest to update the Linux version, PSP and BIOS firmware.
Was this post useful? - You may click the KUDOS! thumb below to say thank you...
dawn_jose85
Frequent Advisor

Re: kernel panic error

How can i trouble shoot this issue inorder to identify the exact reason
Chhaya_Z
Valued Contributor

Re: kernel panic error

Hi Dawn,

What is the kernel version?

Were there any changes made to the server?
Is this an application server?

Do you have diskdump configured for this server?

Regards,
Chhaya

I am an HP employee.
Was this post useful? - You may click the KUDOS! star to say thank you.
dawn_jose85
Frequent Advisor

Re: kernel panic error

The Kernel version is 2.6.9-78.ELsmp
This is an application server .
Can you please let me know what u meant by diskdump. actually i didn't get that
Chhaya_Z
Valued Contributor

Re: kernel panic error

The diskdump utility offers the ability to create and collect vmcore (Kernel dumps) on a single system without needing to be connected to a network. Diskdump creates files in an identical format to the netdump core files and can be analyzed with the same tools. Like netdump, diskdump will only work with certain hardware.

In RHEL 4 we configure diskdump and in RHEL 5 we configure kdump.

Refer below redhat KB article for more information on diskdump and its configuration.

https://access.redhat.com/kb/docs/DOC-7075
Regards,
Chhaya

I am an HP employee.
Was this post useful? - You may click the KUDOS! star to say thank you.
Chhaya_Z
Valued Contributor

Re: kernel panic error

Hi Dawn,

Also can you check if you have call trace in dmesg file?
Regards,
Chhaya

I am an HP employee.
Was this post useful? - You may click the KUDOS! star to say thank you.
dawn_jose85
Frequent Advisor

Re: kernel panic error

Hi,

we are not configured diskdump in our server .
From dmesg i got one error as below

shpchp: acpi_shpchprm:get_device PCI ROOT HID fail=0x5
hw_random hardware driver 1.0.0 loaded
shpchp: acpi_shpchprm:\_SB_.PCI0 evaluate _BBN fail=0x5
shpchp: acpi_shpchprm:get_device PCI ROOT HID fail=0x5
shpchp: acpi_shpchprm:\_SB_.PCI0 evaluate _BBN fail=0x5
shpchp: acpi_shpchprm:get_device PCI ROOT HID fail=0x5
MC: drivers/edac/edac_mc.c version MC $Revision: 1.3 $
MC0: Giving out device to i5000_edac.c I5000: PCI 0000:00:10.0 (0000:00:10.0)
shpchp: acpi_shpchprm:\_SB_.PCI0 evaluate _BBN fail=0x5