
DL380 G3 Keeps hanging on Redhat 8.0 smp kernel

Vinod_22
Advisor

We have an HP ProLiant DL380 G3 (GigE) that is experiencing frequent lockups. The machine is a Linux file server running RHL 8.0 (Psyche) with the default 2.4.18-14smp kernel (it is a dual Xeon 3.06GHz) and has almost constant disk activity.
The machine keeps hanging. About twice a day it hangs with the windows background on-screen but nothing else, and all network connectivity is dropped. I have nothing to go on beyond the constant disk activity, but could it be an issue with the SCSI driver? Here is what dmesg has to say on the subject:

SCSI subsystem driver Revision: 1.00
kmod: failed to exec /sbin/modprobe -s -k scsi_hostadapter, errno = 2
Compaq CISS Driver (v 2.4.30)
cciss: Device 0xb178 has been found at bus 1 dev 3 func 0
cciss: using DAC cycles
blocks= 35553120 block_size= 512
heads= 255, sectors= 32, cylinders= 4357

blocks= 860216490 block_size= 512
heads= 255, sectors= 63, cylinders= 53546

blk: queue c042e400, I/O limit 4294967295Mb (mask 0xffffffffffffffff)
Partition check:
cciss/c0d0: p1 p2 p3
cciss/c0d1: p1
Journalled Block Device driver loaded
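As a quick sanity check on the figures the cciss driver reports above, the block counts can be compared against the heads/sectors/cylinders geometry and converted to a capacity. This is only a minimal sketch: the function name `disk_summary` is made up for illustration, and the numbers are copied by hand from the dmesg lines above.

```python
# Hypothetical helper: checks that blocks == heads * sectors * cylinders
# and converts the block count to a size in bytes.
def disk_summary(blocks, block_size, heads, sectors, cylinders):
    chs_blocks = heads * sectors * cylinders
    return {
        "bytes": blocks * block_size,
        "chs_blocks": chs_blocks,
        "consistent": chs_blocks == blocks,
    }


# Values copied from the two "blocks= ... / heads= ..." pairs in dmesg.
LOGICAL_DRIVES = [
    dict(blocks=35553120, block_size=512, heads=255, sectors=32, cylinders=4357),
    dict(blocks=860216490, block_size=512, heads=255, sectors=63, cylinders=53546),
]

if __name__ == "__main__":
    for index, drive in enumerate(LOGICAL_DRIVES):
        info = disk_summary(**drive)
        size_gb = info["bytes"] / 1e9  # decimal gigabytes
        print(f"drive {index}: {size_gb:.1f} GB, geometry consistent: {info['consistent']}")
```

Both logical drives come out internally consistent (roughly 18.2 GB and 440.4 GB), so nothing in the reported geometry itself looks wrong.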

I also noticed from dmesg that the two CPUs report different steppings (07 on CPU0, 09 on CPU1):

CPU0: Intel(R) Xeon(TM) CPU 3.06GHz stepping 07
per-CPU timeslice cutoff: 1462.63 usecs.
task migration cache decay timeout: 1 msecs.
enabled ExtINT on CPU#0
ESR value before enabling vector: 00000000
ESR value after enabling vector: 00000000
Booting processor 1/0 eip 2000
Initializing CPU#1
masked ExtINT on CPU#1
ESR value before enabling vector: 00000000
ESR value after enabling vector: 00000000
Calibrating delay loop... 6093.21 BogoMIPS
CPU: Before vendor init, caps: bfebfbff 00000000 000
CPU: L1 I cache: 0K, L1 D cache: 8K
CPU: L2 cache: 512K
CPU: Physical Processor ID: 0
CPU: After vendor init, caps: bfebfbff 00000000 0000
Intel machine check reporting enabled on CPU#1.
CPU: After generic, caps: bfebfbff 00000000 0000
CPU: Common caps: bfebfbff 00000000 0000
CPU1: Intel(R) Xeon(TM) CPU 3.06GHz stepping 09
Total of 2 processors activated (12153.01 BogoMIPS).
WARNING: No sibling found for CPU 0.
WARNING: No sibling found for CPU 1.
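For anyone wanting to check the same thing on their own box, the stepping comparison can be automated by scanning saved dmesg output. The following is a minimal sketch, not a diagnostic tool: the function names and the regular expression are my own assumptions, based only on the line format shown above (`CPU<n>: ... stepping <xx>`), and other kernels may print this differently.

```python
import re

# Matches lines such as "CPU0: Intel(R) Xeon(TM) CPU 3.06GHz stepping 07".
# Lines like "CPU: L2 cache: 512K" have no CPU number and are ignored.
STEPPING_RE = re.compile(r"^CPU(\d+):.*\bstepping\s+([0-9a-fA-F]+)\s*$")


def cpu_steppings(dmesg_text):
    """Return a dict mapping CPU number to the stepping string dmesg reported."""
    steppings = {}
    for line in dmesg_text.splitlines():
        match = STEPPING_RE.match(line.strip())
        if match:
            steppings[int(match.group(1))] = match.group(2)
    return steppings


def steppings_differ(steppings):
    """True when more than one distinct stepping value was found."""
    return len(set(steppings.values())) > 1


# Sample lines copied from the dmesg output in this post.
SAMPLE = """\
CPU0: Intel(R) Xeon(TM) CPU 3.06GHz stepping 07
CPU: L2 cache: 512K
CPU1: Intel(R) Xeon(TM) CPU 3.06GHz stepping 09
"""

if __name__ == "__main__":
    found = cpu_steppings(SAMPLE)
    print(found)
    print("stepping mismatch" if steppings_differ(found) else "same stepping")
```

To run it against a live system, the output of `dmesg` could be saved to a file and its contents passed to `cpu_steppings` in place of `SAMPLE`.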

The machine also has two integrated Broadcom BCM5703X GigE chips (only one is being used). Can anyone suggest troubleshooting techniques? Are any of these drivers known to be problematic?
The only way out is to power down and back on. I have installed the latest HP Systems ROMPaq Firmware Upgrade Diskette for ProLiant DL380 G3 (P29) Servers, version 4.09 P29-09/15/2004 (11 Oct 04).
There is no information in the event logs apart from 'the previous shutdown was unexpected'.
The server tends to do this when it is under high I/O load.
Any help would be appreciated.


Thanks,

Vinod Pote







1 REPLY
Brian_Murdoch
Honored Contributor

Re: DL380 G3 Keeps hanging on Redhat 8.0 smp kernel

Vinod,

There are later drivers for the Smart Array Controller (CISS driver) and the Broadcom Gigabit NICs.

These can be downloaded from the following link.

http://h18023.www1.hp.com/support/files/server/us/locate/87_4706.html#20

I hope this helps,

Brian