
Unknown reason for system hang

 
SOLVED
Paul Thomson_2
Super Advisor

Unknown reason for system hang

Hi

One of my servers is running RH AS 3 Update 3, kernel 2.4.21-20.

The system could not be accessed via ssh or via iLO. Once the password was entered, the session would hang and never proceed past that point.

All I have to go on is /var/log/maillog, which shows messages such as:
Jul 16 21:10:23 esmadl01 sendmail[1903]: rejecting connections on daemon MTA: load average: 48
Jul 16 21:11:08 esmadl01 last message repeated 3 times

This repeats for days until the load average reaches 1015!
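To see how the load climbed over time, the rejection messages can be pulled out of the maillog. This is only a rough sketch: the regular expression is written against the single log line quoted above, and the function names load_samples and first_crossing are made up for illustration.

```python
import re

# Pattern built from the one sendmail line quoted in this post; other
# sendmail versions may word the message differently (assumption).
LOAD_RE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+\d{2}:\d{2}:\d{2})\s+\S+\s+sendmail\[\d+\]:\s+"
    r"rejecting connections on daemon \S+: load average: (?P<load>\d+)"
)


def load_samples(lines):
    """Return (timestamp, load) pairs for every rejection message found."""
    samples = []
    for line in lines:
        match = LOAD_RE.match(line)
        if match:
            samples.append((match.group("stamp"), int(match.group("load"))))
    return samples


def first_crossing(samples, threshold):
    """Return the first sample whose load is at or above threshold, else None."""
    for stamp, load in samples:
        if load >= threshold:
            return (stamp, load)
    return None
```

Feeding it an open file handle on /var/log/maillog gives the sequence of load values. "last message repeated N times" lines are skipped, so the result shows the trend rather than every occurrence.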

I have no real idea what caused this. We do collect some sar data, but it stopped at 19:30 on the same date (Jul 16).

I have attached it as sar.txt.

The only strange thing is that dentunusd and pgins go high. Any ideas why the system would do this?

Any thoughts are welcome, as I am a little stuck.
dmesg / messages show nothing :-(

Thanks
Argh ye land lovers !
3 REPLIES
Paul Thomson_2
Super Advisor

Re: Unknown reason for system hang

Also, this server runs veritas volume manager, and oracle 9.2

This server uses no external storage, only the internal disk cciss0: HP Smart Array 5i Controller.

Argh ye land lovers !
Solution

Re: Unknown reason for system hang

Paul,

Take a look at this:

https://bugzilla.redhat.com/bugzilla/long_list.cgi?buglist=117400

Looks like there might be a race condition with Veritas and certain versions of the 2.4 kernel where dentry_stat is updated to a negative number.
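A quick way to check for this on a running box is to read /proc/sys/fs/dentry-state and see whether the counters have gone negative. A minimal sketch, assuming the first two integers in that file are nr_dentry and nr_unused (the second, as far as I know, is what sar reports as dentunusd). The helper names below are made up.

```python
def parse_dentry_state(text):
    """Parse the contents of /proc/sys/fs/dentry-state.

    Only the first two fields are used; they are assumed to be
    nr_dentry and nr_unused.
    """
    fields = [int(token) for token in text.split()]
    if len(fields) < 2:
        raise ValueError("expected at least two integers in dentry-state")
    return {"nr_dentry": fields[0], "nr_unused": fields[1]}


def has_negative_counter(state):
    """True if either dentry counter is below zero."""
    return state["nr_dentry"] < 0 or state["nr_unused"] < 0


def read_dentry_state(path="/proc/sys/fs/dentry-state"):
    """Read and parse the live counters from the given path."""
    with open(path) as handle:
        return parse_dentry_state(handle.read())
```

Calling has_negative_counter(read_dentry_state()) from cron and logging the result would at least show whether the counters go negative before the next hang.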

- Alex
George Liu_4
Trusted Contributor

Re: Unknown reason for system hang

The sar output shows nothing interesting. If the system works again after a power cycle, do all the file systems look normal (size, missing links, etc.)?
I did see one Dell 6850 lose a filesystem while running, but it was fine after a reboot.
Check to see if there is any kernel update available.