- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - Linux
- >
- Re: Server Hangs every 3 months
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Forums
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-22-2008 11:55 AM
тАО02-22-2008 11:55 AM
Server Hangs every 3 months
We are using ILO cards in these servers and I am not sure if this is the culprit or not.. There is nothing in the logs that show any sort of problem.
uname -a output
Linux ########## 2.6.5-7.286-bigsmp #1 SMP Thu May 31 10:12:58 UTC 2007 i686 athlon i386 GNU/Linux
anyone have any ideas, both Novell and HP are unable to come up with anything.
Thanks,
Larry
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-23-2008 03:08 PM
тАО02-23-2008 03:08 PM
Re: Server Hangs every 3 months
When I had a similar problem I configured a remote syslog server because the system was hang and cannot write to disk, but was able to send the message over the network and more infor was obtained to troubleshoot the problem.
Install collectl and enable performance logging. You could have an idea of what was going on in the system at the time of the hang.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-24-2008 05:40 PM
тАО02-24-2008 05:40 PM
Re: Server Hangs every 3 months
http://kbase.redhat.com/faq/FAQ_80_5559.shtm
The 'c' character will simulate a crash. Time the core creation so that this will give you a guideline if there is a crash again. Do not manually reboot until AFTER the core is created.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-26-2008 04:34 AM
тАО02-26-2008 04:34 AM
Re: Server Hangs every 3 months
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-27-2008 10:43 AM
тАО02-27-2008 10:43 AM
Re: Server Hangs every 3 months
Unable to handle kernel NULL pointer dereference at virtual address 00000174
printing eip:
c015ff0f
*pde = 21d58001
Oops: 0000 [#1]
SMP
CPU: 0
EIP: 0060:[
EFLAGS: 00010286 (2.6.5-7.287.3-bigsmp SLES9_SP3_BRANCH-20071002073136)
EIP is at blk_queue_bounce+0xf/0x310
eax: 00000000 ebx: f510dc68 ecx: 00000000 edx: d1bb5b44
esi: 00000000 edi: 00000000 ebp: 00000008 esp: d1bb5af0
ds: 007b es: 007b ss: 0068
Process novell-zislnxd (pid: 22349, threadinfo=d1bb4000 task=f2769980)
Stack: 00000001 00000001 cdfd2c50 ca852720 00000003 dabbee48 d1bb5b44 00000000
f510dc68 00000000 00000000 00000008 c026e21b f510de04 00000046 00000000
00000000 00000000 00000008 00000008 faa570a0 f510dc68 f510dc68 f510dc04
Call Trace:
[
[
[
[
[
[
[
[
[
[
[
[
[
[
[
[
[
[
[
[
[
[
[
[
[
[
[
[
[
[
[
[
Code: f6 80 74 01 00 00 01 0f 85 e4 01 00 00 8b 54 24 1c a1 c8 8f
done waiting: 3 cpus not responding
Dumping to block device (104,1) on CPU 0 ...
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-29-2008 07:55 AM
тАО02-29-2008 07:55 AM
Re: Server Hangs every 3 months
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-02-2008 04:16 AM
тАО03-02-2008 04:16 AM
Re: Server Hangs every 3 months
The important thing to remember if you run collectl or even sar is to have a farily high monitoring frequency and I know most sar users monitor once every 10 minutes. By default collectl monitors once every 10 seconds, but even at that frequency it typically uses <0.1% of the cpu.
Once you've collected a pile of data with it you can then play it back and look at a variety of data in a variety of formats showing most of the types of things sar shows and then some. The key things I'd look for are system resources that are going up in consumption as well as what was going on at the time of the 'lock up', assuming you know the approximate time.
One resource people often miss (probably because there aren't any other utilities I know of that will log their usage) is 'slabs'. Collectl will show you the amount of memory allocated to slabs when you show memory usage but if in fact you think you are seeing an issue, you can also look at changes over time to individual slabs. Since slab monitoring (and process monitoring too for that matter) are more expensive to monitor than the other types of data, those subsystems are monitored once a minute in order to stay within that <0.1% overhead window.
Just keep in mind that by default collectl will write its data to a log in /var/log/collect, creating a new log every day and retaining 7 previous ones. If you do need to keep more, you can modify the number in /etc/collectl.conf.
check it out at http://collectl.sourceforge.net/ and enjoy
-mark
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-04-2008 04:59 AM
тАО03-04-2008 04:59 AM
Re: Server Hangs every 3 months
Contact Novell support. It looks like a problem in Novell ZENworks Linux Management.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-04-2008 05:42 AM
тАО03-04-2008 05:42 AM