StoreEasy Storage

Proliant DL100 G2 Issues

 
Megan G
Occasional Contributor

Proliant DL100 G2 Issues

I've been having some issues with my DL100 G2 server over the past few weeks. From first glace, it appeared that it was memory related. I would receive the messages below in the event log, and get the usual "insufficient system resources exist", messages when attempting to connect to shares. MMC would lock up, the whole works. I'd have to reboot the system for it to be functional again. I uninstalled Symantec 8.0, disabled RRAS service (had been using it...), and upgraded the RAM from 512 to 2GB. I've been monitoring the page pool using poolmon.exe - Mmst is the highest. However, I haven't applied the registry change discussed in the MS KB: http://support.microsoft.com/kb/304101/en-us. However, since the physical memory upgrade and Symantec uninstall I'm receiving the following error. I have no idea what's going on!

Event Type: Error
Event Source: arc
Event Category: None
Event ID: 11
Date: 1/23/2007
Time: 1:41:00 AM
User: N/A
Computer: FALSTAFF
Description:
The description for Event ID ( 11 ) in Source ( arc ) cannot be found. The local computer may not have the necessary registry information or message DLL files to display messages from a remote computer. You may be able to use the /AUXSOURCE= flag to retrieve this description; see Help and Support for details. The following information is part of the event: \Device\RaidPort0.
Data:
0000: 0f 00 10 00 01 00 60 00 ......`.
0008: 00 00 00 00 0b 00 04 c0 .......À
0010: 11 00 00 00 00 00 00 00 ........
0018: 00 00 00 00 00 00 00 00 ........
0020: 00 00 00 00 00 00 00 00 ........
0028: 00 00 00 00 00 00 00 00 ........
0030: 00 00 00 00 0b 00 04 c0 .......À

=====================================================================================================================
EVENT ERRORS PRIOR TO MEMORY UPGRADE & SYMANTEC UNINSTALL - OCCUR WHEN THE SERVER CRASHES

Event Type: Error
Event Source: NETLOGON
Event Category: None
Event ID: 5719
Date: 1/10/2007
Time: 1:50:18 PM
User: N/A
Computer: FALSTAFF
Description:
This computer was not able to set up a secure session with a domain controller in domain MYNTDOMAIN due to the following:
Not enough resources are available to complete this operation.
This may lead to authentication problems. Make sure that this computer is connected to the network. If the problem persists, please contact your domain administrator.

ADDITIONAL INFO
If this computer is a domain controller for the specified domain, it sets up the secure session to the primary domain controller emulator in the specified domain. Otherwise, this computer sets up the secure session to any domain controller in the specified domain.

For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp .
Data:
0000: 16 00 02 c0 ...À

Event Type: Error
Event Source: Srv
Event Category: None
Event ID: 2020
Date: 1/10/2007
Time: 1:51:34 PM
User: N/A
Computer: FALSTAFF
Description:
The server was unable to allocate from the system paged pool because the pool was empty.

For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp .
Data:
0000: 00 00 04 00 01 00 54 00 ......T.
0008: 00 00 00 00 e4 07 00 c0 ....ä..À
0010: 00 00 00 00 9a 00 00 c0 .... ..À
0018: 00 00 00 00 00 00 00 00 ........
0020: 00 00 00 00 00 00 00 00 ........
0028: 0b 00 00 00 ....

Event Type: Error
Event Source: Application Popup
Event Category: None
Event ID: 333
Date: 1/10/2007
Time: 3:18:56 PM
User: N/A
Computer: FALSTAFF
Description:
An I/O operation initiated by the Registry failed unrecoverably. The Registry could not read in, or write out, or flush, one of the files that contain the system's image of the Registry.

For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp .
Data:
0000: 00 00 00 00 01 00 6c 00 ......l.
0008: 00 00 00 00 4d 01 00 c0 ....M..À
0010: 00 00 00 00 4d 01 00 c0 ....M..À
0018: 00 00 00 00 00 00 00 00 ........
0020: 00 00 00 00 00 00 00 00 ........

Event Type: Error
Event Source: HP Storage Server Management Service
Event Category: None
Event ID: 4160
Date: 1/10/2007
Time: 2:38:47 PM
User: N/A
Computer: FALSTAFF
Description:
Failed to retrieve logical disks. An unexpected exception was thrown.

For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp .

Event Type: Information
Event Source: Application Popup
Event Category: None
Event ID: 26
Date: 1/5/2007
Time: 1:58:08 PM
User: N/A
Computer: FALSTAFF
Description:
Application popup: mmc.exe - Application Error : The application failed to initialize properly (0xc0000142). Click on OK to terminate the application.

For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp .
3 REPLIES 3
Chris.Thomas
Advisor

Re: Proliant DL100 G2 Issues

The Mm pool tags are all used by the memory manager so if poolmon is reporting a steady increase in pool usage this either means a software component is leaking or the server performance cannot keep up with the tasks it is asked to do and is queuing them up in memory.

KB 312362 focusses on the increased stress placed on a server when a backup is running, but you might prefer to follow KB 312362 instead as it covers the same underlying issue.

Having said that, the errors you are seeing in the event log may indicate a hardware issue. Event ID 11 indicates an error has been reported by a disk controller and the inability to contact a disk would cause the 4160 error. You may also see event ID 9 errors in the event log.

A disk error would also cause the HotPlug service to consume pool memory. The other error messages may simply be a consequence of running low on pool resource.

Actions you could take are:

- Run a diagnostic utility against the disks
- Disable the Hotplug service and monitor pool usage
- Enable email alerts in the Adaptec storage manager

If the problem is still not resolved, configure the server to capture a kernel memory dump (see KB 244139) and open a support call with HP. Send us the memory dump as a zipped file and also a copy of the MPS SetupPerf report (see KB 818742)
Megan G
Occasional Contributor

Re: Proliant DL100 G2 Issues

Thanks for the response, Chris.

By the looks of it, KB312362 is very similar to the KB I was looking at, just with fewer registry changes. I have poolmon running all the time on the server, in hopes that when it becomes unresponsive, I can easily check to see what's up. I've attached a screenshot I took this morning incase there's any information in there I may have missed. I have also installed Symantec again (hence why RadS is up there), but running 8.1 this time instead of 8.0. So far, the server has been up and running for nearly 4 days (usually died between 3-5 days).

Just a few questions about the disks......number one - how do I run a diagnostics? I contacted the HP storage teams in regards to this and the gentleman handling my case said there was no way to run proper diagnostics with the server I had. Opening the adaptec storage manager in windows doesn't indicate any errors with the disks. Each disk is running firmware level BANC, and they're Maxtor 6L160M0. I'd also like to purchase a spare drive for this....BUT, 2 people at HP tell me I should be purchasing part number 353043-001, and two others said part number 399467-001. This of course leads me to my next question - Is there any documentation you could point me to that clearly states exactly which drive I require? It's very frustrating when I receive responses such as this, and no one can point me to documentation to back their "choice" up........I'd be in deep doo-doo if I purchased the wrong one and it failed me.
Megan G
Occasional Contributor

Re: Proliant DL100 G2 Issues

I should also let you know that the specific model of the server is AE436A.