- Community Home
- >
- Servers and Operating Systems
- >
- HPE ProLiant
- >
- ProLiant Servers (ML,DL,SL)
- >
- Re: Server continues down
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Forums
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-27-2008 07:13 PM
тАО10-27-2008 07:13 PM
We have HP ProLiant DL 585 Server which continuesly downs sometimes. Then we have to switch on it & start running applications.
I tried to investigate the reasons of downing, but couldn't determine yet.
It usually downs when there is high software load on the server.
Its memory is:
MemTotal: 65849012 kB
CPU x8:
AMD Opteron (tm) Processor 885
What do you think how I should investigate it?
Please share me your experience
Thank you very much
Solved! Go to Solution.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-27-2008 07:55 PM
тАО10-27-2008 07:55 PM
Re: Server continues down
Check for any memoy faults.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-27-2008 08:36 PM
тАО10-27-2008 08:36 PM
Re: Server continues down
just shuts down??? or is it a reboot?
any error in integrated management log?
windows event log errors? (assuming is windows you did not mention)
have you replaced any part so far?
bye
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-27-2008 09:33 PM
тАО10-27-2008 09:33 PM
Re: Server continues down
RedHat 5 installed on it. Kernel updates were installed until March 5, 2008.
Linux version 2.6.18-53.1.14.el5
Storage controller:
HP Storage Works
Modular Smart Array 1000
HP intergated log(hplog -v) is shown here.
ID Severity Initial Time Update Time Count
-------------------------------------------------------------
0003 Repaired 23:59 05/31/2007 13:48 08/29/2007 0001
LOG: ASR Detected by System ROM
0004 Repaired 15:58 08/28/2007 13:48 08/29/2007 0001
LOG: Corrected Memory Error threshold exceeded (Slot 2, Memory Module 7)
0005 Repaired 15:58 08/28/2007 13:48 08/29/2007 0001
LOG: Corrected Memory Error threshold exceeded (Slot 2, Memory Module 5)
0006 Repaired 16:00 08/28/2007 13:48 08/29/2007 0001
LOG: Corrected Memory Error threshold exceeded (Slot 2, Memory Module 3)
0007 Repaired 16:03 08/28/2007 13:48 08/29/2007 0001
LOG: Corrected Memory Error threshold exceeded (Slot 2, Memory Module 1)
0008 Repaired 21:24 08/28/2007 13:48 08/29/2007 0001
LOG: Corrected Memory Error threshold exceeded (Slot 2, Memory Module 3)
0009 Repaired 21:35 08/28/2007 13:48 08/29/2007 0001
LOG: Corrected Memory Error threshold exceeded (Slot 2, Memory Module 7)
0010 Repaired 21:37 08/28/2007 13:48 08/29/2007 0001
LOG: Corrected Memory Error threshold exceeded (Slot 2, Memory Module 5)
0011 Repaired 22:02 08/28/2007 13:48 08/29/2007 0001
LOG: Corrected Memory Error threshold exceeded (Slot 2, Memory Module 1)
0012 Repaired 13:20 08/29/2007 13:48 08/29/2007 0002
LOG: ASR Detected by System ROM
0045 Caution 11:31 10/19/2008 11:31 10/19/2008 0001
LOG: Corrected Memory Error threshold exceeded (Slot 4, Memory Module 4)
0046 Caution 11:40 10/19/2008 11:40 10/19/2008 0001
LOG: Corrected Memory Error threshold exceeded (Slot 4, Memory Module 8)
Today October 28, 2008 server was down again. But no log about this event in the hplog & /var/log/messages. It is just shutdown, we have to switch on the power button in order to operate.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-27-2008 10:10 PM
тАО10-27-2008 10:10 PM
Solutiondo you know if your server has this System ROM version
http://h20000.www2.hp.com/bizsupport/TechSupport/SoftwareDescription.jsp?lang=en&cc=us&prodTypeId=15351&prodSeriesId=398220&swItem=MTX-9d84c7ed390a4c14990482f45f&prodNameId=3288126&swEnvOID=1005&swLang=8&taskId=135&mode=5
bye
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-28-2008 12:16 AM
тАО10-28-2008 12:16 AM
Re: Server continues down
We updated the system BIOS version from ProLiant DL585 (A01) (2006-01) to A01 (2007-02-14).
Thank you again
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-28-2008 03:54 AM
тАО10-28-2008 03:54 AM
Re: Server continues down
Then I go to the server, Therm Trip & TEMP LEDs colour were orange. I think it may be thermal shutdown. But only this server still downs, other servers not.
What is your opinion? How should we investigate?
Thank you
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-28-2008 02:19 PM
тАО10-28-2008 02:19 PM
Re: Server continues down
there are too many memory errors on slot2 and a few on slot 4....
What is the memory configuration and # of processors used....
Also mention the memory type..
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-07-2008 01:25 AM
тАО11-07-2008 01:25 AM
Re: Server continues down
AMD Operaton
Four PC2700 DIMMs 266MHz
2.6 GHz (1 MB L2)
Thanks