- Community Home
- >
- Servers and Operating Systems
- >
- Legacy
- >
- HPE 9000 and HPE e3000 Servers
- >
- memory errors
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
04-04-2002 07:06 AM
04-04-2002 07:06 AM
here is the message: Any ideas what this means?
Title: dm_memory
Command: /usr/sbin/stm/uut/bin/tools/monitor/dm_memory
Vendor: Hewlett-Packard Company
Version: B.01.00
Monitor PID: 5784
Resources currently monitored:
/system/events/memory/192
Thanks
Solved! Go to Solution.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
04-04-2002 07:12 AM
04-04-2002 07:12 AM
Re: memory errors
Please search some logfiles accompagnied to this monitor or specify the actual errors...
Regards,
Ceesjan
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
04-04-2002 07:21 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
04-04-2002 07:21 AM
04-04-2002 07:21 AM
Re: memory errors
Mar 14 15:19:34 system EMS [4863]: ----- EMS Monitor Restart ----- Title: d
m_memory Command: /usr/sbin/stm/uut/bin/tools/monitor/dm_memory Vendor: Hew
lett-Packard Company Version: B.01.00 To obtain a list of currently monit
ored resources, execute the following: /opt/resmon/bin/resdata -M 3111254123
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
04-04-2002 07:22 AM
04-04-2002 07:22 AM
Re: memory errors
Check this out:
http://us-support3.external.hp.com/cki/bin/doc.pl/sid=cabbc6191ad3008cf4/screen=ckiDisplayDocument?docId=200000058669477
HTH,
Shiju
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
04-04-2002 07:31 AM
04-04-2002 07:31 AM
Re: memory errors
-- Logtool Utility: View Memory Report --
System Start Time Sun Jul 1 17:16:55 2001
Last Error Check Time Thu Apr 4 10:30:03 2002
Logging Time Interval 120
Extender Card in Slot EXT0
==========================================================
DIMM Slot: 0b
Error Type: Single/unconfirmed: single-bit error that
could not be confirmed as either soft or hard.
Page Status: Pending: page could not be obtained.
Bit Num: 16
Logged By: Memlogd
First Detected: Wed Apr 3 23:16:08 2002
Last Detected: Thu Apr 4 10:30:03 2002
Error Count: 323
Error Addr: 0xf6b801
==========================================================
Extender Card in Slot EXT0
==========================================================
DIMM Slot: 0b
Error Type: Single/soft: unrepeatable single-bit error.
Page Status: Deallocated: page is no longer in use.
Bit Num: 16
Logged By: Memlogd
First Detected: Wed Apr 3 23:12:07 2002
Last Detected: Wed Apr 3 23:34:15 2002
Error Count: 2
Error Addr: 0x68307ec1
==========================================================
Extender Card in Slot EXT0
==========================================================
DIMM Slot: 0b
Error Type: Single/unconfirmed: single-bit error that
could not be confirmed as either soft or hard.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
04-04-2002 07:32 AM
04-04-2002 07:32 AM
Re: memory errors
# /opt/resmon/bin/resdata -M 3111254123
the "311.." is the "monitor key", hopefully it'll show you more detail what's going on. I still think your best bet is look at cstm log file as suggested earlier.
Also run
# /opt/resmon/bin/resdata -h
for more detail of "resdata" syntax.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
04-04-2002 08:03 AM
04-04-2002 08:03 AM
Re: memory errors
Good news is the page can be deallocated upon detection of the error. I've seen single bit error which could not be deallocated and as a result the system has to be rebooted to deallocate 'em. Bad news is the system cannot determine if it's soft or hard error. Meaning there is high possibility this can be a hardware problem (ie memory problem). It's time to call HP for further cause of action. I would not be surprise if the memory has to be replaced.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
04-04-2002 09:49 AM
04-04-2002 09:49 AM
Re: memory errors
A single bit error indicates that there was a threshold error count hit, that generated the message. These kinds of errors are always recoverable by the system.
The problem comes about with where the errors are occurring in memory -- user area versus system area. If the error occurs in the system area, this can bother the system, so, it deallocates memory and reassigns it somewhere else that is not have single-bit errors.
If the error count gets too high, it might be a candidate for a hardware repair, but this is a rarity.
The memory errors to rally be concerned with are multi-bit errors -- if this should occur, then the Error Correction algorithm may not be able to correct the error -- hence, causing an irrecoverable parity error. Hardware would definitely need to be addressed at that point.
Hope this helps.
Ross Martin