- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - OpenVMS
- >
- Re: DS15 - LOCKMGRERR crash
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-02-2006 12:24 AM
02-02-2006 12:24 AM
yesterday the DS15 in our cluster crashed. This was especially unpleasant to the users of one application that was dedicated assigned to that node.
Also, the crash error does not sound very assuring...
SDA sh cras & SDA clue cras attached
The crash has also been forwarded though our support channel, but I guess this will be quicker, also because the support channel is not exactly a direct route...
Proost.
Have one on me.
jpe
Solved! Go to Solution.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-02-2006 12:25 AM
02-02-2006 12:25 AM
Re: DS15 - LOCKMGRERR crash
Proost.
Have one on me.
jpe
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-02-2006 12:29 AM
02-02-2006 12:29 AM
Re: DS15 - LOCKMGRERR crash
Clue crash
clue config
clue register
clue stack
That would be helpful
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-02-2006 12:35 AM
02-02-2006 12:35 AM
Re: DS15 - LOCKMGRERR crash
crash see previous,
find the other 3 attached.
Proost.
Have one on me.
jpe
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-02-2006 12:37 AM
02-02-2006 12:37 AM
Re: DS15 - LOCKMGRERR crash
crash see previous,
find the other 3 attached.
Proost.
Have one on me.
jpe
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-02-2006 01:05 AM
02-02-2006 01:05 AM
Re: DS15 - LOCKMGRERR crash
Wim
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-02-2006 01:14 AM
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-02-2006 05:08 AM
02-02-2006 05:08 AM
Re: DS15 - LOCKMGRERR crash
I KNEW you would beat the official support!
So, just a matter of a parameter adjustment after all.
Boy, am I glad it is not really something more serious (well, _I_ suspected it hardly could be some inherent fault, but there ARE those, that would like nothing better than pointing at VMS with evidence of potential harm to data integrety, as could easily happen when LockManager should be at fault!)
Proost.
Have one on me.
jpe
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-02-2006 11:01 PM
02-02-2006 11:01 PM
Re: DS15 - LOCKMGRERR crash
this system seems to be badly tuned in general, look at:
System Uptime: 1 00:37:35.82
EXE$GL_FLAGS: poolpging,init,bugdump,pgflfrag,pgflcrit,pagfildmp
To find about nonpaged pool expansion problems, see:
SDA> CLUE MEM/STAT
The LKBs and RSBs are allocated from S2 space:
SDA> SHOW PAGE/S2/FREE
To look at LCKMGR pool zone counters, use:
SDA> exa @LCK$AR_POOLZONE_REGION+80;20
The counters are (quadwords from right to left): hits, misses, expansions, failures.
Volker.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-03-2006 12:23 AM
02-03-2006 12:23 AM
Re: DS15 - LOCKMGRERR crash
pgflfrag, pgflcrit show that the pagefile was full or nearly so at some time.
Purely Personal Opinion
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-03-2006 12:38 AM
02-03-2006 12:38 AM
Re: DS15 - LOCKMGRERR crash
This node was rebooted after 159 days because of the tape MDR: the driver for $2$MGA had received a wrong SCSI bitmask. Obviously a know problem, and only to be cleared by reboot. (and NO patches coming anymore, because MDR is EOL! How did that stuff EVER qualify for use under VMS?)
24 hours after the reboot this crash happened.
Clue mem/stat:
Successful pool expansions : 0
Unsuccessful pool exp : 0
Various "Failed" stats: all are 0
SHOW PAGE/S2/FREE:
not sure how to interpret what I see.
Mapped addr:
counting down in steps of %X4000, 8000, C000, 10000, 20000 for the first couple of pages
PTE addr:
conting down in (irregular?) multiples of 4, like 18, 30, 1C , C0
PTE:
counting down in rather big steps (all ending 0000)
Count:
small numbers, single digit except the last one: 3F7
But what does that mean?
exa @LCK$AR_POOLZONE_REGION+80;20
4F9A6A - 25C1 - 445A 1A
Again, what does that mean?
system seems to be badly tuned in general
Care to elaborate?
Any suggestions for improvement?
Proost.
Have one on me.
jpe
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-03-2006 12:49 AM
02-03-2006 12:49 AM
Re: DS15 - LOCKMGRERR crash
Indeed, that is what HELP/MESS INSVIRMEM offers as possibility, and I already installed an extra Gb of pagefile. But it makes me wonder WHY all of a sudden (after a reboot!!) so much pagefile was needed, because we monitor pagefile use, and try to never need it whatsoever.
(Then again, this IS the one small machine in the cluster).
Proost.
Have on on me.
jpe
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-03-2006 12:56 AM
02-03-2006 12:56 AM
Re: DS15 - LOCKMGRERR crash
EXE$GL_FLAGS: ...,pgflfrag,pgflcrit,...
This says, that the page file has been severely fragmented and critically full during the uptime of the system (which is just 1 day). Look at the current situation at the time of the crash with:
SDA> CLUE MEM/FILES
SDA> SHOW PAGE/S2/FREE shows the amount of free PTEs in the S2 free page list. If the lock manager needs to allocate more RSBs and LKBs, it may need to expand it's pool zone in S2 space and would need some free S2 PTEs. Only the count fields would be interesting.
Were there any free physical pages SDA> SHOW PFN/FREE ?
If you've copied the LCKMGR POOLZONE counters from right to left, it would be:
hits: 4F9A6A
misses: 25C1
expansions: 445A
failures: 1A <<< normally this counter is 0
NOTE: you've seen an INSFMEM error, not an INSVIRMEM ! Lock manager resources are in S2 space, which is NOT paged, so pagefile space problems cannot cause this crash.
If this is 'the small machine' in the cluster, it might just not have had enough resources to receive the lock/resource tree being moved to it.
Volker.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-03-2006 01:05 AM
02-03-2006 01:05 AM
Re: DS15 - LOCKMGRERR crash
NOTE: you've seen an INSFMEM error, not an INSVIRMEM
Sorry, typo in the posting. I used the actual message in HELP.
SHOW PFN/FREE
*** List is empty ***
Looks we pinned it down!
Maybe a budget request for more memory is in order.
A bigger pagefile has already be installed.
Thanks!
Proost.
Have one on me.
jpe
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-03-2006 01:17 AM
02-03-2006 01:17 AM
Re: DS15 - LOCKMGRERR crash
Proost.
Have one on me.
jpe
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-03-2006 01:34 AM
02-03-2006 01:34 AM
Re: DS15 - LOCKMGRERR crash
More memory is always a good thing.
Purely Personal Opinion
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-03-2006 04:36 AM
02-03-2006 04:36 AM
Re: DS15 - LOCKMGRERR crash
maybe - just maybe - you've run BACKUP to test access to the tape after the reboot ? And backup has used lots of memory and pulled over the resource tree of the disk (due to it's lock activity) ?
Volker.