- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - OpenVMS
- >
- Re: SYSGEN
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 12:08 AM
03-20-2007 12:08 AM
SYSGEN
Can you please let me know how to find process that are swapped out of memory?
The users sessions are getting hanged often & CPU usage is ~ 100%,so I took a lok at balsetcnt...
The value of MAXPROCESSCNT is 521 & BALSETCNT is 207...which I find is not correct.
Total memory is ~ 1GB
But,when I run
$ Show mem/slots
swapped is zero.
So,I guess no swapping is occuring.
Is the value of balsetcnt right or should be changed to 519....Please advice..
Thanks,
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 12:27 AM
03-20-2007 12:27 AM
Re: SYSGEN
Help show sys/state will help you.
The state you need is RWSWP but it could be one of the others too.
Wim
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 12:29 AM
03-20-2007 12:29 AM
Re: SYSGEN
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 12:48 AM
03-20-2007 12:48 AM
Re: SYSGEN
Please do NOT blindly raise BALSETCNT !
You risk that your system does not boot.
This can be fixed via conversational boot, but you do need access to the console.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 01:21 AM
03-20-2007 01:21 AM
Re: SYSGEN
which can help when changing SYSGEN
parameters.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 01:26 AM
03-20-2007 01:26 AM
Re: SYSGEN
But maybe it was AUTOGEN which lowered BALSETCNT because there is a large VA configured and the system might otherwise run out of S0 space. I have seen this on OpenVMS VAX V7.1 many years ago and I don't recall what platform FOX is working on.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 01:34 AM
03-20-2007 01:34 AM
Re: SYSGEN
Based on the available information, the system requires some tuning, additional hardware, reduced load, replacement, or a combination.
Looking at the slots and processes is a factor, but there's a whole lot more information required. It's not clear (to me) around the thought sequence that led from from hung processes and max CPU load to the balance set. (You're going to have to explain that.)
Start with either the T4 tools or a monitor recording pass (or both), and start gathering baseline data on the current system load. And at what the application(s) are doing, and at the particular hangs and aberrant behavior.
Given the discussion of the balance set slots and the reported physical memory, I have to assume that this is an OpenVMS VAX box, and specifically VAX 6000 Model 600, VAX 7000, or VAX 10000 series box. Old and slow. Big and power-hungry, too. By coincidence, I posted some details of VAX XPA and XVA just a week ago, over at: http://64.223.189.234/node/131
Do you have the source code for your applications, or a way to move off these boxes? You're working very near or even at the architectural limits of VAX here, which means that if tuning doesn't work, you are left to either off-load or re-schedule or migrate the applications to OpenVMS Alpha or OpenVMS I64. An rx2660 would likely greatly outperform this VAX box, for instance.
Stephen Hoffman
HoffmanLabs
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 01:37 AM
03-20-2007 01:37 AM
Re: SYSGEN
But first some tuning is needed, and may be some memory should be added.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 02:03 AM
03-20-2007 02:03 AM
Re: SYSGEN
The problem is a suspended process is taking ~100% CPU usage.
There is a parent process which is creating a sub process & after a while the sub process becomes non existent but when monitored using mon proc/topcpu that process is consuming 100% cpu usage.
I guess that is the reason other user is getting hanged sessions.
The VMS box is DS20E running on 7.3-2.
It was working fine but only after patching with update 11....these problems are occuring.
So,I while going through sysgen parameter noticed balsetcnt.....
Any suggestion would be very helpful.
Thanks
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 02:41 AM
03-20-2007 02:41 AM
Re: SYSGEN
so it is Alpha - that is good news.
The balsetcount vs. VA problem does not apply.
My guess: either someone changed SYSGEN params without using AUTOGEN, or, in MODPARAMS MAXPROCESSCNT as well as BALSETCNT are hard-coded values, and the AUTOGEN warnings were ignored.
In the latter case, simply remove BALSETCNT from modparams.dat (and files invoked by it, if applicable) and let AUTOGEN recalculate it.
Probably it is best to first do a
@AUTOGEN SAVPARAMS TESTFILES FEEDBACK
which reports, does not yet change anything, and review the report.
hth
Proost.
Have one on me.
jpe
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 03:49 AM
03-20-2007 03:49 AM
Re: SYSGEN
A "suspended" process accumulating CPU would initially appear to be a lost I/O, or lost quota, or other kernel-level error. A much more detailed look at the process and at the loop is going to be required here. This may well prove to be a bug in the UPDATE kit, for instance.
If these systems have a support contract in place, you will want to avail yourself of its benefits; to contact the HP customer support center.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 03:58 AM
03-20-2007 03:58 AM
Re: SYSGEN
Is the process gone or isn't it ? Use sh proc/id=xxx to verify. Is mon proc/topcpu giving the same pid for the process ? What do you find in process accounting ? Operator log ? Audit trail ?
Did you reboot after installing the patches ?
Are all slots taken when you do show mem/slot ?
Wim
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 04:01 AM
03-20-2007 04:01 AM
Re: SYSGEN
Thanks to all for prompt replies.
The problem is a suspended process is taking ~100% CPU usage.
---
See this thread for similar spinning process
discussion:
http://forums1.itrc.hp.com/service/forums/questionanswer.do?threadId=1105743
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 04:02 AM
03-20-2007 04:02 AM
Re: SYSGEN
If it worked before the ECO and doesn't work after . . .
Did you install the patch with /save_recovery? Type:
$ PRODUCT SHOW RECOVERY_DATA
You can use PRODUCT UNDO PATCH to restore your system to it's previous state, this can be a quick way to eliminate the ECO as the cause or confirm the cause issue.
I've used UNDO and it really annoys the Windows types.
Andy
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 07:12 AM
03-20-2007 07:12 AM
Re: SYSGEN
I've used UNDO and it really annoys the Windows types.
<<<
Microsoft Windows XP restore points have provided this capability for some time now.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 09:23 AM
03-20-2007 09:23 AM
Re: SYSGEN
DCL's SHOW PROCESS command gives the highly-misleading message "process is suspended" if you use it on a process stuck in any of the MWAIT states (MUTEX, RW*) or for any process that has the delete-pending bit set.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 05:42 PM
03-20-2007 05:42 PM
Re: SYSGEN
For the whole system, yes. But even on Windows 2000 I was able to roll back (uninstall) a single patch.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 09:59 PM
03-20-2007 09:59 PM
Re: SYSGEN
When I issue the command
$ sh proc/id=XXXXXXX
The message appears suspended and after a while it becomes non existent but,when
$ sh sys
is issued that process is listing as current.
Each time the process becomes non-existent, anew sub-process is created.This event is repeating.
Memory slots are not taken completely,there are free slots available.
I did install the patches using /sav qualifier.
Once the processes attains 100% CPU usage,it goes into suspended mode....How does this happen?
I think for this reason other users are affected.
Thanks.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 10:33 PM
03-20-2007 10:33 PM
Re: SYSGEN
SHOW PROC/ID=xxx may issue a 'supended' message, if it can't get data from the remote process. Only SHOW SYS will tell the true status of that process.
Do you have a system with more than 1 CPU ? Only in that case, you could see another process in a CUR state - otherwise, it will always be your process, in which you have issued the SHOW SYS command.
Are you saying, that there is some process in your system, which seem to be creating sub-processes, which then consume 100% of CPU time and then disappear ?
You can look at accounting to find out, why those subprocesses disappear (exit status) and how long they were active and how much CPU they consumed:
$ ACC/FULL/SINCE=time/TYPE=PROCESS/PROCESS=SUBPROCESS
Volker.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-20-2007 10:34 PM
03-20-2007 10:34 PM
Re: SYSGEN
I've seen process termination taking 5 seconds (due to cleaning up ?).
Did you reboot ?
Wim
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-21-2007 09:34 AM
03-21-2007 09:34 AM
Re: SYSGEN
>Once the processes attains 100% CPU
>usage,it goes into suspended mode....
>How does this happen?
As others have said, just because SHOW PROCESS/CONTINUOUS says a process is suspended, doesn't mean it is in SUSP state. It just means that, for whatever reason, SHOW PROCESS can't access process private data. Use:
$ WRITE SYS$OUTPUT F$GETJPI(pid,"STATE")
to determine the real state of the process.
To reduce the impact on your system, perhaps you should really suspend the process:
$ SET PROCESS/SUSPEND/ID=pid
or lower its priority
$ SET PROCESS/PRIORITY=0/ID=pid
Note that this may not reduce the processes consumption of CPU, but it will put it at the back of the queue of processes competing for CPU. Also realise that just because the system says a process is using 100% CPU, if that were really the case, you couldn't do anything (because you use the CPU to do stuff). A runaway compute bound process will use whatever CPU is left over. OpenVMS does a fairly good job of reducing the impact of a runaway process by giving other processes priority boosts.
You can use ANALYZE/SYSTEM to see what the process is doing. Start with:
$ ANALYZE/SYSTEM
SDA> SET PROCESS/INDEX=pid
SDA> SHOW PROCESS/CHANNEL
This should tell you what program or procedure it's running. If the process is suspended, you can use SHOW CALL and SHOW CALL/NEXT to examine the call stack. If you have a link map and source code, with a bit of perseverence it's possible to identify exactly where in the program it's looping.
Another trick is SET PROCESS/DUMP=NOW this should write a process dump, which you can then examine using DEBUG and/or ANALYZE/CRASH.
Final point - the danger of having BALSETCNT much lower than MAXPROCESSCNT is that once you have more than BALSETCNT processes, the excess MUST be outswapped. In itself, not a problem, but if you have processes that scan the system (say an Idle Process Killer), if it's not written very carefully, it will "chase" the outswapped processes down the PID array, resulting in excessive swapping activity. If the scan interval is short enough, you can put your system into a thrashing state. FWIW, from your symptoms I don't believe this is happening, but with the resources you have, I can't think of a valid reason for limiting BALSETCNT so severely.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-22-2007 03:18 AM
03-22-2007 03:18 AM
Re: SYSGEN
Thanks to all for your prompt replies.
The problem got solved today,it was due to a corruption in application file due to which the process was looping.
However,I triggered autogen & it suggested a few changes which I think should have done long back.
Thanks.