07-18-2006 05:42 PM
Each VMS system has dual NICs configured with
separate IP addresses and IP failover.
When I do:
$ set displ/node="192.168.xx.xx" /transp=tcpip/create/perm
%SYSTEM-F-DEVLISTFULL, device unit limit reached; no unit added
Whilst this example is for terminal (display) creation, the Oracle listener (for example) can't create any more processes either.
Had to reboot to work around the problem.
$ sysconfig -q net
arpkillc = 1200
arpkilli = 180
arprefresh = 120
arpunicast = 1
ifqmaxlen = 1024
lo_devs = 1
lo_def_ip_mtu = 4096
nslip = 0
ovms_unit_count = 208
ovms_unit_creates = 9369551
ovms_unit_fast_credel = 1
ovms_unit_limit = 32766
ovms_unit_minimum = 2
ovms_unit_maximum = 32767
ovms_unit_seed = 18969
Any ideas, anyone?
07-18-2006 05:51 PM
It looks like you've reached the maximum number of BG devices on your system. The limit should be 32767 on OpenVMS V7.3-2.
$ SHO DEV BG[/FULL] should show them all, $ TCPIP SHOW DEV[/FULL] should list their TCPIP attributes.
Maybe some process is creating those devices (sockets) and not freeing/deassigning them. You should watch the number of BG devices on your system to see whether it is constantly growing, and then try to find out who is using them:
$ pipe show dev bg | search/stat sys$pipe bg/noout
(look for Records matched count).
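To watch the count over time, the command above can be wrapped in a small loop. This is only a sketch: the 10-minute interval is an arbitrary assumption, and the procedure runs until interrupted (Ctrl-Y) or until the batch job is deleted.

```
$! Hypothetical monitoring loop: print a timestamp and the BG device
$! count (the "Records matched" line) every 10 minutes.
$loop:
$ SHOW TIME
$ PIPE SHOW DEVICE BG | SEARCH/STATISTICS SYS$PIPE BG/NOOUTPUT
$ WAIT 00:10:00
$ GOTO loop
```

Comparing the "Records matched" values between iterations shows whether the number of BG devices is growing steadily.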
07-18-2006 05:58 PM (Solution)
Next time this happens, force a crash (>>> CRASH) instead of just rebooting your server. You then have plenty of time to dig through the dump and find out who or what is using up all your BG devices (or is not freeing them).
As already said, you can also proactively watch the number of BG devices and take action before they reach 32767.
07-18-2006 06:05 PM
tcpip show inet/stat
(the line with sockets)
I monitor this number and raise an alarm when it goes above 600, which is high for us.
On our side it means a program had connection problems and tried to open new connections without closing the old ones.
Restarting the program solves the problem (the bug was fixed years ago, but the fix is not yet in production).
07-19-2006 05:22 PM
If you can identify "dead" BG devices, you may be able to recover without rebooting or even restarting TCPIP using:
TCPIP> DISCONNECT DEVICE BGnnnn
Of course, how to identify them may be non-trivial...
Please log a case with your customer support centre for deeper diagnosis.
07-23-2006 09:08 PM
As displayed in the sysconfig -q net, the number of bg devices is actually very low (208), and yet Oracle cannot create a TNS listener process.
07-23-2006 09:56 PM
Do you 'believe' that ovms_unit_count value at all times?
Test it with a couple of
$ sysconfig -q net
$ pipe ucx sho dev | sea sys$pipe bg/noout/stat
and look for differences. Maybe ovms_unit_count does not reflect ALL BG devices ?
What do you expect from opening a call, if you have no 'hard evidence', i.e. a forced crash or at least a SHOW DEV BG/FULL from that situation ?
07-23-2006 10:23 PM
the SYSGEN parameter CHANNELCNT limits the no. of channels an individual process can assign.
The system service status SS$_DEVLISTFULL is returned, if you attempt to create a new (BG) device and the maximum no. of active (BG) devices has been reached. In OpenVMS V7.3-2, this limit has been increased from 9999 to 32767.
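To rule out the per-process channel limit, the current CHANNELCNT value can be displayed from DCL. A minimal sketch, assuming only that F$GETSYI is given the system parameter name as its item:

```
$! Display the current per-process channel limit (a SYSGEN parameter).
$ WRITE SYS$OUTPUT "CHANNELCNT = ", F$GETSYI("CHANNELCNT")
```

A process hitting CHANNELCNT fails differently from the DEVLISTFULL case described here, so this check only helps to tell the two limits apart.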
07-25-2006 09:19 AM
I have checked whether OVMS_UNIT_COUNT accurately reflects the number of BG devices
(it does).
Fortunately (!?) this problem resurfaced on our test machine, and I was able to force a crash and obtain a valid dump.
I will be logging a call today.
I'll let you know the outcome.
07-25-2006 06:15 PM
you can easily check the no. of BG devices and the sysconfig net parameters in the dump:
$ ANAL/CRASH dumpfile
SDA> TCPIP SYSCONFIG NET
SDA> TCPIP SHOW DEVICE
SDA> SHOW DEV BG
To re-direct output to a file in SDA, use
SDA> SET OUT/NOINDEX file
Then you could process the file using SEARCH etc. to count BG devices.
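Put together, the sequence might look like the sketch below. The output file name bg_devices.lis is hypothetical, and the "Records matched" figure is only an approximation of the device count, since any heading line that happens to contain "BG" is counted as well.

```
$ ANALYZE/CRASH dumpfile
SDA> SET OUTPUT/NOINDEX bg_devices.lis   ! hypothetical file name
SDA> SHOW DEVICE BG
SDA> EXIT
$! Count the lines mentioning BG in the captured listing.
$ SEARCH/STATISTICS/NOOUTPUT bg_devices.lis "BG"
```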
07-26-2006 11:57 AM
It wasn't BG devices, it was.......WSA devices!
The issue was that the Oracle DBA copied their personal login.com (which had a "set display.../perm" in it) to the main Oracle account.
This created a WSA device every time that a TNS process was run. As the machine had been up for several months, the number of WSA devices crept up to 32770.
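Since DEVLISTFULL turned out not to be specific to BG devices, the counting command used earlier in this thread can be repeated for several device prefixes. The following is a sketch only: the prefix list is an assumption and should be adjusted to the pseudo-devices in use on the system.

```
$! Hypothetical prefix list: sockets, displays, pseudo-terminals, mailboxes.
$ prefixes = "BG,WSA,FTA,MBA"
$ i = 0
$next:
$ p = F$ELEMENT(i, ",", prefixes)
$ IF p .EQS. "," THEN GOTO done      ! F$ELEMENT returns the delimiter past the end
$ WRITE SYS$OUTPUT "Device prefix: ", p
$ PIPE SHOW DEVICE 'p' | SEARCH/STATISTICS SYS$PIPE 'p'/NOOUTPUT
$ i = i + 1
$ GOTO next
$done:
$ EXIT
```

A prefix whose "Records matched" count is in the tens of thousands is the one to investigate.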
Removing the /PERM and adding an IF F$MODE() .NES. "INTERACTIVE" statement to the login.com has resolved the problem.
I think the issue with the Oracle TNS process was that the login.com didn't have a "SET NOON", so the login.com stopped running at the "set display" and ORAUSER was never run. The process then went into a CPU loop until its quota had run out.
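For anyone hitting the same thing, a guarded login.com fragment along these lines is one way to express the fix. It is a sketch, not the actual file from this system: the label name is hypothetical, the node address is left elided as in the original post, and the ORAUSER call is only indicated by a comment.

```
$! Only set up a display for interactive logins; skip it for
$! network, batch and detached processes such as the TNS listener.
$ IF F$MODE() .NES. "INTERACTIVE" THEN GOTO skip_display
$ SET NOON                  ! do not abort LOGIN.COM if SET DISPLAY fails
$ SET DISPLAY/CREATE/NODE="192.168.xx.xx"/TRANSPORT=TCPIP
$ SET ON
$skip_display:
$! ... the rest of LOGIN.COM (e.g. the ORAUSER setup) continues here
```

With SET NOON around the SET DISPLAY, a failure such as DEVLISTFULL no longer stops the rest of the procedure from running.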
My thanks to Sek at HP support.
07-27-2006 04:44 AM
from your Forum Profile:
I have assigned points to 13 of 73 responses to my questions.
-- unassigned questions dating as far back as 2003!
Maybe you can find some time to do some assigning?
Mind, I do NOT say you necessarily need to give lots of points. It is fully up to _YOU_ to decide how many. If you consider an answer is not deserving any points, you can also assign 0 ( = zero ) points, and then that answer will no longer be counted as unassigned.
Consider, that every poster took at least the trouble of posting for you!
To easily find your streams with unassigned points, click your own name somewhere.
This will bring up your profile.
Near the bottom of that page, under the caption "My Question(s)" you will find "questions or topics with unassigned points " Clicking that will give all, and only, your questions that still have unassigned postings.
Thanks on behalf of your Forum colleagues.
PS. - nothing personal in this. I try to post it to everyone with this kind of assignment ratio in this forum. If you have received a posting like this before - please do not take offence - none is intended!
Have one on me.