<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: HP-UX Open Files problem in Operating System - HP-UX</title>
    <link>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955440#M117014</link>
    <description>nflocks hits me initially.  Its set pretty high and I'm wondering why.&lt;BR /&gt;&lt;BR /&gt;Usually we set new kernel parms on the test box and make sure they work right before loosing them in production.&lt;BR /&gt;&lt;BR /&gt;If this is a kernel issue at all, it might be useful to bring production back to where the test machine is where practical and see if the problem recurs.  If not, there's your answer.&lt;BR /&gt;&lt;BR /&gt;You can bump the test box up to here production is and see if the problem mysteriously appears.  This would prove its a kernel problem.&lt;BR /&gt;&lt;BR /&gt;Then its a matter of scaling back the test machine one(with other dependent parameters) at a time until the problem goes away.&lt;BR /&gt;&lt;BR /&gt;It's painstaking and difficult, but I think you'd like to know which kernel parameter is actually causing the problem right?&lt;BR /&gt;&lt;BR /&gt;Good Luck.&lt;BR /&gt;&lt;BR /&gt;SEP</description>
    <pubDate>Mon, 21 Apr 2003 15:36:27 GMT</pubDate>
    <dc:creator>Steven E. Protter</dc:creator>
    <dc:date>2003-04-21T15:36:27Z</dc:date>
    <item>
      <title>HP-UX Open Files problem</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955435#M117009</link>
      <description>Hi there. We have two systems (believed identical) one devel one production. In the devel system we do not experience the open files problem, whereas the production one does... I'll try below to paste the system's kernel parameters (from /stand/system):&lt;BR /&gt;&lt;BR /&gt;DEVEL * Tunable parameters&lt;BR /&gt;STRMSGSZ        65535&lt;BR /&gt;dnlc_hash_locks 512&lt;BR /&gt;max_thread_proc 3000&lt;BR /&gt;maxdsiz         2063835136&lt;BR /&gt;maxdsiz_64bit   0X80000000&lt;BR /&gt;maxfiles        2048&lt;BR /&gt;maxfiles_lim    2048&lt;BR /&gt;maxssiz         0X8000000&lt;BR /&gt;maxssiz_64bit   0X8000000&lt;BR /&gt;maxswapchunks   16384&lt;BR /&gt;maxtsiz         0X4000000&lt;BR /&gt;maxtsiz_64bit   0X40000000&lt;BR /&gt;maxuprc         ((NPROC*9)/10)&lt;BR /&gt;maxusers        512&lt;BR /&gt;msgmni          (NPROC)&lt;BR /&gt;msgseg          32767&lt;BR /&gt;msgtql          (NPROC)&lt;BR /&gt;ncallout        (((NPROC*7)/4)+16)&lt;BR /&gt;ncsize          ((8*NPROC+2048)+VX_NCSIZE)&lt;BR /&gt;nfile           (15*NPROC+8192)&lt;BR /&gt;nflocks         6000&lt;BR /&gt;ninode          (8*NPROC+2048)&lt;BR /&gt;nproc           4096&lt;BR /&gt;nstrpty         60&lt;BR /&gt;semmni          4096&lt;BR /&gt;semmns          (SEMMNI*2)&lt;BR /&gt;semmnu          (NPROC-4)&lt;BR /&gt;shmmax          (0X80000000-0X4000000)&lt;BR /&gt;shmmni          512&lt;BR /&gt;vps_ceiling     64&lt;BR /&gt;&lt;BR /&gt;PROD * Tunable parameters&lt;BR /&gt;STRMSGSZ        65535&lt;BR /&gt;dbc_max_pct     15&lt;BR /&gt;dnlc_hash_locks 512&lt;BR /&gt;max_thread_proc 256&lt;BR /&gt;maxdsiz         2063835136&lt;BR /&gt;maxdsiz_64bit   0X80000000&lt;BR /&gt;maxfiles        8192&lt;BR /&gt;maxfiles_lim    10000&lt;BR /&gt;maxssiz         0X8000000&lt;BR /&gt;maxssiz_64bit   1073741824&lt;BR /&gt;maxswapchunks   16384&lt;BR /&gt;maxtsiz         0X4000000&lt;BR /&gt;maxtsiz_64bit   0X40000000&lt;BR /&gt;maxuprc         ((NPROC*9)/10)&lt;BR /&gt;maxusers        512&lt;BR /&gt;msgmni          (NPROC)&lt;BR /&gt;msgseg          32767&lt;BR /&gt;msgtql          (NPROC)&lt;BR /&gt;nflocks         17744&lt;BR /&gt;ninode          (2*((NPROC+16+MAXUSERS)+32+(2*NPTY)))&lt;BR /&gt;nproc           8192&lt;BR /&gt;nstrpty         60&lt;BR /&gt;semmni          4096&lt;BR /&gt;semmns          (SEMMNI*2)&lt;BR /&gt;semmnu          (NPROC-4)&lt;BR /&gt;shmmax          0X20000000&lt;BR /&gt;shmmni          512&lt;BR /&gt;timezone        (-60)&lt;BR /&gt;vps_ceiling     64&lt;BR /&gt;&lt;BR /&gt;The one system (top - development) does not experience the problem. We managed to find out the open file leak at the end (this is a J2EE Oracle Application Server java web application) and fix it. However, what puzzles us is that in the production system the problem manifested itself from as low as 520 open files (which include the DB connections, 150) whereas in the development system we managed to have more than 650 files open (leaked) and still no problems. We used a utility named "lsof" which we compiled on HPUX to find the processes' open files (for the JVM).&lt;BR /&gt;From the kernel parameters we've upped both the NINODE, NPROC, NFILE, NFLOCKS, MAXFILES and MAXFILES_LIM. Although in the production we have MAXFILES_LIM = 10.000, it still dies compared to the development which is as low as 2048 (see above).&lt;BR /&gt;&lt;BR /&gt;Any ideas anyone? Does this configuration have something we've misdjudged in the production and we might need to rectify soon?? Why although the development server seems not as configured, still manages to handle more?&lt;BR /&gt;&lt;BR /&gt;Thank you&lt;BR /&gt;</description>
      <pubDate>Mon, 21 Apr 2003 12:55:48 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955435#M117009</guid>
      <dc:creator>Thanos Agelatos</dc:creator>
      <dc:date>2003-04-21T12:55:48Z</dc:date>
    </item>
    <item>
      <title>Re: HP-UX Open Files problem</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955436#M117010</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;What is the setting for nfile on the production system?  I don't see that value listed.&lt;BR /&gt;&lt;BR /&gt;The 'lsof' utility should have given you a good idea of what the differences were.  Did it show if the processes on the production system had more files open per process than on the development system?&lt;BR /&gt;&lt;BR /&gt;JP&lt;BR /&gt;</description>
      <pubDate>Mon, 21 Apr 2003 13:05:03 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955436#M117010</guid>
      <dc:creator>John Poff</dc:creator>
      <dc:date>2003-04-21T13:05:03Z</dc:date>
    </item>
    <item>
      <title>Re: HP-UX Open Files problem</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955437#M117011</link>
      <description>hi,&lt;BR /&gt;&lt;BR /&gt;some general pointers -&lt;BR /&gt;&lt;BR /&gt;- Are both these machines at the same patch levels?&lt;BR /&gt;&lt;BR /&gt;- are there any other applications running on the other machine which has an "open files" problem?&lt;BR /&gt;&lt;BR /&gt;- what is lsof's output? that would help to analyze the problem.&lt;BR /&gt;&lt;BR /&gt;use this script that i'd earlier posted in the forums, to find out total no. of files open by a process -&lt;BR /&gt;&lt;BR /&gt;lsof | awk '{procct[$1]++;procname[$1]=$1;}END{for (i in procname) {printf ("%s, %d\n",procname[i],procct[i]);}}' | tr -s " " | sort -t" " -n -r -k 2,2 &lt;BR /&gt;&lt;BR /&gt;check the process that has too many files open on the system following the oracle jvm based app. see if that too has a memory leak anywhere.&lt;BR /&gt;&lt;BR /&gt;FWIW.&lt;BR /&gt;&lt;BR /&gt;- ramd.</description>
      <pubDate>Mon, 21 Apr 2003 13:07:48 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955437#M117011</guid>
      <dc:creator>Ramkumar Devanathan</dc:creator>
      <dc:date>2003-04-21T13:07:48Z</dc:date>
    </item>
    <item>
      <title>Re: HP-UX Open Files problem</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955438#M117012</link>
      <description>I don't see the same kernel parameters or values on both systems, 'dbc_max_pct 15' for example only appears in the bottom list.&lt;BR /&gt;&lt;BR /&gt;What model servers, for both please?&lt;BR /&gt;&lt;BR /&gt;This sounds more like a patching issue also, which can be corrected by cloning dev onto prod.  Run some patch counts on both:&lt;BR /&gt;&lt;BR /&gt;swlist -l fileset | wc&lt;BR /&gt;&lt;BR /&gt;Use sar to measure usage of open files, inodes and processes:&lt;BR /&gt;&lt;BR /&gt;sar -v 5 5</description>
      <pubDate>Mon, 21 Apr 2003 13:08:38 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955438#M117012</guid>
      <dc:creator>Michael Steele_2</dc:creator>
      <dc:date>2003-04-21T13:08:38Z</dc:date>
    </item>
    <item>
      <title>Re: HP-UX Open Files problem</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955439#M117013</link>
      <description>First of all, do you have more people on production than development.  &lt;BR /&gt;&lt;BR /&gt;It may actually deal with a calculated field, but is appearing as a file problem.  &lt;BR /&gt;&lt;BR /&gt;I would suggest running sar and checking to see which of your metrics is really killing you.&lt;BR /&gt;&lt;BR /&gt;Our /stand/system resembles this&lt;BR /&gt;* Tunable parameters&lt;BR /&gt;&lt;BR /&gt;STRMSGSZ        65535&lt;BR /&gt;bufpages        0&lt;BR /&gt;dbc_max_pct     10&lt;BR /&gt;fs_async        1&lt;BR /&gt;maxdsiz         0X20000000&lt;BR /&gt;maxdsiz_64bit   0X20000000&lt;BR /&gt;maxfiles        2048&lt;BR /&gt;maxfiles_lim    2048&lt;BR /&gt;maxssiz         0X2000000&lt;BR /&gt;maxssiz_64bit   0X2000000&lt;BR /&gt;maxswapchunks   12288&lt;BR /&gt;maxtsiz         0X20000000&lt;BR /&gt;maxuprc         ((NPROC*8)/10)&lt;BR /&gt;maxusers        1200&lt;BR /&gt;maxvgs          256&lt;BR /&gt;msgmap          (MSGTQL+2)&lt;BR /&gt;msgmax          32768&lt;BR /&gt;msgmnb          65535&lt;BR /&gt;msgmni          (NPROC)&lt;BR /&gt;msgseg          (MSGTQL*4)&lt;BR /&gt;msgssz          128&lt;BR /&gt;msgtql          (NPROC)&lt;BR /&gt;nfile           (15*NPROC+2048)&lt;BR /&gt;nflocks         (NPROC)&lt;BR /&gt;ninode          (8*NPROC+2048)&lt;BR /&gt;nproc           (((10*MAXUSERS)/3)+128)&lt;BR /&gt;nstrpty         60&lt;BR /&gt;nstrtel         (MAXUSERS)&lt;BR /&gt;nswapdev        25&lt;BR /&gt;semmni          (NPROC*2)&lt;BR /&gt;semmns          (SEMMNI*2)&lt;BR /&gt;semmnu          (NPROC-4)&lt;BR /&gt;semume          64&lt;BR /&gt;semvmx          32768&lt;BR /&gt;shmmax          0X40000000&lt;BR /&gt;shmmni          512&lt;BR /&gt;shmseg          32&lt;BR /&gt;swapmem_on      0&lt;BR /&gt;timeslice       1&lt;BR /&gt;unlockable_mem  (MAXUSERS*10)&lt;BR /&gt;&lt;BR /&gt;What I am suspecting is that you are actually bumping your head somewhere else.&lt;BR /&gt;&lt;BR /&gt;Like I said, this is in probability a parallel problem...</description>
      <pubDate>Mon, 21 Apr 2003 13:18:52 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955439#M117013</guid>
      <dc:creator>Tim Sanko</dc:creator>
      <dc:date>2003-04-21T13:18:52Z</dc:date>
    </item>
    <item>
      <title>Re: HP-UX Open Files problem</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955440#M117014</link>
      <description>nflocks hits me initially.  Its set pretty high and I'm wondering why.&lt;BR /&gt;&lt;BR /&gt;Usually we set new kernel parms on the test box and make sure they work right before loosing them in production.&lt;BR /&gt;&lt;BR /&gt;If this is a kernel issue at all, it might be useful to bring production back to where the test machine is where practical and see if the problem recurs.  If not, there's your answer.&lt;BR /&gt;&lt;BR /&gt;You can bump the test box up to here production is and see if the problem mysteriously appears.  This would prove its a kernel problem.&lt;BR /&gt;&lt;BR /&gt;Then its a matter of scaling back the test machine one(with other dependent parameters) at a time until the problem goes away.&lt;BR /&gt;&lt;BR /&gt;It's painstaking and difficult, but I think you'd like to know which kernel parameter is actually causing the problem right?&lt;BR /&gt;&lt;BR /&gt;Good Luck.&lt;BR /&gt;&lt;BR /&gt;SEP</description>
      <pubDate>Mon, 21 Apr 2003 15:36:27 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955440#M117014</guid>
      <dc:creator>Steven E. Protter</dc:creator>
      <dc:date>2003-04-21T15:36:27Z</dc:date>
    </item>
    <item>
      <title>Re: HP-UX Open Files problem</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955441#M117015</link>
      <description>Hi all&lt;BR /&gt;&lt;BR /&gt;Thanks for your prompt replies - I'll take the questions you asked one by one:&lt;BR /&gt;&lt;BR /&gt;NFILE on production system:&lt;BR /&gt;14344 (based on formulae - I believe its the default: &lt;BR /&gt;(16*(NPROC+16+MAXUSERS)/10+32+2*(NPTY+NSTRPTY+NSTRTEL))&lt;BR /&gt;&lt;BR /&gt;NFILE on development:&lt;BR /&gt;69632 (15*NPROC+8192)&lt;BR /&gt;&lt;BR /&gt;dbc_max_pct on PROD (missing from the list) is 15 again. Dont know why it didnt show.&lt;BR /&gt;&lt;BR /&gt;sar -v 5 5 on PROD:&lt;BR /&gt;root@appsrv2#sar -v 5 5&lt;BR /&gt;&lt;BR /&gt;HP-UX appsrv2 B.11.11 U 9000/800    04/22/03&lt;BR /&gt;&lt;BR /&gt;09:03:11 text-sz  ov  proc-sz  ov  inod-sz  ov  file-sz  ov&lt;BR /&gt;09:03:16   N/A   N/A 137/8192  0  768/17744 0  1853/14354 0&lt;BR /&gt;09:03:21   N/A   N/A 137/8192  0  769/17744 0  1855/14354 0&lt;BR /&gt;09:03:26   N/A   N/A 137/8192  0  769/17744 0  1856/14354 0&lt;BR /&gt;&lt;BR /&gt;sar -v 5 5 on DEVEL:&lt;BR /&gt;# sar -v 5 5 &lt;BR /&gt;&lt;BR /&gt;HP-UX io B.11.11 U 9000/800    04/22/03&lt;BR /&gt;&lt;BR /&gt;11:07:35 text-sz  ov  proc-sz  ov  inod-sz  ov  file-sz  ov &lt;BR /&gt;11:07:40   N/A   N/A 217/4096  0  1158/34816 0  1536/69642 0&lt;BR /&gt;11:07:45   N/A   N/A 217/4096  0  1158/34816 0  1536/69642 0&lt;BR /&gt;11:07:50   N/A   N/A 217/4096  0  1158/34816 0  1536/69642 0&lt;BR /&gt;&lt;BR /&gt;about the patchlists:&lt;BR /&gt;"swlist -l fileset | wc "&lt;BR /&gt;on DEVEL: 1348&lt;BR /&gt;on PROD : 1319&lt;BR /&gt;&lt;BR /&gt;machine specs: &lt;BR /&gt;DEVEL: rp2405, 2cpu, 2GB ram&lt;BR /&gt;PROD : rp5405, 2cpu, 4GB ram&lt;BR /&gt;&lt;BR /&gt;Machines are different, but both have been patched with the  GOLD patchset of December 2002, and also the Java-Out-Of-The-Box patchset, apparently this was installed on development initially and not production and just by putting it on we gained some 10-15% performance increase. One other reason why kernel param numbers are not so in sync is that I've been on the production site for the last 3 weeks and we were "sort of" applying changes direct to the servers there (these are 3 servers for production, same machine.) Still, the 3 prod servers managed to hit the mark MUCH SOONER than the development one which has also 3 DB _instances_ (oracle 9i) running on it.&lt;BR /&gt;&lt;BR /&gt;Hope this information is more help - I've managed to find out the leaks on the application (it was some gif/jpg serving servlet that was not closing its files) using the lsof utility. Our question still remains: why the prod systems hit the mark sooner (apart from the traffic which was higher on production obviously). What other parameters do we need to tune/check for the production systems? Running java-out-of-the-box has already tuned the threads etc (or so it says). Any other pointers?&lt;BR /&gt;&lt;BR /&gt;Finally I want to thank again all of you for the replies - this is my first post in these forums and I've got solaris and linux experience - never before touched the hp-ux beasts... &lt;BR /&gt;&lt;BR /&gt;Thanos&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 22 Apr 2003 06:08:45 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955441#M117015</guid>
      <dc:creator>Thanos Agelatos</dc:creator>
      <dc:date>2003-04-22T06:08:45Z</dc:date>
    </item>
    <item>
      <title>Re: HP-UX Open Files problem</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955442#M117016</link>
      <description>Thanos,&lt;BR /&gt;&lt;BR /&gt;in principle, if you are running one development system and one production system, and if your poduction system is critical, you should have the same hardware and the same software installations and configurations for both, production and development systems.&lt;BR /&gt;&lt;BR /&gt;In your case, hardware is not the same. But, this seems not to be critical, since your production system looks better than the development system. Nevertheless, if it is possible, in the future, upgrade your development systems hardware, to have the same environment as on the production system.&lt;BR /&gt;&lt;BR /&gt;Your software installations may be the same on both systems (I don't know), regardless of the small difference in installed filesets.&lt;BR /&gt;&lt;BR /&gt;At least your kernel configuration is not the same. E.g. your development system has set max_thread_proc to 3000, but on your production system it is set to 256. This seems much too small. I did not check the remainder of your kernel parameter list. But you should do so. Don't try to figure out on the individual parameter, how that could influence your specific problem. Just be sure to have the *same* configuration on both systems. After doing so, go on on analyzing if your problem will still exist.</description>
      <pubDate>Tue, 22 Apr 2003 08:53:21 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955442#M117016</guid>
      <dc:creator>Thomas Schler_1</dc:creator>
      <dc:date>2003-04-22T08:53:21Z</dc:date>
    </item>
    <item>
      <title>Re: HP-UX Open Files problem</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955443#M117017</link>
      <description>I'd say both of your servers are in danger of grinding to a halt from system memory fragmentation.  Certainly performance will degrade as you get farther from a reboot.  Consider these numbers for your production server:&lt;BR /&gt;&lt;BR /&gt;sar -v 5 5 on PROD: &lt;BR /&gt;root@appsrv2#sar -v 5 5 &lt;BR /&gt;&lt;BR /&gt;HP-UX appsrv2 B.11.11 U 9000/800 04/22/03 &lt;BR /&gt;&lt;BR /&gt;09:03:11 text-sz ov proc-sz ov inod-sz ov file-sz ov &lt;BR /&gt;09:03:16 N/A N/A 137/8192 0 768/17744 0 1853/14354 0 &lt;BR /&gt;09:03:21 N/A N/A 137/8192 0 769/17744 0 1855/14354 0 &lt;BR /&gt;09:03:26 N/A N/A 137/8192 0 769/17744 0 1856/14354 0 &lt;BR /&gt;&lt;BR /&gt;137/8192 = 1.67% utilized for nproc&lt;BR /&gt;&lt;BR /&gt;768/17744 = 4.3% utilized for ninode&lt;BR /&gt;&lt;BR /&gt;1853/14354 = 12.9% utilized for nfile&lt;BR /&gt;&lt;BR /&gt;If these numbers are indicative of your peak usage times during the day then this is a very lightly used server and you should adjust the above kernel parameters until 25% to 50% utilized numbers are indicated. &lt;BR /&gt;&lt;BR /&gt;Your dev server belongs to the A class while your prod server belongs to an L class and they will require different configuring.  You can't clone dev onto prod with ignite for instance.  Consider this link about the HW differences:&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://www.hp.com/products1/servers/compare_pa-risc.html" target="_blank"&gt;http://www.hp.com/products1/servers/compare_pa-risc.html&lt;/A&gt; &lt;BR /&gt;&lt;BR /&gt;Finally, your open file problem:  From you description in your second posting its sounds like a rogue runaway process which is a cleanup issue.  These are often hard to identify but if you're using shared memory then ipcs and ipcrm will help, as the owner and PID are indicated.&lt;BR /&gt;&lt;BR /&gt;ipcs -ma&lt;BR /&gt;&lt;BR /&gt;ipcrm -m key / shmid&lt;BR /&gt;&lt;BR /&gt;Although this won't kill a rogue process that's gone to init (* reboot *) it will help you evaluate and release your shared memory.</description>
      <pubDate>Tue, 22 Apr 2003 13:55:55 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/hp-ux-open-files-problem/m-p/2955443#M117017</guid>
      <dc:creator>Michael Steele_2</dc:creator>
      <dc:date>2003-04-22T13:55:55Z</dc:date>
    </item>
  </channel>
</rss>

