<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: System hanging in Operating System - HP-UX</title>
    <link>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631334#M849786</link>
    <description>Well, since the system is up and running since 248 days, there might be some memory leaks  hanging out. You can use ipcs -mob and look at NATTCH and look out for non root segments having 0's. This might indicate that they were not released properly. You can use ipcrm to remove them. But I would suggest to halt application and check it up again to see if they were really them. If you don't want to get into trouble, you can reboot the system. &lt;BR /&gt;&lt;BR /&gt;You certainly need to consider decreasing dbc_max_pct value. You can make it around 15% and watch for %wio in sar -u and %rcache and %wcache in sar -b.&lt;BR /&gt;&lt;BR /&gt;dbc_mac_pct has to be adjusted according to the pattern you see. Typically %rcache &amp;gt; 90 and %wcache &amp;gt; 80. If you have %wio &amp;gt; 20, then your system is bottlenecking on IO.&lt;BR /&gt;&lt;BR /&gt;You can start with 15% of dbc_max_pct and observe the above. &lt;BR /&gt;&lt;BR /&gt;This needs reboot. So, this will also fix if you have any memory leaks on the box.&lt;BR /&gt;&lt;BR /&gt;Take a make_recovery tape of the system before rebooting. I have seen systems with this much uptime not come up properly. Also run /sbin/init.d/clean_ex start before rebooting the box as it will take a long time here during the bootup.&lt;BR /&gt;&lt;BR /&gt;-Sri</description>
    <pubDate>Thu, 13 Dec 2001 20:36:10 GMT</pubDate>
    <dc:creator>Sridhar Bhaskarla</dc:creator>
    <dc:date>2001-12-13T20:36:10Z</dc:date>
    <item>
      <title>System hanging</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631328#M849780</link>
      <description>&lt;BR /&gt;Running HP-UX 11.0 on an N-class server. We've been experiencing system hanging problems recently, where the system seems to hang up for several seconds, then is ok.  GPM shows Memory Wait Queue spiking during these periods, jumping from 0 processes waiting to up to 32 processes waiting on mem resources.  It remains like that for a number of seconds -- enough to cause user complaints.&lt;BR /&gt;Swapinfo shows: &lt;BR /&gt;             Kb      Kb      Kb   PCT  START/      Kb&lt;BR /&gt;TYPE      AVAIL    USED    FREE  USED   LIMIT RESERVE  PRI  NAME&lt;BR /&gt;dev      512000       0  512000    0%       0       -    1  /dev/vgroot/swap&lt;BR /&gt;dev     3072000  212884 2859116    7%       0       -    0  /dev/vg01/swap&lt;BR /&gt;dev     3072000  215900 2856100    7%       0       -    0  /dev/vg02/swap&lt;BR /&gt;dev     3072000  215884 2856116    7%       0       -    0  /dev/vg03/swap&lt;BR /&gt;dev     3072000  217864 2854136    7%       0       -    0  /dev/vg05/swap&lt;BR /&gt;reserve       - 3163608 -3163608&lt;BR /&gt;memory  2786380 1437704 1348676   52%&lt;BR /&gt;&lt;BR /&gt;I dont think we have a shortage of memory; almost no paging out at all, and almost no VM writing to disk in gpm. I've used the "UNIX95= ps -e -o "user,vsz,pid,ppid,args" | sort -rnk2" command, and output to files during "normal" periods, and during "wait" periods.  Looking at the top mem users, it appears that when a wait period begins, a new process appears in the high-mem-usage end of the listing, but when the wait period ends, thoses processes still remain.  This doesnt seems like unusual behavior that should suddenly cause obvious performance issues.  How can I identify what the exact cause is of mem que waits?  Then, how do I go about correcting it?&lt;BR /&gt;Any help would be appreciated.</description>
      <pubDate>Thu, 13 Dec 2001 19:24:10 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631328#M849780</guid>
      <dc:creator>Evelyn Daroga</dc:creator>
      <dc:date>2001-12-13T19:24:10Z</dc:date>
    </item>
    <item>
      <title>Re: System hanging</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631329#M849781</link>
      <description>Hi Evelyn,&lt;BR /&gt;&lt;BR /&gt;Looking at your swapinfo, your system is running low on memory. This is because the reserve is around 3GB which gives a good idea of the processes occupying around it. And you can see almost 1 GB of pages sitting on the swap. When a new process is forked, obviously some processes have to be paged out to accommodate it's private area. What does your vmstat -s 's cumulative page out value?. &lt;BR /&gt;&lt;BR /&gt;If you are using the default buffer cache of 50%, you may want to get it lower by altering&lt;BR /&gt;dbc_max_pct. Or get the system some more memory like 2GB extra.&lt;BR /&gt;&lt;BR /&gt;-Sri&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 13 Dec 2001 19:36:43 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631329#M849781</guid>
      <dc:creator>Sridhar Bhaskarla</dc:creator>
      <dc:date>2001-12-13T19:36:43Z</dc:date>
    </item>
    <item>
      <title>Re: System hanging</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631330#M849782</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;You might find this doc is useful..&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://us-support2.external.hp.com/cki/bin/doc.pl/sid=f704aad410450a6e52/screen=ckiDisplayDocument?docId=200000056493259" target="_blank"&gt;http://us-support2.external.hp.com/cki/bin/doc.pl/sid=f704aad410450a6e52/screen=ckiDisplayDocument?docId=200000056493259&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;-USA..</description>
      <pubDate>Thu, 13 Dec 2001 19:55:20 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631330#M849782</guid>
      <dc:creator>Uday_S_Ankolekar</dc:creator>
      <dc:date>2001-12-13T19:55:20Z</dc:date>
    </item>
    <item>
      <title>Re: System hanging</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631331#M849783</link>
      <description>Thanks for the quick reply!  vmstat -s shows the following for page stats:&lt;BR /&gt;438883 pages swapped out&lt;BR /&gt;16647831 page outs&lt;BR /&gt;39011422 pages paged out&lt;BR /&gt;472405557 pages scanned for page out&lt;BR /&gt;But, are these values cumulative since the last reboot?  The system has been up for 248 days.  Would a reboot free up some mem?  Also, the dbc_max_pct is at 50%.  Would a reduction to 30% be reasonable?  Thanks again for your help -- sys tuning is not my specialty!!!</description>
      <pubDate>Thu, 13 Dec 2001 19:55:52 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631331#M849783</guid>
      <dc:creator>Evelyn Daroga</dc:creator>
      <dc:date>2001-12-13T19:55:52Z</dc:date>
    </item>
    <item>
      <title>Re: System hanging</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631332#M849784</link>
      <description>Thanks, also, Uday, for the doc reference.  I have printed it out, and will certainly study it. I need all the help I can get!</description>
      <pubDate>Thu, 13 Dec 2001 20:04:18 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631332#M849784</guid>
      <dc:creator>Evelyn Daroga</dc:creator>
      <dc:date>2001-12-13T20:04:18Z</dc:date>
    </item>
    <item>
      <title>Re: System hanging</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631333#M849785</link>
      <description>Hi Evelyn,&lt;BR /&gt;&lt;BR /&gt;Check out this thread.  James attached several good documents, including the one stated above.&lt;BR /&gt;&lt;BR /&gt;...jcd...</description>
      <pubDate>Thu, 13 Dec 2001 20:35:34 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631333#M849785</guid>
      <dc:creator>Joseph C. Denman</dc:creator>
      <dc:date>2001-12-13T20:35:34Z</dc:date>
    </item>
    <item>
      <title>Re: System hanging</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631334#M849786</link>
      <description>Well, since the system is up and running since 248 days, there might be some memory leaks  hanging out. You can use ipcs -mob and look at NATTCH and look out for non root segments having 0's. This might indicate that they were not released properly. You can use ipcrm to remove them. But I would suggest to halt application and check it up again to see if they were really them. If you don't want to get into trouble, you can reboot the system. &lt;BR /&gt;&lt;BR /&gt;You certainly need to consider decreasing dbc_max_pct value. You can make it around 15% and watch for %wio in sar -u and %rcache and %wcache in sar -b.&lt;BR /&gt;&lt;BR /&gt;dbc_mac_pct has to be adjusted according to the pattern you see. Typically %rcache &amp;gt; 90 and %wcache &amp;gt; 80. If you have %wio &amp;gt; 20, then your system is bottlenecking on IO.&lt;BR /&gt;&lt;BR /&gt;You can start with 15% of dbc_max_pct and observe the above. &lt;BR /&gt;&lt;BR /&gt;This needs reboot. So, this will also fix if you have any memory leaks on the box.&lt;BR /&gt;&lt;BR /&gt;Take a make_recovery tape of the system before rebooting. I have seen systems with this much uptime not come up properly. Also run /sbin/init.d/clean_ex start before rebooting the box as it will take a long time here during the bootup.&lt;BR /&gt;&lt;BR /&gt;-Sri</description>
      <pubDate>Thu, 13 Dec 2001 20:36:10 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631334#M849786</guid>
      <dc:creator>Sridhar Bhaskarla</dc:creator>
      <dc:date>2001-12-13T20:36:10Z</dc:date>
    </item>
    <item>
      <title>Re: System hanging</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631335#M849787</link>
      <description>Ok, ipcs -mob indicates only one non-root process with a 0 for NATTCH.  SEGSIZE is 1771332. I think I know what it is, and will remove it tomorrow (dont want to today, just in case it bothers nightly processing).  I will adjust the dbc_max_pct to 15%, and watch the items mentioned.  I am unable to reboot until the weekend.  I do a make_recovery every night, so will have one on hand, in case. I'll have to let this wait till next week, but will look at anything else anyone might suggest in the meantime.  Thanks for all the suggestions.</description>
      <pubDate>Thu, 13 Dec 2001 21:11:03 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631335#M849787</guid>
      <dc:creator>Evelyn Daroga</dc:creator>
      <dc:date>2001-12-13T21:11:03Z</dc:date>
    </item>
    <item>
      <title>Re: System hanging</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631336#M849788</link>
      <description>hi,&lt;BR /&gt;&lt;BR /&gt; Regarding tracking current pageouts,  run vmstat -n and it will list it out under po column .&lt;BR /&gt;&lt;BR /&gt;for eg:&lt;BR /&gt;#vmstat -n 5 5&lt;BR /&gt;VM&lt;BR /&gt;       memory                     page                          faults&lt;BR /&gt;     avm    free   re   at    pi   po    fr   de    sr     in     sy    cs  &lt;BR /&gt;   96439  1426934  140   31     1    0    46    0     0   5642  12263  3923&lt;BR /&gt;&lt;BR /&gt;***&lt;BR /&gt;&lt;BR /&gt;Also see the usage of vhand and swapper:&lt;BR /&gt;ps -ef |grep vhand&lt;BR /&gt;&lt;BR /&gt;IF they are accumulating CPU time rapidly , then it is doing too much of memory swapping, which is not a good thing.&lt;BR /&gt;&lt;BR /&gt;HTH&lt;BR /&gt;raj</description>
      <pubDate>Thu, 13 Dec 2001 22:56:21 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631336#M849788</guid>
      <dc:creator>Roger Baptiste</dc:creator>
      <dc:date>2001-12-13T22:56:21Z</dc:date>
    </item>
    <item>
      <title>Re: System hanging</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631337#M849789</link>
      <description>Changed the dbc_max_pct to 15 and rebooted Sat night.  After about 7 hrs "normal" usage, swapinfo shows:&lt;BR /&gt;             Kb      Kb      Kb   PCT  START/      Kb&lt;BR /&gt;TYPE      AVAIL    USED    FREE  USED   LIMIT RESERVE  PRI  NAME&lt;BR /&gt;dev      512000       0  512000    0%       0       -    1  /dev/vgroot/swap&lt;BR /&gt;dev     3072000   58212 3013788    2%       0       -    0  /dev/vg01/swap&lt;BR /&gt;dev     3072000   57816 3014184    2%       0       -    0  /dev/vg02/swap&lt;BR /&gt;dev     3072000   58444 3013556    2%       0       -    0  /dev/vg03/swap&lt;BR /&gt;dev     3072000   58424 3013576    2%       0       -    0  /dev/vg05/swap&lt;BR /&gt;reserve       - 3729136 -3729136&lt;BR /&gt;memory  2786348  655384 2130964   24%&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Now I remember why I havent been using sar... &lt;BR /&gt;Since upgrading to newer system, and to HPUX 11.0, sar is broken.  It wants /var/adm/sa/sa17 which does not exist on the system.  Even the dir /var/adm/sa does not exist.  Do I need to reinstall something??</description>
      <pubDate>Mon, 17 Dec 2001 20:30:32 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631337#M849789</guid>
      <dc:creator>Evelyn Daroga</dc:creator>
      <dc:date>2001-12-17T20:30:32Z</dc:date>
    </item>
    <item>
      <title>Re: System hanging</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631338#M849790</link>
      <description>Hi Evelyn,&lt;BR /&gt;&lt;BR /&gt;Well, you need /var/adm/sa directory if you want to store the SAR data. For runtime data, you can specify the interval (i) and the iterations (j).&lt;BR /&gt;&lt;BR /&gt;sar -d i j&lt;BR /&gt;&lt;BR /&gt;For ex., if you say sar -d 2 20, it is goign to display the data every 2 seconds for 20 values.&lt;BR /&gt;&lt;BR /&gt;You are still swapping means your application needs more memory resources. It's not a bad idea to keep it like this, but memory isn't very costly to upgrade. I wouldn't like to see any swap usage on my systems.&lt;BR /&gt;&lt;BR /&gt;-Sri&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Mon, 17 Dec 2001 20:45:14 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631338#M849790</guid>
      <dc:creator>Sridhar Bhaskarla</dc:creator>
      <dc:date>2001-12-17T20:45:14Z</dc:date>
    </item>
    <item>
      <title>Re: System hanging</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631339#M849791</link>
      <description>Thank you, Sri.  I didn't get that sar was looking for "sa17" yesterday, and yesterday was the 17th.  Today it wants sa18.  Now I get it! (duh!)  So, I created the dir, and I also had to create sa18, and now it doesnt complain.  I dont think upgrading mem is an option right now, but at least we're not hanging users like we were last week.  I really appreciate all the help and suggestions.  I'll be reviewing the docs mentioned, and hopefully do a better job of monitoring this sort of thing.</description>
      <pubDate>Tue, 18 Dec 2001 13:21:01 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/system-hanging/m-p/2631339#M849791</guid>
      <dc:creator>Evelyn Daroga</dc:creator>
      <dc:date>2001-12-18T13:21:01Z</dc:date>
    </item>
  </channel>
</rss>

