<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Need help troubleshooting performance issue in Operating System - HP-UX</title>
    <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207262#M464299</link>
    <description>I don't seem to be able to open the file, so I'm going to try an Excel 2003 format.</description>
    <pubDate>Mon, 02 Nov 2009 22:53:06 GMT</pubDate>
    <dc:creator>Tony Williams</dc:creator>
    <dc:date>2009-11-02T22:53:06Z</dc:date>
    <item>
      <title>Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207250#M464287</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;We had a performance problem on Friday where we brought a server to its knees; logins took 10 minutes to process. The problem lasted 30 minutes until we stopped a few Oracle processes. The problem was 100% CPU utilization with a global priority queue of 120, a memory queue of 60, and a disk queue of 20. I'm trying to find out specifically what the cause was using HP OV Performance Manager. When I look at process data I can see lots of processes blocked on PRI and VM. I can see that my root/swap disks are hot.&lt;BR /&gt;&lt;BR /&gt;I looked at swap: there were no pageouts and the swapout rate was 0. Global Disk VM IO is higher than normal but not by much (30,000 to 165,000), and Global Pageins are high, at 16,000 to 20,000.&lt;BR /&gt;&lt;BR /&gt;I'm having a bit of a problem pinpointing where the problem started or came from.&lt;BR /&gt;&lt;BR /&gt;Any help would be appreciated.</description>
      <pubDate>Mon, 02 Nov 2009 15:54:05 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207250#M464287</guid>
      <dc:creator>Tony Williams</dc:creator>
      <dc:date>2009-11-02T15:54:05Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207251#M464288</link>
      <description>Shalom,&lt;BR /&gt;&lt;BR /&gt;Looks like a process or two was bound to CPU and not playing nice with other processes.&lt;BR /&gt;&lt;BR /&gt;To see the issue in real time, you would want to run glance or gpm and see what processes are running at the time.&lt;BR /&gt;&lt;BR /&gt;Some part of this data is lying to you.&lt;BR /&gt;&lt;BR /&gt;You say root and swap disks were hot but you got no paging. This is not telling a consistent story.&lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://www.hpux.ws/?p=6" target="_blank"&gt;http://www.hpux.ws/?p=6&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;I'd set up a collection run with the script above to see if you can spot anything.&lt;BR /&gt;&lt;BR /&gt;The script takes a top snapshot that might help you identify the processes.&lt;BR /&gt;&lt;BR /&gt;SEP</description>
      <pubDate>Mon, 02 Nov 2009 15:59:21 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207251#M464288</guid>
      <dc:creator>Steven E. Protter</dc:creator>
      <dc:date>2009-11-02T15:59:21Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207252#M464289</link>
      <description>Thanks Steven. Sorry, I should have said there are no page-outs, but there are a high number of page-ins, 200,000 to 800,000.</description>
      <pubDate>Mon, 02 Nov 2009 16:15:58 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207252#M464289</guid>
      <dc:creator>Tony Williams</dc:creator>
      <dc:date>2009-11-02T16:15:58Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207253#M464290</link>
      <description>Before you stopped the Oracle processes, was there a large amount of swap actually consumed (not reserved)?&lt;BR /&gt;&lt;BR /&gt;What you're describing sounds a lot like the Oracle processes produced sufficient memory pressure such that a large number of deactivations occurred (due to insufficient memory and lower priority than Oracle and affiliates). When Oracle went away, free memory rose -- and processes began to be reactivated. In your case, it sounds like reactivation in a "thundering herd" such that the scheduler and swap-in paths got swamped trying to handle all the new scheduling/paging requests of the herd coming back to life -- and logins suffered under the contention.&lt;BR /&gt;&lt;BR /&gt;What OS version is this? What are your core kernel Process and Virtual Memory Management patch levels? (Deactivation/reactivation isn't a path that gets stressed that much on performant systems, but I remember some work touching on that space such that patches may be relevant). Was there any pattern with the scheduling priority of the reactivated processes relative to your login/shell priorities? (Reactivation should be a more gradual thing -- if for no other reason, to ensure the memory pressure doesn't come right back so the system doesn't just thrash, but if all the deactivations were for higher-than-shell, but lower-than-Oracle priority, I can imagine a herd forming...)</description>
      <pubDate>Mon, 02 Nov 2009 16:50:01 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207253#M464290</guid>
      <dc:creator>Don Morris_1</dc:creator>
      <dc:date>2009-11-02T16:50:01Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207254#M464291</link>
      <description>Hi&lt;BR /&gt;&lt;BR /&gt;So you have a classic race condition developed by your application (* very likely *) and you can identify the responsible pid by collecting data over time with a 15-minute cron job.&lt;BR /&gt;&lt;BR /&gt;Refer to the 'ps' man page and the -o option, especially pcpu, vsz and comm, then collect the data in an outfile.&lt;BR /&gt;&lt;BR /&gt;UNIX95=1 ps -ef -o pcpu,state,pid,ppid,comm | sort -rn | head -15&lt;BR /&gt;&lt;BR /&gt;UNIX95=1 ps -ef -o vsz,state,pid,ppid,comm | sort -rn | head -15&lt;BR /&gt;&lt;BR /&gt;vmstat 5 5&lt;BR /&gt;&lt;BR /&gt;sar -d 5 5 (* disk bottlenecks *)&lt;BR /&gt;&lt;BR /&gt;And any other command that you'd like to check.</description>
      <pubDate>Mon, 02 Nov 2009 17:26:17 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207254#M464291</guid>
      <dc:creator>Michael Steele_2</dc:creator>
      <dc:date>2009-11-02T17:26:17Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207255#M464292</link>
      <description>... Before you stopped the Oracle processes, was there a large amount of swap actually consumed (not reserved)?&lt;BR /&gt;&lt;BR /&gt;No, we see about 1% on a system with 256GB of main memory. &lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;What OS version is this? &lt;BR /&gt;11.23</description>
      <pubDate>Mon, 02 Nov 2009 17:28:19 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207255#M464292</guid>
      <dc:creator>Tony Williams</dc:creator>
      <dc:date>2009-11-02T17:28:19Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207256#M464293</link>
      <description>Shalom,&lt;BR /&gt;&lt;BR /&gt;swapinfo -tam&lt;BR /&gt;&lt;BR /&gt;I'd like to see the ratio of main memory to swap. I've been running into some issues here where swap reservation requests are failing on a very large system.&lt;BR /&gt;&lt;BR /&gt;Might be able to share some insights.&lt;BR /&gt;&lt;BR /&gt;SEP</description>
      <pubDate>Mon, 02 Nov 2009 17:35:23 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207256#M464293</guid>
      <dc:creator>Steven E. Protter</dc:creator>
      <dc:date>2009-11-02T17:35:23Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207257#M464294</link>
      <description>The 1% on the device line was from our problem period; we are normally at 0.&lt;BR /&gt;&lt;BR /&gt;            Mb      Mb      Mb   PCT  START/      Mb&lt;BR /&gt;TYPE      AVAIL    USED    FREE  USED   LIMIT RESERVE  PRI  NAME&lt;BR /&gt;dev       71680     710   70940    1%       0       -    1  /dev/vg00/lvol2&lt;BR /&gt;dev      131072     702  130348    1%       0       -    1  /dev/vg00/swap2&lt;BR /&gt;dev      131072     701  130349    1%       0       -    1  /dev/vg00/swap3&lt;BR /&gt;dev      131072     706  130344    1%       0       -    1  /dev/vg00/swap4&lt;BR /&gt;dev      131072     705  130345    1%       0       -    1  /dev/vg00/swap5&lt;BR /&gt;dev      131072     710  130340    1%       0       -    1  /dev/vg00/swap6&lt;BR /&gt;reserve       -  563273 -563273&lt;BR /&gt;memory   524023  124095  399928   24%&lt;BR /&gt;total   1251063  691602  559321   55%       -       0    -</description>
      <pubDate>Mon, 02 Nov 2009 18:41:52 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207257#M464294</guid>
      <dc:creator>Tony Williams</dc:creator>
      <dc:date>2009-11-02T18:41:52Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207258#M464295</link>
      <description>Sure doesn't look like swap</description>
      <pubDate>Mon, 02 Nov 2009 22:03:12 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207258#M464295</guid>
      <dc:creator>Michael Steele_2</dc:creator>
      <dc:date>2009-11-02T22:03:12Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207259#M464296</link>
      <description>&amp;gt;&amp;gt;there are a high number of page-ins, 200,000 to 800,000.&lt;BR /&gt;&lt;BR /&gt;Do you see a larger than usual number of processes that were started during the time you were having issues?</description>
      <pubDate>Mon, 02 Nov 2009 22:14:16 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207259#M464296</guid>
      <dc:creator>Patrick Wallek</dc:creator>
      <dc:date>2009-11-02T22:14:16Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207260#M464297</link>
      <description>Over most of the day and during this time the number of processes is steady, from 3700 to 3900. I'm attaching an Excel 2007 spreadsheet with a lot of the Global metrics I'm looking at. Before the CPU and Global priority queue go up, the root disk becomes very busy (400% utilization), and page requests and free memory start to go down.</description>
      <pubDate>Mon, 02 Nov 2009 22:31:01 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207260#M464297</guid>
      <dc:creator>Tony Williams</dc:creator>
      <dc:date>2009-11-02T22:31:01Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207261#M464298</link>
      <description>Hi&lt;BR /&gt;&lt;BR /&gt;Need to see a breakdown by process - Please include the reports requested above</description>
      <pubDate>Mon, 02 Nov 2009 22:47:38 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207261#M464298</guid>
      <dc:creator>Michael Steele_2</dc:creator>
      <dc:date>2009-11-02T22:47:38Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207262#M464299</link>
      <description>I don't seem to be able to open the file, so I'm going to try an Excel 2003 format.</description>
      <pubDate>Mon, 02 Nov 2009 22:53:06 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207262#M464299</guid>
      <dc:creator>Tony Williams</dc:creator>
      <dc:date>2009-11-02T22:53:06Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207263#M464300</link>
      <description>Thanks Michael,&lt;BR /&gt;&lt;BR /&gt;I have included all but the sar -d; there are several thousand disks, so the output is very long.</description>
      <pubDate>Mon, 02 Nov 2009 23:06:02 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207263#M464300</guid>
      <dc:creator>Tony Williams</dc:creator>
      <dc:date>2009-11-02T23:06:02Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207264#M464301</link>
      <description>1049892 R 18018     1 java&lt;BR /&gt; 966304 S  6595 10378 dw.sapTPQ_DVEBMGS82&lt;BR /&gt; 716076 S 28075  8116 dw.sapEWD_DVEBMGS32&lt;BR /&gt; 609112 S 24831 24816 dw.sapERQ_DVEBMGS29&lt;BR /&gt; 565144 S 10060 24816 dw.sapERQ_DVEBMGS29&lt;BR /&gt;&lt;BR /&gt;So keep an eye on these processes.&lt;BR /&gt;&lt;BR /&gt;Put the UNIX95 commands in a 15-minute cron job and save the data for at least two days, unless you see vsz process growth greater than the above.&lt;BR /&gt;&lt;BR /&gt;Include sar -c, -u and -v.&lt;BR /&gt;&lt;BR /&gt;Attach the sar -d totals. Note any disks where avwait &gt; avserv, run pvdisplay -v on those disks, and note the file systems.</description>
      <pubDate>Mon, 02 Nov 2009 23:41:59 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207264#M464301</guid>
      <dc:creator>Michael Steele_2</dc:creator>
      <dc:date>2009-11-02T23:41:59Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207265#M464302</link>
      <description>Thanks Michael,&lt;BR /&gt;&lt;BR /&gt;This problem happened this past Friday, and the first order of business is always to free up resources; it's only after everything is back to normal that we start looking to see what the problem was. Your suggestions will help if we have the problem again.</description>
      <pubDate>Tue, 03 Nov 2009 02:15:37 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207265#M464302</guid>
      <dc:creator>Tony Williams</dc:creator>
      <dc:date>2009-11-03T02:15:37Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207266#M464303</link>
      <description>Tony,&lt;BR /&gt;&lt;BR /&gt;&amp;gt;&amp;gt;The problem was 100% CPU utilization with a global priority queue of 120, a memory queue of 60, and a disk queue of 20.&lt;BR /&gt;&lt;BR /&gt;The 100% CPU utilization and priority queue of 120 say it all. Logins took 10 minutes because the priority queue was high with a CPU bottleneck.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;The questions here would be:&lt;BR /&gt;- Did you see any increased load at that time, i.e. maybe more Oracle processes, more Java processes, more applications than in the usual scenario, or more batch jobs being executed?&lt;BR /&gt;- How many CPUs do you have? What is the model of the server?&lt;BR /&gt;- How many processes were running during that time, and how many processes run at usual load?&lt;BR /&gt;- What was the load factor at that time? Obviously it would be more than 1, 2 ..&lt;BR /&gt;- What does the MeasureWare 'extract' report show in the historical data for cpu/mem/io/swap/network in/out etc.?&lt;BR /&gt;From the above we can narrow down the cause.&lt;BR /&gt;&lt;BR /&gt;Hth,&lt;BR /&gt;Raj.</description>
      <pubDate>Tue, 03 Nov 2009 03:01:10 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207266#M464303</guid>
      <dc:creator>Raj D.</dc:creator>
      <dc:date>2009-11-03T03:01:10Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207267#M464304</link>
      <description>Hi&lt;BR /&gt;&lt;BR /&gt;What is this process?&lt;BR /&gt;&lt;BR /&gt;1049892 R 18018 1 java : First in virtual memory and gone to init. Is it normal for it to go to init, or should it have a parent pid?&lt;BR /&gt;&lt;BR /&gt;What is this process?&lt;BR /&gt;&lt;BR /&gt;90.82 R 18669 18375 jlaunch : 2nd in CPU activity, behind only the kernel.&lt;BR /&gt;&lt;BR /&gt;Java login?&lt;BR /&gt;&lt;BR /&gt;Question to others:&lt;BR /&gt;&lt;BR /&gt;Is it normal for 'kernel' to be consuming the most CPU time?</description>
      <pubDate>Tue, 03 Nov 2009 03:13:45 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207267#M464304</guid>
      <dc:creator>Michael Steele_2</dc:creator>
      <dc:date>2009-11-03T03:13:45Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207268#M464305</link>
      <description>Hi&lt;BR /&gt;&lt;BR /&gt;What HP-UX version?&lt;BR /&gt;&lt;BR /&gt;Is this a virtual server or what?</description>
      <pubDate>Tue, 03 Nov 2009 03:15:24 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207268#M464305</guid>
      <dc:creator>Michael Steele_2</dc:creator>
      <dc:date>2009-11-03T03:15:24Z</dc:date>
    </item>
    <item>
      <title>Re: Need help troubleshooting performance issue</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207269#M464306</link>
      <description>Tony,&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&amp;gt;&amp;gt;I have included all but the sar -d there are several thousand disks so the output is very long.&lt;BR /&gt;&lt;BR /&gt;Well, to get a clear idea quickly of whether the disks are being hit heavily, you can check with a small script (sar -d) to find the disks and their corresponding VGs (check the attached one: find_high_io_wait_11iv2.sc). Then, if you see that avwait is high, you can try to locate the cause of the problem.&lt;BR /&gt;&lt;BR /&gt;Hope this helps,&lt;BR /&gt;Raj.</description>
      <pubDate>Tue, 03 Nov 2009 03:36:17 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/need-help-troubleshooting-performance-issue/m-p/5207269#M464306</guid>
      <dc:creator>Raj D.</dc:creator>
      <dc:date>2009-11-03T03:36:17Z</dc:date>
    </item>
  </channel>
</rss>

