<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: BL460C TCP traffic abnormal in BladeSystem - General</title>
    <link>https://community.hpe.com/t5/bladesystem-general/bl460c-tcp-traffic-abnormal/m-p/4796884#M13769</link>
    <description>Hi :&lt;BR /&gt;&lt;BR /&gt;1. One tcp connection with one trasaction is required somewhere, like some applications in telecome. We have changed linux kernel parameter to avoid socket exahusted. I agree if we changed to "keep-alive", BL460C performance level should be higher &lt;BR /&gt;(but our reuiqrement is "close-down"). One connection should be kept for 5 seconds&lt;BR /&gt;&lt;BR /&gt;2. Sorry, no sleep for 600 transactions. We send out 600 transactions in each second. And time slot is 1/600 = 0.0017 second.&lt;BR /&gt;&lt;BR /&gt;3. We use multiple process ( 30 processes )to generate 600 trasactions&lt;BR /&gt;&lt;BR /&gt;   code: generate 30 threads, one thread is allocated 20 TPS&lt;BR /&gt;&lt;BR /&gt;  Did not add cards, use LOM on DL380&lt;BR /&gt;&lt;BR /&gt;4. I use "Auto"  to assign each of the FlexNICs bandwith ( also tried manually, such as 1G connection). We use only one ethernet interface ( such as 10.170.7.168) to accept imcoming request.&lt;BR /&gt;&lt;BR /&gt;5.  ethtool -g eth0 (on BL460C)&lt;BR /&gt;Ring parameters for eth0:&lt;BR /&gt;Pre-set maximums:&lt;BR /&gt;RX:             1024&lt;BR /&gt;RX Mini:        0&lt;BR /&gt;RX Jumbo:       0&lt;BR /&gt;TX:             2048&lt;BR /&gt;Current hardware settings:&lt;BR /&gt;RX:             1012&lt;BR /&gt;RX Mini:        0&lt;BR /&gt;RX Jumbo:       0&lt;BR /&gt;TX:             0&lt;BR /&gt;&lt;BR /&gt;6. Another information:&lt;BR /&gt;&lt;BR /&gt; After stopped simulator, BL460C still have some connection or FIN_wait like following :&lt;BR /&gt;&lt;BR /&gt;Port 10035 is listen port on BL460C and we have stopped sending out request on simulator&lt;BR /&gt;&lt;BR /&gt;netstat -an | grep 10035&lt;BR /&gt;tcp        0      0 10.170.7.168:10035      0.0.0.0:*               LISTEN      &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:41498       TIME_WAIT   &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:41517       TIME_WAIT   &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:57430       FIN_WAIT2   &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:41515       TIME_WAIT   &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:41540       TIME_WAIT   &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:41507       TIME_WAIT   &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:57431       FIN_WAIT2   &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:57620       ESTABLISHED &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:41571       TIME_WAIT   &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:57595       ESTABLISHED &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:57657       ESTABLISHED &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:41492       TIME_WAIT   &lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Do not know whether it is related to VCFlex10 configuration. &lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;7. Network Connection:&lt;BR /&gt;&lt;BR /&gt;   BL460C &amp;lt;-&amp;gt; VC Flex10 &amp;lt;-&amp;gt; Cisco Switch (1G cable) &amp;lt;-&amp;gt; DL380 (simulator)&lt;BR /&gt;</description>
    <pubDate>Tue, 14 Jun 2011 08:47:58 GMT</pubDate>
    <dc:creator>mingxuan_wu</dc:creator>
    <dc:date>2011-06-14T08:47:58Z</dc:date>
    <item>
      <title>BL460C TCP traffic abnormal</title>
      <link>https://community.hpe.com/t5/bladesystem-general/bl460c-tcp-traffic-abnormal/m-p/4796880#M13765</link>
      <description>* Hardware and Software environment &lt;BR /&gt;    1.use BL460C + VCFlex10 as Server.&lt;BR /&gt;    2.Operating system is SuSe 11.0  + SP1&lt;BR /&gt;* Some information :&lt;BR /&gt;&lt;BR /&gt;    We have done load test ( performance test) on some DL380 hardwares with our software, DL380 could handl 600 concurrent TCP traffic at the same time.&lt;BR /&gt;&lt;BR /&gt;    But BL460C can't handle 600 concurrent TCP traffic from our simulator. After investigating, we found BL460C TCP listening port act in strange behaviour. E.g. We send out 600 TPS every second and smoothly, in principle,BL460C TCP port should also receive 600 TPS smoothly ( or 2% ~ 3% fluctuation). The fact is BL460C TCP port receive very low TPS(such as 200 TPS) in one second, but receive very high TPS ( suchs as 1000 TPS ) in other second. Accepted connection number is very strange while performing load test.&lt;BR /&gt;&lt;BR /&gt;* Question :&lt;BR /&gt; &lt;BR /&gt;    1. Does BL460C have firmware to solve this problem ?&lt;BR /&gt;&lt;BR /&gt;    2. Is SuSe11+SP1 compatible with BL460C ?Or should we patch SuSe?&lt;BR /&gt;&lt;BR /&gt; Thank you very much !</description>
      <pubDate>Wed, 08 Jun 2011 13:48:46 GMT</pubDate>
      <guid>https://community.hpe.com/t5/bladesystem-general/bl460c-tcp-traffic-abnormal/m-p/4796880#M13765</guid>
      <dc:creator>mingxuan_wu</dc:creator>
      <dc:date>2011-06-08T13:48:46Z</dc:date>
    </item>
    <item>
      <title>Re: BL460C TCP traffic abnormal</title>
      <link>https://community.hpe.com/t5/bladesystem-general/bl460c-tcp-traffic-abnormal/m-p/4796881#M13766</link>
      <description>Is this "600 concurrent TCP traffic" 600 different connections, or 600 messages on one TCP connection?  Is your load one where the "transactions" are a new TCP connection for each transaction, or are the TCP connections kept up for the length of the test?  Also, when you say you send out 600 TPS every second and smoothly, exactly how do you do that?  Is it initiate 600 transactions, then sleep for a second, then initiate another 600 etc?&lt;BR /&gt;&lt;BR /&gt;What are you using to generate this load?&lt;BR /&gt;&lt;BR /&gt;Have you examined the networking statistics on either your load generator(s) or your BL460c (BTW, *which* BL460C - a "plain" (aka Gen1) or G5, 6 or 7?  As you have VC Flex10 mentioned, have you defined any flexnics?</description>
      <pubDate>Thu, 09 Jun 2011 16:00:42 GMT</pubDate>
      <guid>https://community.hpe.com/t5/bladesystem-general/bl460c-tcp-traffic-abnormal/m-p/4796881#M13766</guid>
      <dc:creator>rick jones</dc:creator>
      <dc:date>2011-06-09T16:00:42Z</dc:date>
    </item>
    <item>
      <title>Re: BL460C TCP traffic abnormal</title>
      <link>https://community.hpe.com/t5/bladesystem-general/bl460c-tcp-traffic-abnormal/m-p/4796882#M13767</link>
      <description>Hi :&lt;BR /&gt;&lt;BR /&gt; Thanks for your response.&lt;BR /&gt;&lt;BR /&gt;1. 600 concurrent request is mapped to 600 connection ( 600 sockets), not one socket.It is a new TCP connection for each transaction.&lt;BR /&gt;&lt;BR /&gt;2. 600 TPS smoothly : Yes, it initiate 600 transactions, then sleep for a second, then initiate another 600 etc.&lt;BR /&gt;&lt;BR /&gt;3. Generate this load : We developed our own c++ codes as simulator ( not use third part tool). Our codes are compiled on both sparc platform (solaris) and SuSe platform. &lt;BR /&gt;&lt;BR /&gt;  If we use DL380 + Suse as platform, and test DL380 + SuSe + our product, performance is very good !&lt;BR /&gt;&lt;BR /&gt;  If we use DL380 + Suse as platform, and test BL460C + SuSe + our product, performance is poor. Problems were describered in the first thread.&lt;BR /&gt;&lt;BR /&gt;  If we use Sparc + Solaris as platform, and test BL460C + SuSe + our product, performance is stable but not high.&lt;BR /&gt;&lt;BR /&gt;4. BL460C Type : BL460C G7&lt;BR /&gt;&lt;BR /&gt;5. Vc Flex10 : Yes, we used flex NIC. BL460C is combined with two physical ethernet interface, and VC Flex10 vertualize other 6 ethernet interfaces. Totally 8 ethernet interfaces on one BL460C hardware.&lt;BR /&gt;&lt;BR /&gt;  We are now using eth0 to do performance test&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Fri, 10 Jun 2011 03:21:27 GMT</pubDate>
      <guid>https://community.hpe.com/t5/bladesystem-general/bl460c-tcp-traffic-abnormal/m-p/4796882#M13767</guid>
      <dc:creator>mingxuan_wu</dc:creator>
      <dc:date>2011-06-10T03:21:27Z</dc:date>
    </item>
    <item>
      <title>Re: BL460C TCP traffic abnormal</title>
      <link>https://community.hpe.com/t5/bladesystem-general/bl460c-tcp-traffic-abnormal/m-p/4796883#M13768</link>
      <description>So with regards to:&lt;BR /&gt;&lt;BR /&gt;1) - a new TCP connection for each transaction is a model that went-out with HTTP 1.0 in the late 1990s.  Frankly, with just the one load generator, you are lucky that 600 TCP connections per second didn't cause issues with TIME_WAIT and local port number exhaustion.  I trust you aren't using an "abortive close" ...  &lt;BR /&gt;&lt;BR /&gt;&lt;A href="http://www.frameip.com/nntp/comp-protocols-tcp-ip/21912-comp-protocols-tcp-ip-time-wait-recycling-vs-reuse.htm" target="_blank"&gt;http://www.frameip.com/nntp/comp-protocols-tcp-ip/21912-comp-protocols-tcp-ip-time-wait-recycling-vs-reuse.htm&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;2) 600 transactions, sleep for one second, lather, rinse, repeat is most certainly *not* smooth.  It is in fact rather bursty.  Is the real-world usage of the system really going to behave like that, or is that bursty behaviour simply an artifact of how you coded-up the load generator?&lt;BR /&gt;&lt;BR /&gt;Also, how "big" are the responses to each transaction?&lt;BR /&gt;&lt;BR /&gt;3) Is your simulator one thread or 600 threads?  Are you launching new threads for each transaction? Can you post some pseudo-code?&lt;BR /&gt;&lt;BR /&gt;On the DL380 were you simply using the LOM ports or did you add cards to it?&lt;BR /&gt;&lt;BR /&gt;Just how low is "not high" when you use Solaris as the load generation?  There are at least two possibilities there - one is that Solaris and SPARC are just slow.  The other is that on the Solaris platform, your attempt to churn through 600 TCP connections a second has indeed exhausted the local port space and that is slowing things down.&lt;BR /&gt;&lt;BR /&gt;5) What bandwidths did you assign to each of the FlexNICs you created?   What are you using as the link(s) between the Flex-10 Modules and the switch that then connects to your DL380 load generator?&lt;BR /&gt;&lt;BR /&gt;It would be good to get the output of ifconfig on one of these flexnics, as well as ethtool -g &lt;BR /&gt;&lt;BR /&gt;You should look at the netstat statistics on your load generator while it is running.  Take two snapshots several seconds apart and then compare them with something like beforeafter from &lt;A href="ftp://ftp.cup.hp.com/dist/networking/tools/" target="_blank"&gt;ftp://ftp.cup.hp.com/dist/networking/tools/&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;netstat -s &amp;gt; before&lt;BR /&gt;sleep 20&lt;BR /&gt;netstat -s &amp;gt; after&lt;BR /&gt;beforeafter before after &amp;gt; delta&lt;BR /&gt;&lt;BR /&gt;and then look at delta.  Probably good to do the same thing on the system under test (eg the BL460 G7) as well.  &lt;BR /&gt;&lt;BR /&gt;From your descriptions thusfar, I am guessing that there are some packet losses somewhere, triggered by the bursts of traffic, and the fluctuation stems from some older transactions being delayed by that.</description>
      <pubDate>Fri, 10 Jun 2011 16:26:35 GMT</pubDate>
      <guid>https://community.hpe.com/t5/bladesystem-general/bl460c-tcp-traffic-abnormal/m-p/4796883#M13768</guid>
      <dc:creator>rick jones</dc:creator>
      <dc:date>2011-06-10T16:26:35Z</dc:date>
    </item>
    <item>
      <title>Re: BL460C TCP traffic abnormal</title>
      <link>https://community.hpe.com/t5/bladesystem-general/bl460c-tcp-traffic-abnormal/m-p/4796884#M13769</link>
      <description>Hi :&lt;BR /&gt;&lt;BR /&gt;1. One tcp connection with one trasaction is required somewhere, like some applications in telecome. We have changed linux kernel parameter to avoid socket exahusted. I agree if we changed to "keep-alive", BL460C performance level should be higher &lt;BR /&gt;(but our reuiqrement is "close-down"). One connection should be kept for 5 seconds&lt;BR /&gt;&lt;BR /&gt;2. Sorry, no sleep for 600 transactions. We send out 600 transactions in each second. And time slot is 1/600 = 0.0017 second.&lt;BR /&gt;&lt;BR /&gt;3. We use multiple process ( 30 processes )to generate 600 trasactions&lt;BR /&gt;&lt;BR /&gt;   code: generate 30 threads, one thread is allocated 20 TPS&lt;BR /&gt;&lt;BR /&gt;  Did not add cards, use LOM on DL380&lt;BR /&gt;&lt;BR /&gt;4. I use "Auto"  to assign each of the FlexNICs bandwith ( also tried manually, such as 1G connection). We use only one ethernet interface ( such as 10.170.7.168) to accept imcoming request.&lt;BR /&gt;&lt;BR /&gt;5.  ethtool -g eth0 (on BL460C)&lt;BR /&gt;Ring parameters for eth0:&lt;BR /&gt;Pre-set maximums:&lt;BR /&gt;RX:             1024&lt;BR /&gt;RX Mini:        0&lt;BR /&gt;RX Jumbo:       0&lt;BR /&gt;TX:             2048&lt;BR /&gt;Current hardware settings:&lt;BR /&gt;RX:             1012&lt;BR /&gt;RX Mini:        0&lt;BR /&gt;RX Jumbo:       0&lt;BR /&gt;TX:             0&lt;BR /&gt;&lt;BR /&gt;6. Another information:&lt;BR /&gt;&lt;BR /&gt; After stopped simulator, BL460C still have some connection or FIN_wait like following :&lt;BR /&gt;&lt;BR /&gt;Port 10035 is listen port on BL460C and we have stopped sending out request on simulator&lt;BR /&gt;&lt;BR /&gt;netstat -an | grep 10035&lt;BR /&gt;tcp        0      0 10.170.7.168:10035      0.0.0.0:*               LISTEN      &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:41498       TIME_WAIT   &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:41517       TIME_WAIT   &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:57430       FIN_WAIT2   &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:41515       TIME_WAIT   &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:41540       TIME_WAIT   &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:41507       TIME_WAIT   &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:57431       FIN_WAIT2   &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:57620       ESTABLISHED &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:41571       TIME_WAIT   &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:57595       ESTABLISHED &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:57657       ESTABLISHED &lt;BR /&gt;tcp        0      0 10.170.7.168:10035      10.170.7.81:41492       TIME_WAIT   &lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;Do not know whether it is related to VCFlex10 configuration. &lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;7. Network Connection:&lt;BR /&gt;&lt;BR /&gt;   BL460C &amp;lt;-&amp;gt; VC Flex10 &amp;lt;-&amp;gt; Cisco Switch (1G cable) &amp;lt;-&amp;gt; DL380 (simulator)&lt;BR /&gt;</description>
      <pubDate>Tue, 14 Jun 2011 08:47:58 GMT</pubDate>
      <guid>https://community.hpe.com/t5/bladesystem-general/bl460c-tcp-traffic-abnormal/m-p/4796884#M13769</guid>
      <dc:creator>mingxuan_wu</dc:creator>
      <dc:date>2011-06-14T08:47:58Z</dc:date>
    </item>
    <item>
      <title>Re: BL460C TCP traffic abnormal</title>
      <link>https://community.hpe.com/t5/bladesystem-general/bl460c-tcp-traffic-abnormal/m-p/4796885#M13770</link>
      <description>So, where are your snapshots of the statistics such as netstat, ethtool and/or switch stats?</description>
      <pubDate>Tue, 14 Jun 2011 18:01:32 GMT</pubDate>
      <guid>https://community.hpe.com/t5/bladesystem-general/bl460c-tcp-traffic-abnormal/m-p/4796885#M13770</guid>
      <dc:creator>rick jones</dc:creator>
      <dc:date>2011-06-14T18:01:32Z</dc:date>
    </item>
  </channel>
</rss>

