<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Inconsistent Performance with DL785 using MPI in ProLiant Servers (ML,DL,SL)</title>
    <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/inconsistent-performance-with-dl785-using-mpi/m-p/4653778#M103760</link>
    <description>We are using a DL785 G6 system with eight 6-core Opteron processors (48 cores) and 256 GB of total memory. We are running a computationally and memory-intensive model that uses MPI, and we are testing how well it scales. The OS is Red Hat Enterprise Linux 5.5.&lt;BR /&gt;&lt;BR /&gt;Running on 12 cores, for example, produces inconsistent results: run times vary greatly depending on which processors are assigned to the tasks. Running on increasing numbers of cores yields diminishing returns.&lt;BR /&gt;&lt;BR /&gt;A single-threaded test produced strange results as well. We ran 47 simultaneous instances of a program that calculates pi to 100,000,000 digits. Each instance ran at a consistent speed (seconds per digit calculated) while all 47 were running. However, 28 instances ran at speed X, while the remaining 19 ran at only 50% to 75% of the speed of the fastest ones. Memory availability was not an issue for this test (the program's memory footprint was small). After some instances terminated, the remaining ones visibly sped up; whether they actually migrated to other processors or simply ran faster where they were is unknown.&lt;BR /&gt;&lt;BR /&gt;Does anyone know why the same program would run significantly slower on some cores when all processors are identical?&lt;BR /&gt;</description>
    <pubDate>Mon, 28 Jun 2010 14:12:40 GMT</pubDate>
    <dc:creator>David B Hart</dc:creator>
    <dc:date>2010-06-28T14:12:40Z</dc:date>
    <item>
      <title>Inconsistent Performance with DL785 using MPI</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/inconsistent-performance-with-dl785-using-mpi/m-p/4653778#M103760</link>
      <description>We are using a DL785 G6 system with eight 6-core Opteron processors (48 cores) and 256 GB of total memory. We are running a computationally and memory-intensive model that uses MPI, and we are testing how well it scales. The OS is Red Hat Enterprise Linux 5.5.&lt;BR /&gt;&lt;BR /&gt;Running on 12 cores, for example, produces inconsistent results: run times vary greatly depending on which processors are assigned to the tasks. Running on increasing numbers of cores yields diminishing returns.&lt;BR /&gt;&lt;BR /&gt;A single-threaded test produced strange results as well. We ran 47 simultaneous instances of a program that calculates pi to 100,000,000 digits. Each instance ran at a consistent speed (seconds per digit calculated) while all 47 were running. However, 28 instances ran at speed X, while the remaining 19 ran at only 50% to 75% of the speed of the fastest ones. Memory availability was not an issue for this test (the program's memory footprint was small). After some instances terminated, the remaining ones visibly sped up; whether they actually migrated to other processors or simply ran faster where they were is unknown.&lt;BR /&gt;&lt;BR /&gt;Does anyone know why the same program would run significantly slower on some cores when all processors are identical?&lt;BR /&gt;</description>
      <pubDate>Mon, 28 Jun 2010 14:12:40 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/inconsistent-performance-with-dl785-using-mpi/m-p/4653778#M103760</guid>
      <dc:creator>David B Hart</dc:creator>
      <dc:date>2010-06-28T14:12:40Z</dc:date>
    </item>
    <item>
      <title>Re: Inconsistent Performance with DL785 using MPI</title>
      <link>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/inconsistent-performance-with-dl785-using-mpi/m-p/4653779#M103761</link>
      <description>Hello David,&lt;BR /&gt;We use a DL785 G5 (32 cores, 128 GB RAM) as an MS SQL data warehouse.&lt;BR /&gt;Server performance is great, but our workload is a different kind of task from yours.&lt;BR /&gt;&lt;BR /&gt;Jan</description>
      <pubDate>Tue, 29 Jun 2010 18:13:41 GMT</pubDate>
      <guid>https://community.hpe.com/t5/proliant-servers-ml-dl-sl/inconsistent-performance-with-dl785-using-mpi/m-p/4653779#M103761</guid>
      <dc:creator>Jan Soska</dc:creator>
      <dc:date>2010-06-29T18:13:41Z</dc:date>
    </item>
  </channel>
</rss>

