Servers & Systems: The Right Compute
1825009 Members
2913 Online
109678 Solutions
New Article
Brandon_Draeger

HPE Cray Perlmutter: Its perfect ratio and interconnect

 Perlmutter is the name the National Energy Research Scientific Computing Center (NERSC) of Berkeley Lab gave its upcoming HPE Cray Shasta supercomputer in recognition of resident Nobel laureate, Saul Perlmutter, professor, astrophysicist, and senior scientist.Perlmutter-blog.png

In October 2018, when the U.S. Department of Energy (DOE) and Cray (now part of HPE) announced the Perlmutter system, the news included that next-generation NVIDIA GPUs would be incorporated into the system’s nodes. 

Today, we are happy to join NERSC and NVIDIA in an update stating that the accelerators the Perlmutter system will use are NVIDIA A100 Tensor Core GPUs. Perlmutter will be based on HPE’s Cray Shasta architecture, Slingshot interconnect, and compute blades each featuring four NVIDIA A100 GPUs and a single AMD EPYC™ processor (4:1 GPU to CPU ratio). Half the eventual total system cabinets will provide GPU acceleration using over 6000 NVIDIA A100 GPUs. 

The user community at NERSC has been working to prepare its applications by adding GPU acceleration to new apps and optimizing GPU accelerated code where it already exists. NERSC has also been helping its user community prepare for the new GPU architecture through the NERSC Exascale Science Applications Program (NESAP). NESAP partnerships allow projects to collaborate with NERSC and HPC vendors by providing access to early hardware, prototype software tools for performance analysis and optimization, and special training. Over the past 20 months, the NESAP teams have been working closely with HPE and NVIDIA engineers to accelerate as many codes as possible to ensure the scientific community can hit the ground running when Perlmutter comes online in 2021. ]

Through the Cray Shasta architecture, HPE is addressing the desire of customers of all types to get the most out of data through new converged-use HPC, AI, and Big Data analytics—in a single workflow and often in real time. The new capabilities of the Shasta architecture include flexibility to support a multitude of new processor types. In fact, Perlmutter is a great example of this flexibility because it will be the fourth unique CPU (and CPU plus GPU) architecture combination that we have announced thus far for the various customer awards granted.   

Additionally, Perlmutter will use:

  • Cray Slingshot—a new type of interconnect that can bridge from the research lab to the enterprise
  • Cray ClusterStor E1000—the fastest (4TB/sec) all Flash NVMe file system (featuring 30 PB usable capacity for Perlmutter)
  • New Shasta architecture software—to empower developers and operators with systems that perform like a supercomputer but run like a cloud

In response to the new requirements of the Exascale Era, the Cray Shasta architecture was first created to address the critical needs of the U.S. National labs—and then to serve the equally pressing needs of every enterprise that struggles with massive data and converged analytics on the path to digital transformation.


Brandon Draeger
Hewlett Packard Enterprise

twitter.com/HPE_HPC
twitter.com/cray_inc
linkedin.com/showcase/hpe-servers-and-systems/
hpe.com/servers

About the Author

Brandon_Draeger

Brandon leads the Compute Product Marketing teams for HPE and joined the company in January 2020 as part of the Cray acquisition. Prior to Cray, Brandon held leadership roles in engineering, product management, marketing, and strategy at Intel, Dell, and Symantec.