The unique modular architecture of HPE Superdome Flex: How it works and why it matters
Is your infrastructure straining to handle demands to process ever-growing data sets? Learn how the unique modular architecture of HPE Superdome Flex delivers extreme performance, high bandwidth and consistent low latency, even at the largest configurations.
Last December, HPE announced the world's most scalable and modular in-memory computing platform, HPE Superdome Flex: a compute breakthrough to power critical applications, enable real-time analytics and tackle data-intensive high performance computing (HPC) workloads.
In a series of three blogs, I'll take an in-depth look at the HPE Superdome Flex capabilities that make it unique in the industry and explain how they can add value to your business. To get started, I'm focusing here on the platform's modular, scalable architecture.
Scaling beyond the capabilities of Intel
Like most x86 server vendors, HPE uses the latest Intel® Xeon® Scalable processor (codename Skylake) in its latest-generation servers, including HPE Superdome Flex. Intel's reference design for these processors uses the new UltraPath Interconnect (UPI), which limits scaling to 8 sockets. Most vendors using these processors base their server designs on this "glueless" interconnect method, but unlike them, HPE Superdome Flex uses a unique modular architecture that can scale beyond the capabilities of Intel, from 4 to 32 sockets in a single system.
We did this because we recognized the market need for platforms able to scale beyond Intel's 8-socket limit, especially today when data sets are growing at an unprecedented pace. In addition, because Intel focuses the UPI on 2- and 4-socket servers, the 8-socket "glueless" servers become bandwidth challenged. Our design delivers high bandwidth even when you grow the system to the largest configurations.
Price/performance advantages over other systems
The HPE Superdome Flex modular architecture is based on a 4-socket chassis that can scale to 8 chassis for a total of 32 sockets in a single-system compute powerhouse. You have many different processor options to choose from, ranging from the cost-efficient Gold to the high-end Platinum "flavors" of the Xeon Scalable processor family.
This choice of Gold and Platinum processors delivers great price/performance advantages over smaller systems. For example, in a typical 6TB memory configuration, Superdome Flex can deliver a lower-cost, higher-performance solution than competitive 4-socket offerings. Why? Because of their design, other 4-socket systems are forced to use 128GB DIMMs, which are a lot more expensive than the 64GB DIMMs an 8-socket Superdome Flex can utilize. At this socket count, an 8-socket/6TB Superdome Flex will deliver double the compute power, double the memory bandwidth and double the I/O capability, and it will still be more cost effective than a 4-socket/6TB competitive product.
Similarly, for a competitive 8-socket/6TB configuration, Superdome Flex can deliver a lower-cost, higher-performance 8-socket solution. How? While others are forced to use more expensive Platinum processors because of their design, an 8-socket Superdome Flex can use lower-cost Gold processors to give you the same memory capacity.
In fact, of the platforms based on Intel Xeon Scalable processors, Superdome Flex is the only one able to deliver 8 sockets using the cost-effective Gold variant (as Intel's "glueless" design supports 8 sockets only through the more expensive Platinum type). We also offer a variety of core count choices, enabling you to map the number of cores per processor to your workload requirements, with variations starting as low as 4 cores to as high as 28 cores per processor.
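To make the DIMM arithmetic behind the 6TB comparison concrete, here is a minimal sketch. It takes the 48 DIMM slots per 4-socket chassis quoted later in this post (12 slots per socket) and assumes, purely for illustration, that a competing 4-socket system has the same number of slots per socket. The function name and constants are hypothetical, not part of any HPE tool.

```python
# Illustrative arithmetic only.
# 48 slots per 4-socket chassis is the figure quoted in this post; applying
# the same slots-per-socket count to other vendors' systems is an assumption.
SLOTS_PER_SOCKET = 48 // 4  # 12 DIMM slots per socket
GB_PER_TB = 1024


def dimm_size_gb(target_tb: int, sockets: int) -> float:
    """Return the DIMM capacity (GB) needed to reach target_tb
    when every slot on every socket is populated."""
    slots = sockets * SLOTS_PER_SOCKET
    return target_tb * GB_PER_TB / slots


if __name__ == "__main__":
    for sockets in (4, 8):
        size = dimm_size_gb(6, sockets)
        print(f"{sockets} sockets -> {size:.0f} GB DIMMs for 6 TB")
```

Under these assumptions, doubling the socket count doubles the slot count, so each DIMM only needs half the capacity to reach the same total.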
Scaling up: why it matters
The ability to scale as a single system, or scale up, delivers several advantages for those vital workloads and databases HPE Superdome Flex is best suited for. These include traditional and in-memory databases, real-time analytics, ERP, CRM and other OLTP workloads. For these types of workloads, a scale-up environment is simpler and cheaper to manage than a scale-out cluster, and it also reduces latency, increasing performance.
Check out this blog post on the transaction speed when scaling up or out with SAP S/4HANA to understand why scaling up is a much better alternative than scaling out/clustering for these types of workloads. It's all about speed and the ability to perform at the level required for these critical applications.
Consistent high performance, even at the largest configurations
The Superdome Flex extreme scale is achieved via the unique HPE Superdome Flex ASIC chipset, connecting the individual 4-socket chassis to one another in a point-to-point fashion, as shown in Figures 1 and 2. The HPE Superdome Flex ASIC technology enables adaptive routing, which load-balances the fabric and optimizes latency and bandwidth, increasing performance and system availability. The ASIC connects the chassis together in a cache-coherent fabric and maintains coherency by tracking cache line state and ownership across all the processor sockets inside a directory cache built into the ASIC itself. This coherency scheme is a critical factor in the ability of HPE Superdome Flex to perform at near linear scaling from 4-sockets all the way up to 32-sockets. Typical glueless architecture designs already see limited performance when scaling to as low as 4- to 8-sockets, because of broadcast snooping.
Figure 1. HPE Superdome Flex ASICs point-to-point connections
Figure 2. HPE Superdome Flex 4-socket chassis
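The difference between directory-based coherency and broadcast snooping can be sketched with a toy model. This is not the Superdome Flex ASIC protocol, whose details are not given here. It is a simplified illustration, with hypothetical class and function names, of why tracking which sockets hold a cache line lets a write contact only those sockets instead of every socket in the system.

```python
from dataclasses import dataclass, field


@dataclass
class Directory:
    """Toy directory: maps a cache line address to the sockets holding a copy.
    A simplified illustration, not the actual ASIC design."""
    sharers: dict[int, set[int]] = field(default_factory=dict)

    def read(self, socket: int, line: int) -> None:
        """Record that `socket` now holds a copy of `line`."""
        self.sharers.setdefault(line, set()).add(socket)

    def write(self, socket: int, line: int) -> int:
        """Give `socket` exclusive ownership of `line`.
        Returns how many other sockets had to be invalidated."""
        others = self.sharers.get(line, set()) - {socket}
        self.sharers[line] = {socket}
        return len(others)


def broadcast_invalidations(total_sockets: int) -> int:
    """With broadcast snooping, a write probes every other socket."""
    return total_sockets - 1


if __name__ == "__main__":
    d = Directory()
    d.read(0, 0x40)
    d.read(5, 0x40)
    print("directory messages:", d.write(0, 0x40))
    print("broadcast messages:", broadcast_invalidations(32))
```

In this toy model the directory's message count depends on how many sockets actually share the line, while the broadcast count grows with the size of the system.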
Shared memory
In a similar fashion to compute, memory capacity can grow as more chassis are added to the system. With support for 48 DDR4 DIMM slots per chassis, accommodating either 32 GB RDIMMs, 64 GB LRDIMMs, or even 128 GB 3DS LRDIMMs, the maximum per-chassis memory capacity is 6 TB. This gives a fully scaled 32-socket HPE Superdome Flex a whopping total memory capacity of 48 TB of shared memory to support the most demanding in-memory applications.
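The capacity figures above follow directly from the slot count and DIMM sizes. A minimal sketch of that arithmetic, using only the numbers quoted in this post (the function and constant names are hypothetical):

```python
DIMM_SLOTS_PER_CHASSIS = 48   # from this post
MAX_CHASSIS = 8               # 8 chassis x 4 sockets = 32 sockets
GB_PER_TB = 1024


def system_memory_tb(chassis: int, dimm_gb: int) -> float:
    """Total memory (TB) with every DIMM slot populated with dimm_gb DIMMs."""
    if not 1 <= chassis <= MAX_CHASSIS:
        raise ValueError(f"chassis must be between 1 and {MAX_CHASSIS}")
    return chassis * DIMM_SLOTS_PER_CHASSIS * dimm_gb / GB_PER_TB


if __name__ == "__main__":
    for dimm_gb in (32, 64, 128):
        per_chassis = system_memory_tb(1, dimm_gb)
        full_system = system_memory_tb(MAX_CHASSIS, dimm_gb)
        print(f"{dimm_gb:>3} GB DIMMs: {per_chassis} TB per chassis, "
              f"{full_system} TB at 32 sockets")
```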
Extreme I/O flexibility
As for I/O, each HPE Superdome Flex chassis can be equipped with either a 16-slot or 12-slot I/O bulkhead to provide numerous stand-up PCIe 3.0 card options, giving you plenty of flexibility to support a wide variety of workloads. With either I/O bulkhead selection, the I/O design provides direct connections between the processors and the card slots, with no need for bus repeaters or retimers that can add latency or reduce bandwidth. This gives you the best per-card performance possible.
Ultra-low latency
Low latency is a key factor driving the high performance of Superdome Flex. Although data exists in local (directly connected to processor) or remote (across chassis) memory, copies of the data can exist in various processor caches throughout the system. Cache coherency keeps the cached copies consistent in the event an operation changes the data. The round trip latency between a processor and local memory is about 100ns. Latency of a processor accessing data from memory connected to another processor over UPI is ~130ns.
Data that a processor accesses from memory in another chassis travels between two Flex ASICs (always a single "hop") for a round-trip latency of under 400ns, regardless of whether a processor at the top of the rack is accessing data from memory at the bottom. As for bandwidth, Superdome Flex provides more than 210 GB/s of bi-sectioned crossbar bandwidth at 8 sockets, more than 425 GB/s at 16 sockets and over 850 GB/s at 32 sockets. That's plenty to power the most demanding workloads. In another post, I will expand on the performance topic and share some recent Superdome Flex benchmark results.
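As a rough way to reason about these numbers, the sketch below computes a blended memory latency for a given mix of local, UPI and cross-chassis accesses, and the quoted bandwidth per socket at each system size. The access mix is a made-up input, 400ns is used as an upper bound for the cross-chassis case, and the bandwidth values are the "more than" floors quoted above, so treat the output as illustrative rather than as measured results.

```python
import math

# Round-trip latencies quoted in this post (ns).
LOCAL_NS = 100
UPI_NS = 130
REMOTE_NS = 400  # "under 400ns", so this is an upper bound

# Quoted lower bounds for crossbar bandwidth (GB/s) by socket count.
BANDWIDTH_GBPS = {8: 210, 16: 425, 32: 850}


def average_latency_ns(local: float, upi: float, remote: float) -> float:
    """Blended latency for a hypothetical access mix; fractions must sum to 1."""
    if not math.isclose(local + upi + remote, 1.0):
        raise ValueError("access fractions must sum to 1")
    return local * LOCAL_NS + upi * UPI_NS + remote * REMOTE_NS


def bandwidth_per_socket(sockets: int) -> float:
    """Quoted bandwidth floor divided evenly across sockets (GB/s)."""
    return BANDWIDTH_GBPS[sockets] / sockets


if __name__ == "__main__":
    # Hypothetical mix: 50% local, 30% over UPI, 20% cross-chassis.
    print(f"blended latency: {average_latency_ns(0.5, 0.3, 0.2):.0f} ns")
    for sockets in BANDWIDTH_GBPS:
        print(f"{sockets} sockets: {bandwidth_per_socket(sockets):.2f} GB/s per socket")
```

The per-socket figure stays roughly constant from 8 to 32 sockets, which is one way to read the claim that bandwidth keeps pace as the system grows.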
Why does this extreme modular scalability matter?
It's no secret data is growing at an unprecedented pace, which means infrastructure strains to handle increasingly demanding requests to process and analyze critical, ever-growing data sets. But growth rates can be unpredictable.
To support the business, IT teams need systems that respond effectively and promptly to their requests, regardless of the amount of data or how fast it grows. Having a platform that keeps pace with the demands of your business will give you peace of mind: you'll know that you won't run out of room to grow, but neither will you need to overprovision.
When you deploy memory-intensive workloads, you might ask: What will my next TB of memory capability cost? With Superdome Flex, you can scale memory capacity without a forklift upgrade, as you're not limited to the DIMM slots in a single chassis. Also, as the number of users increases, mission-critical applications require a high-performing environment regardless of size.
Today's in-memory databases demand low-latency/high-bandwidth systems. Thanks to its innovative architecture, HPE Superdome Flex delivers extreme performance, high bandwidth and consistent low latency, even at the largest configurations. What's more, you can get all this for your critical workloads and databases at better price/performance than on smaller systems.
One more thing: HPE Superdome Flex has been recently certified to run VMware and Oracle Linux workloads, in addition to the standard RHEL and SUSE Linux distributions. Oracle VM and Windows certifications are expected later this year.
In the second blog in this series, I cover some of the advanced and unique reliability, availability and serviceability (RAS) features of HPE Superdome Flex resulting in five nines (99.999%) single-system availability.
You might also want to check out the HPE Superdome Flex Architecture and RAS technical whitepaper or watch this short video for architecture highlights.
Meet Servers: The Right Compute Blogger Diana Cortes, Marketing Manager, Mission Critical x86 Solutions, HPE.
Diana has spent the past 20 years working with the technologies that power the world's most demanding environments and is interested in how solutions based on those technologies impact the business. A native of Colombia, Diana holds an MBA from Georgetown University and has held a variety of regional and global roles with HPE in the US, the UK and Sweden.