AMD EPYC 9004 Genoa: Difference between revisions

From HPCWIKI
Jump to navigation Jump to search
(새 문서: ==EPYC Genoa== center | 500px In brief : At OCP Summit 2022, AMDlaunching next-generation AMD EPYC 9004 4th-gen CPU, code name Genoa on November 10, 2022. AMD Genor supports up to 96 cores with 192 threads with 5nm manufacturing process. 12-channel DDR5 memory, Compute Express Link 1.1 standards as well as more PCIe Gen5 capabilities. HPCMATE with local and global partners are preparing HPC optimized AMD based server in Q2 2023. === Genoa === Code-n...)
 
No edit summary
Line 8: Line 8:
<p align="center"> <youtube>ytITVteSeOI&t=3s</youtube> </p>
<p align="center"> <youtube>ytITVteSeOI&t=3s</youtube> </p>


With its massive 96-core-count in a single processor, the new AMD Genoa processors will allow organizations to reduce their physical footprint by deploying fewer servers while leveraging more powerful servers.
With its massive 96-core-count in a single processor, the new AMD Genoa processors will allow organizations to reduce their physical footprint by deploying fewer servers while leveraging more powerful servers. HPE and Dell have announced their servers, each offering four systems, two 1 CPU chassis, and two 2 CPU chassis


===Key features===
===Key features===
The CPU chiplets will be able to house up to '''64 cores and 128 threads'''. The EPYC Genoa processors will also have access to faster 8 channel DDR4 memory lanes, allowing for higher bandwidth. Genoa SoCs support both single and 2-way multiprocessing with up to a maximum of 64 cores (and 128 threads) per processor for a total of up to 128 cores (and 256 threads) for a 2-way MP system. Those SoCs support '''128 PCIe Gen 4 lanes each''', however, ''half of them are lost when in 2-way MP'' (leaving the system with the same overall lanes count as a single socket solution). Communication between the two chips is done via AMD's [[Infinity Fabric]] protocol over the 64 reserved lanes. Genoa is backwards platform/socket compatible with Naples and forward-compatible with Milan.
Genoa has key enhancements in memory cost, which is 50% of a server’s BOM. The support for 72-bit and 80-bit DIMMs is noteworthy. Most servers will use 80-bit ECC, but some hyperscalers want to cut down to 72-bit. The advantage here is that there is 1 less DRAM die for parity checks.
[[file : Epyc_genoa_iodie.png | center | 500px]]
The other important feature is dual rank versus single rank memory. With Milan and most Intel platforms, dual-rank memory is crucial to maximizing performance. There’s a 25% performance delta on Milan, for example. With Genoa, this is brought down to 4.5%. This is another considerable cost improvement because cheaper single-rank memory can be used.
Genoa has higher memory latency than Milan, 118ns on Genoa versus 105ns on Milan. AMD’s argument against this is that only 3ns of this is from the massively larger IO die, 73ns on Genoa versus 70ns on Milan. Most of the memory latency impact comes from the DDR5 memory device itself. 35ns on DDR5 versus 25ns on DDR4.


Genoa will support 3200 MHz RAM in eight channels, for up to 4 TB of RAM per socket (While Intel is presently limited to 2 TB across six channels, and only up to speeds of 2666 MHz)
Power management is enhanced. Genoa has 2 basic modes for power management, performance determinism or power determinism.
PCIe 4.0 and 128 PCIe lanes
Performance determinism is for firms that want consistent performance. It consumes less power when allowed to, and performance is kept stable. Most customers will choose this option because stability is vital.
Intel processors have significantly higher TDPs at the high-end, much higher prices at the low-end, and far less cache the whole way through
Power determinism is for keeping power consumption stable and ramping performance up and down. Given factors such as the silicon lottery, thermal budget, and workloads, the chip will ramp up and down clock speeds.
In addition to the power management modes, there is a configurable TDP for Genoa chips. The peak boost behavior will vary depending on which option is chosen.


In summary, all Genoa processors have the following:  
[[file : 9004_highlight.png | center | 500px]]
* 128 PCIe lanes PCIe Gen 4 (in both single-way and dual-way multiprocessing)
 
* Octa-channel Memory
AMD generally supports CXL 1.1 but supports CXL 2.0 for Type 3 memory devices, One noteworthy item is that the 64 lanes of CXL can be bifurcated into 16 4x devices.
* Up to DDR4-3200 ECC 4 TiB (8 TiB in 2MP)
Type 3 is what the ecosystem wanted, Genoa was delayed 2 quarters to add this feature. Intel Sapphire Rapids is not capable of CXL lane bifurcation. If one connects a 4x or 8x CXL device, that will consume all 16 lanes.
* Up to 64 cores / 128 threads
Hypervisors cannot change the memory assignment under the guest, which is huge for users using CXL-attached memory in the cloud.
* Everything up to AVX2 (i.e., SMM, FPU, NX, MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AES, AVX, FMA3, and AVX2), and SHA
 
 
In summary, The pillars of performance for AMD are per-socket performance leadership, per-core performance leadership, leadership across all workloads and market segments, and leadership in TCO and sustainability


===EPYC Genoa series===
===EPYC Genoa series===
[[file : Epyc_roadmap.png | center | 500px]]
[[file : genoa_9004_sku.png | center | 500px]]
According to Techspot<ref>https://www.techspot.com/news/80631-amd-7nm-epyc-genoa-specs-prices.html</ref>, AMD is bringing 64, 48, 32, 24, 16, 12 and 8 core variants to the market this time, most of which are available in three configuration options. A fancy-pants dual-socket compatible version, a single-socket version with identical specs (bar one or two exceptions) and a ‘budget’ variant that cuts back a little while increasing power efficiency
The big question then is obvious, is AMD Genoa worth the investment? these new CPU technologies have a lot to offer in terms of compute power, security, and efficiency
[[file : Epyc_genoa_series.png | center | 500px]]
 
The big thing to note is that when software per core licensing costs come into play, this lead extends even further in TCO. This is best shown in the enterprise benchmark, which runs VMMark. VMMark runs 19 representative VM per tile and then sees how many tiles can be run as well as the speed. Genoa is faster and can handle more VMs.
 
 


The AMD EPYC Genoa processor family is expected to lift AMD’s server CPU market share to 10% by 2020. AMD’s 32-core processor is 29% cheaper than Intel’s 28-core, their 24-core part is 57% cheaper, and their 16-core part is 69% cheaper <ref>AMD 7nm Epyc Genoa specs and prices leak</ref>
* <ref> AMD Genoa Detailed – Architecture Makes Xeon Look Like A Dinosaur, url=https://www.semianalysis.com/p/amd-genoa-detailed-architecture-makes</ref> 
* <ref> CXL Deep Dive,url=https://www.semianalysis.com/p/cxl-deep-dive-future-of-composable </ref>  


* Wccftech’s article shows EPYC Genoa 7452 with 32 cores moninating position against Intel’s Xeon and much faster than EPY Naples <ref>EPYC Genoa 7452 with 32 cores moninating position against Intel, url=https://wccftech.com/amd-epyc-7452-7nm-rome-server-cpu-benchmark-versus-intel-xeon/</ref> 
* Dell EMC announced that they will be launching servers powered by AMD’s newest architecture – a 7nm architecture codenamed ‘Genoa’ – in the second half of 2019 - Dominique Vanhamme (DELL EMEA vice president)
* Microsoft took the stage at its E3 2019 keynote, announcing that the system-on-a-chip powering the next Xbox, Project Scarlett, is using Zen 2 cores and AMD Navi graphics. Microsoft certainly is talking a deep partnership with AMD during computex. When talking about Windows, Roanne explicitly mentioned "mission critical system...and government" during Computex on stage, as well as "premium sector and next level".


==Reference==
==Reference==

Revision as of 12:41, 14 March 2023

EPYC Genoa

Epyc genoa.png

In brief : At OCP Summit 2022, AMDlaunching next-generation AMD EPYC 9004 4th-gen CPU, code name Genoa on November 10, 2022. AMD Genor supports up to 96 cores with 192 threads with 5nm manufacturing process. 12-channel DDR5 memory, Compute Express Link 1.1 standards as well as more PCIe Gen5 capabilities. HPCMATE with local and global partners are preparing HPC optimized AMD based server in Q2 2023.

Genoa

Code-named AMD Genoa, the new line of CPUs supports 12 channels of DDR5-4800 (up to 6TB memory capacity per socket), 128 lanes of PCIe Gen5, AMD Infinity Fabric/Guard technology, and up to 96 cores. This makes them ideal for critical workloads across cloud, enterprise, and high-performance computing.

With its massive 96-core-count in a single processor, the new AMD Genoa processors will allow organizations to reduce their physical footprint by deploying fewer servers while leveraging more powerful servers. HPE and Dell have announced their servers, each offering four systems, two 1 CPU chassis, and two 2 CPU chassis

Key features

Genoa has key enhancements in memory cost, which is 50% of a server’s BOM. The support for 72-bit and 80-bit DIMMs is noteworthy. Most servers will use 80-bit ECC, but some hyperscalers want to cut down to 72-bit. The advantage here is that there is 1 less DRAM die for parity checks. The other important feature is dual rank versus single rank memory. With Milan and most Intel platforms, dual-rank memory is crucial to maximizing performance. There’s a 25% performance delta on Milan, for example. With Genoa, this is brought down to 4.5%. This is another considerable cost improvement because cheaper single-rank memory can be used. Genoa has higher memory latency than Milan, 118ns on Genoa versus 105ns on Milan. AMD’s argument against this is that only 3ns of this is from the massively larger IO die, 73ns on Genoa versus 70ns on Milan. Most of the memory latency impact comes from the DDR5 memory device itself. 35ns on DDR5 versus 25ns on DDR4.

Power management is enhanced. Genoa has 2 basic modes for power management, performance determinism or power determinism. Performance determinism is for firms that want consistent performance. It consumes less power when allowed to, and performance is kept stable. Most customers will choose this option because stability is vital. Power determinism is for keeping power consumption stable and ramping performance up and down. Given factors such as the silicon lottery, thermal budget, and workloads, the chip will ramp up and down clock speeds. In addition to the power management modes, there is a configurable TDP for Genoa chips. The peak boost behavior will vary depending on which option is chosen.

9004 highlight.png

AMD generally supports CXL 1.1 but supports CXL 2.0 for Type 3 memory devices, One noteworthy item is that the 64 lanes of CXL can be bifurcated into 16 4x devices. Type 3 is what the ecosystem wanted, Genoa was delayed 2 quarters to add this feature. Intel Sapphire Rapids is not capable of CXL lane bifurcation. If one connects a 4x or 8x CXL device, that will consume all 16 lanes. Hypervisors cannot change the memory assignment under the guest, which is huge for users using CXL-attached memory in the cloud.


In summary, The pillars of performance for AMD are per-socket performance leadership, per-core performance leadership, leadership across all workloads and market segments, and leadership in TCO and sustainability

EPYC Genoa series

Genoa 9004 sku.png

The big question then is obvious, is AMD Genoa worth the investment? these new CPU technologies have a lot to offer in terms of compute power, security, and efficiency

The big thing to note is that when software per core licensing costs come into play, this lead extends even further in TCO. This is best shown in the enterprise benchmark, which runs VMMark. VMMark runs 19 representative VM per tile and then sees how many tiles can be run as well as the speed. Genoa is faster and can handle more VMs.



Reference

  1. AMD Genoa Detailed – Architecture Makes Xeon Look Like A Dinosaur, url=https://www.semianalysis.com/p/amd-genoa-detailed-architecture-makes
  2. CXL Deep Dive,url=https://www.semianalysis.com/p/cxl-deep-dive-future-of-composable