
AMD unveils new generation of Epyc, Instinct chips

News

Oct 11, 2024

CPUs and Processors, Data Center, High-Performance Computing

New Epyc chip to offer scale-up performance or scale-out capacity for a wide range of data center workloads.


AMD announced its latest AI and high-performance computing processors at its Advancing AI event in San Francisco, including the fifth generation of its Epyc server processors and AMD Instinct MI325X accelerators. Commitments from leading customers and partners, including Microsoft, OpenAI, Meta, Oracle, and Google Cloud, rounded out the event.

The fifth-generation Epyc CPUs come in two distinct configurations, both part of the same 9005 family, also known as Turin. The scale-up version has a new “Zen 5” core architecture optimized for maximum performance, according to AMD. The scale-out models come with dense “Zen 5c” compact cores, a concept first introduced last year with the Zen 4c Bergamo line.

Intel has a similar strategy with its Performance cores and Efficiency cores, but it achieves the E-cores by removing instructions, which risks breaking apps. AMD got its compact cores by limiting cache size and clock speed while keeping all the instructions.

The scale-up CPUs come with up to 128 cores and 256 threads and are made with TSMC’s 4nm process. The scale-out Zen 5c version will offer up to 192 cores and 384 threads and is made on TSMC’s 3nm process. The Zen 5-based processors have a max power draw of 500 watts, while the 5c draws 390 watts.

With the new chips come big claims. AMD says its flagship 192-core Epyc 9965 is 2.7 times faster than Intel’s competing Xeon, the Platinum 8592+. AMD also claims four times faster video transcoding, 3.9 times faster performance in HPC applications, and up to 1.6 times the performance per core in virtualized environments.

Instinct takes on Nvidia

AMD is fighting a war on two fronts, against Intel and against Nvidia. It’s looking to take on Nvidia in the AI accelerator space with Instinct. While it has a fraction of the business Nvidia has, AMD does have some wins, such as Frontier, the world’s fastest supercomputer.

To that end, AMD has launched the MI325X with greater memory capacity and bandwidth than the Instinct MI300X, which launched last December. The MI325X is based on the same CDNA 3 GPU architecture but offers 256GB of HBM3e high-bandwidth memory and 6 TB/s of memory bandwidth, compared with 192GB of HBM3 and 5.3 TB/s in the MI300X.

AMD said the MI325X outperforms Nvidia’s top-of-the-line Hopper H200 in AI inference: 40% higher throughput with the Mixtral 8x7B model (eight experts of 7 billion parameters each), 30% lower latency with the 7-billion-parameter Mistral model, and 20% lower latency with the 70-billion-parameter Llama 3.1 model.

AMD is planning an eight-GPU platform for next year, similar to Nvidia’s DGX systems. With eight MI325X GPUs connected over AMD’s Infinity Fabric, the platform will offer 2TB of HBM3e memory, 48 TB/s of total memory bandwidth, 20.8 petaflops of FP8 performance, and 10.4 petaflops of FP16 performance, AMD said.
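For readers who want to check the platform totals against the single-GPU specifications, the arithmetic can be sketched in a few lines of Python. This is a rough back-of-the-envelope calculation, not an AMD tool: it assumes the totals are split evenly across the eight GPUs and that "2TB" means 2,048GB, and the function and variable names are made up for illustration.

```python
# Platform totals quoted by AMD for the eight-GPU MI325X system.
NUM_GPUS = 8
platform = {
    "hbm3e_gb": 2 * 1024,   # "2TB", assumed here to mean 2,048 GB
    "mem_bw_tbps": 48.0,    # total memory bandwidth, TB/s
    "fp8_pflops": 20.8,     # FP8 performance, petaflops
    "fp16_pflops": 10.4,    # FP16 performance, petaflops
}


def per_gpu(totals, num_gpus):
    """Divide each platform-level total evenly across the GPUs (an assumption)."""
    return {name: value / num_gpus for name, value in totals.items()}


mi325x = per_gpu(platform, NUM_GPUS)

# MI300X memory figures as quoted in the article.
mi300x = {"hbm_gb": 192, "mem_bw_tbps": 5.3}

# Generation-over-generation ratios for memory capacity and bandwidth.
capacity_gain = mi325x["hbm3e_gb"] / mi300x["hbm_gb"]
bandwidth_gain = mi325x["mem_bw_tbps"] / mi300x["mem_bw_tbps"]

print(mi325x)
print(f"capacity gain: {capacity_gain:.2f}x, bandwidth gain: {bandwidth_gain:.2f}x")
```

Under those assumptions, each GPU works out to 256GB of memory and 6 TB/s of bandwidth, roughly a third more capacity and about 13% more bandwidth than the MI300X.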

The MI325X will begin shipping in systems from Dell Technologies, Lenovo, Supermicro, Hewlett Packard Enterprise, Gigabyte, and several other server vendors starting in the first quarter of next year, the company said.
