AMD Instinct MI350X vs Intel Gaudi 2

AMD Instinct MI350X

AMD Instinct MI350X

Intel Gaudi 2

16384 Shaders 288GB HBM3e 2200MHz	6144 Shaders 96GB HBM2e 550MHz
Peak AI Performance 18.45 PFLOPS FP4 Tensor Sparse	Peak AI Performance 865.08 TFLOPS FP8 Tensor (FP32 Accumulate)
FP32 144.18 TFLOPS	FP32 5.5 TFLOPS
FP16 144.18 TFLOPS	FP16 11 TFLOPS
Form Factor OAM Module -	Form Factor OAM Module -
TDP 1000W	TDP 600W
- - - - -	- - - - -

Highlights

Benchmarks

Geekbench 6

GB6 OpenCL N/A 0%	GB6 OpenCL N/A 0%
GB6 Metal N/A 0%	GB6 Metal N/A 0%
GB6 Vulkan N/A 0%	GB6 Vulkan N/A 0%

Geekbench 5

GB5 OpenCL N/A 0%	GB5 OpenCL N/A 0%
GB5 CUDA N/A 0%	GB5 CUDA N/A 0%
GB5 Metal N/A 0%	GB5 Metal N/A 0%
GB5 Vulkan N/A 0%	GB5 Vulkan N/A 0%

OctaneBench

OCT 2020.1 N/A 0%	OCT 2020.1 N/A 0%
OCT Metal N/A 0%	OCT Metal N/A 0%

Tech Specs

Theoretical Performance

Peak AI Performance 18.45 PFLOPS FP4 Tensor Sparse	Peak AI Performance 865.08 TFLOPS FP8 Tensor (FP32 Accumulate)
FP4 9.23 PFLOPS Tensor 18.45 PFLOPS Tensor Sparse	- - -
FP8 - 4.61 PFLOPS Tensor (FP16 Accumulate) 9.23 PFLOPS Tensor (FP16 Accumulate) Sparse 4.61 PFLOPS Tensor (FP32 Accumulate) 9.23 PFLOPS Tensor (FP32 Accumulate) Sparse	FP8 - - - 865.08 TFLOPS Tensor (FP32 Accumulate) -
FP16 144.18 TFLOPS 2.31 PFLOPS Tensor (FP16 Accumulate) 4.61 PFLOPS Tensor (FP16 Accumulate) Sparse 2.31 PFLOPS Tensor (FP32 Accumulate) 4.61 PFLOPS Tensor (FP32 Accumulate) Sparse	FP16 11 TFLOPS - - - -
FP32 144.18 TFLOPS 144.18 TFLOPS Tensor 288.36 TFLOPS Tensor Sparse	FP32 5.5 TFLOPS - -
FP64 72.09 TFLOPS 72.09 TFLOPS Tensor	- - -
BF16 - 2.31 PFLOPS Tensor 4.61 PFLOPS Tensor Sparse	BF16 11 TFLOPS 432.54 TFLOPS Tensor -
- - -	- - -
INT4 4.61 POPS Tensor 9.23 POPS Tensor Sparse	- - -
INT8 - 4.61 POPS Tensor 9.23 POPS Tensor Sparse	- - - -
- -	- -
- -	- -
Pixel Fillrate -	Pixel Fillrate -
- -	- -
Texture Fillrate -	Texture Fillrate -

Chip

Manufacturer AMD	Manufacturer Intel
Chip Designer AMD	Chip Designer Intel
Architecture CDNA 4	Architecture Gaudi 2
Family Instinct	Family Gaudi
Codename CDNA Next - - -	Codename Gaudi 2 HL-2080 Variant HL-2080
Market Segment Server	Market Segment Server
Release Date 6/12/2025	Release Date 5/1/2022

Fabrication

Foundry TSMC TSMC IO Die	Foundry TSMC -
Fabrication Node N3P N6 IO Die	Fabrication Node N7 -
Die Size 8x 181 mm² -	- - -
Transistor Count 8x 17.4 Billion 2x 23 Billion IO Die	- -
Transistor Density 96.13M/mm² -	- - -

Form

Form

OAM Module

Form

OAM Module

Core Configuration

Shading Units 16384 Shaders -	Shading Units 6144 Shaders -
Texture Mapping Units -	Texture Mapping Units -
Render Output Units -	Render Output Units -
Tensor Cores 1024 T-Cores	Tensor Cores 24 T-Cores
- -	- -
- -	- -
Compute Units 256 CUs	Compute Units 2 CUs
- -	- -
- -	- -

Clock Speeds

-

-

-

2200MHz

-

-

-

550MHz

Cache

- -	- -
L1 - - 16KB/CU -	L1 - - - Unknown
L2 16MB Shared	L2 Unknown
L3 256MB Shared -	- - -

Memory

288GB HBM3e ECC	96GB HBM2e ECC
Bus Width 8192Bit	Bus Width 6144Bit
Clock 3906MHz Transfer Rate 7.8GT/s Bandwidth 8000.1GB/s	Clock 1596MHz Transfer Rate 3.2GT/s Bandwidth 2451.5GB/s
- - - - - - - - -	- - - - eSRAM 48MB 6400GB/s - -

Power & Thermals

TDP 1000W	TDP 600W
- -	- -

Ports

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

No Ports

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

No Ports

Video Output

Max Resolution Unknown	Max Resolution Unknown
Max Resolution Refresh Rate -	Max Resolution Refresh Rate -
Variable Refresh Rate - - -	Variable Refresh Rate - - -
Display Stream Compression (DSC) Not Supported	Display Stream Compression (DSC) Not Supported
Multi Monitor Support Unknown	Multi Monitor Support Unknown
- -	- -

Video Encoder

Model 4x VCN 4.0	No Encoders -
Codec - - - - - - VP9 - AVC (H.264) HEVC (H.265) - AV1 - -	- - - - - - - - - - - - - - -

Video Decoder

Model 4x VCN 4.0	No Decoders
Codec MPEG-1 MPEG-2 MPEG-4 JPEG VC-1 - VP9 - AVC (H.264) HEVC (H.265) - AV1 - -	- - - - - - - - - - - - - -

API Support

- - - -	- - - -
- - OpenCL 3.0 - -	- - OpenCL 3.0 - -
- - - - GFX 9.4 - - - -	- - - - - - - - - -

Card

- - - -	Not a Card - - -
- - - - - - - -	- - - - - - - -
- - PCIe Version 5.0 PCIe Lanes 16	- - PCIe Version 4.0 PCIe Lanes 16
Multi GPU Support Supported Type Infinity Fabric	Multi GPU Support Supported Type RoCE
- - - - - -	- - - - - -

Competitors

AMD Instinct MI350X

AMD Instinct MI300X

AMD Instinct MI350X vs AMD Instinct MI300X

AMD Instinct MI350X

AMD Instinct MI325X

AMD Instinct MI350X vs AMD Instinct MI325X

AMD Instinct MI350X

AMD Instinct MI355X

AMD Instinct MI350X vs AMD Instinct MI355X

Change Comparison

Copy Link