AMD Instinct MI200 vs NVIDIA A30

AMD Instinct MI200	NVIDIA A30

6656 Shaders, 64GB HBM2e, 1700MHz	3584 Shaders, 24GB HBM2, 1440MHz
Peak AI Performance 181.04 TOPS INT4 Tensor	Peak AI Performance 1.32 POPS INT4 Tensor Sparse
FP32 22.63 TFLOPS	FP32 10.32 TFLOPS
FP16 45.26 TFLOPS	FP16 41.29 TFLOPS
Form Factor OAM Module	Form Factor PCIe Card, 2 Slots
TDP 300W	TDP 165W

Benchmarks

Geekbench 6

GB6 OpenCL N/A	GB6 OpenCL 128,100

Tech Specs

Theoretical Performance

Peak AI Performance 181.04 TOPS INT4 Tensor	Peak AI Performance 1.32 POPS INT4 Tensor Sparse
FP16 45.26 TFLOPS; 181.04 TFLOPS Tensor (FP16 Accumulate); 181.04 TFLOPS Tensor (FP32 Accumulate)	FP16 41.29 TFLOPS; 165.15 TFLOPS Tensor (FP16 Accumulate); 330.3 TFLOPS Tensor (FP16 Accumulate) Sparse; 165.15 TFLOPS Tensor (FP32 Accumulate); 330.3 TFLOPS Tensor (FP32 Accumulate) Sparse
FP32 22.63 TFLOPS; 45.26 TFLOPS Tensor	FP32 10.32 TFLOPS
FP64 22.63 TFLOPS; 45.26 TFLOPS Tensor	FP64 5.16 TFLOPS; 10.32 TFLOPS Tensor
BF16 181.04 TFLOPS Tensor	BF16 20.64 TFLOPS; 165.15 TFLOPS Tensor; 330.3 TFLOPS Tensor Sparse
TF32 -	TF32 82.58 TFLOPS Tensor; 165.15 TFLOPS Tensor Sparse
INT4 181.04 TOPS Tensor	INT4 660.6 TOPS Tensor; 1.32 POPS Tensor Sparse
INT8 181.04 TOPS Tensor	INT8 330.3 TOPS Tensor; 660.6 TOPS Tensor Sparse
INT32 -	INT32 10.32 TOPS
Pixel Fillrate -	Pixel Fillrate 138.24 GPixel/s
Texture Fillrate 707.2 GTexel/s	Texture Fillrate 322.56 GTexel/s
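
The non-tensor FP32 and fillrate figures above appear to follow directly from the unit counts and the higher listed clock of each card. The sketch below reproduces them under three assumptions that the page does not state: each shader retires one fused multiply-add (2 FLOPs) per clock, each TMU samples one texel per clock, and each ROP writes one pixel per clock. The function names are illustrative only.

```python
def peak_fp32_tflops(shaders: int, clock_mhz: float, flops_per_clock: int = 2) -> float:
    """Peak FP32 throughput in TFLOPS, assuming one FMA (2 FLOPs) per shader per clock."""
    return shaders * flops_per_clock * clock_mhz / 1e6


def fillrate_g_per_s(units: int, clock_mhz: float) -> float:
    """Fillrate in G(texels|pixels)/s, assuming one texel or pixel per unit per clock."""
    return units * clock_mhz / 1e3


# Unit counts and clocks are taken from the spec rows on this page.
mi200 = {"shaders": 6656, "tmus": 416, "clock_mhz": 1700}
a30 = {"shaders": 3584, "tmus": 224, "rops": 96, "clock_mhz": 1440}

mi200_fp32 = round(peak_fp32_tflops(mi200["shaders"], mi200["clock_mhz"]), 2)
a30_fp32 = round(peak_fp32_tflops(a30["shaders"], a30["clock_mhz"]), 2)
mi200_texel = round(fillrate_g_per_s(mi200["tmus"], mi200["clock_mhz"]), 2)
a30_texel = round(fillrate_g_per_s(a30["tmus"], a30["clock_mhz"]), 2)
a30_pixel = round(fillrate_g_per_s(a30["rops"], a30["clock_mhz"]), 2)

print(mi200_fp32, a30_fp32, mi200_texel, a30_texel, a30_pixel)
```

These are theoretical ceilings; sustained throughput depends on the workload, memory bandwidth, and power limits.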

Chip

Manufacturer AMD	Manufacturer NVIDIA
Chip Designer AMD	Chip Designer NVIDIA
Architecture CDNA 2	Architecture Ampere
Family Instinct	Family Server
Codename Aldebaran, Aldebaran XL; Variant Aldebaran XL	Codename NV170, GA100
Market Segment Server	Market Segment Server
Release Date 3/22/2022	Release Date 4/12/2021

Fabrication

Foundry TSMC	Foundry TSMC
Fabrication Node N6	Fabrication Node 7N
Die Size 724 mm²	Die Size 826 mm²
Transistor Count 28 Billion	Transistor Count 54.2 Billion
Transistor Density 38.67M/mm²	Transistor Density 65.62M/mm²

Form

Form OAM Module	Form PCIe Card

Core Configuration

Shading Units 6656 Shaders	Shading Units 3584 Shaders
Texture Mapping Units 416 TMUs	Texture Mapping Units 224 TMUs
Render Output Units -	Render Output Units 96 ROPs
Tensor Cores 416 T-Cores	Tensor Cores 224 T-Cores
Streaming Multiprocessors -	Streaming Multiprocessors 56 SMs
Compute Units 104 CUs	Compute Units -

Clock Speeds

1000MHz Base, 1700MHz	930MHz Base, 1440MHz

Cache

L1 16KB/CU	L1 64KB/SM, Tex 192KB/SM
L2 8MB Shared	L2 24MB Shared

Memory

64GB HBM2e ECC	24GB HBM2
Bus Width 4096-bit	Bus Width 3072-bit
Clock 1600MHz, Transfer Rate 3.2GT/s, Bandwidth 1638.4GB/s	Clock 1215MHz, Transfer Rate 2.4GT/s, Bandwidth 933.1GB/s
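
The bandwidth figures can be checked from the bus width and memory clock. The sketch below assumes double-data-rate signalling (two transfers per memory clock), which is not stated on the page but matches the listed transfer rates; the function name is illustrative. It also suggests that the A30's listed 2.4GT/s is a rounded 2.43GT/s, since only the unrounded value reproduces 933.1GB/s.

```python
def memory_bandwidth_gb_s(bus_width_bits: int, memory_clock_mhz: float,
                          transfers_per_clock: int = 2) -> float:
    """Peak bandwidth in GB/s: bytes per transfer times transfers per second.

    transfers_per_clock=2 assumes double data rate (an assumption, not from the page).
    """
    transfer_rate_gt_s = memory_clock_mhz * transfers_per_clock / 1e3
    return bus_width_bits / 8 * transfer_rate_gt_s


# Bus widths and clocks are taken from the Memory rows on this page.
mi200_bw = round(memory_bandwidth_gb_s(4096, 1600), 1)
a30_bw = round(memory_bandwidth_gb_s(3072, 1215), 1)

# Using the rounded 2.4 GT/s directly gives a lower figure than the page lists.
a30_bw_from_rounded_rate = round(3072 / 8 * 2.4, 1)

print(mi200_bw, a30_bw, a30_bw_from_rounded_rate)
```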

Power & Thermals

TDP 300W	TDP 165W

Ports

No Ports	No Ports

Video Output

Max Resolution Unknown	Max Resolution Unknown
Max Resolution Refresh Rate -	Max Resolution Refresh Rate -
Variable Refresh Rate -	Variable Refresh Rate G-Sync, FreeSync
Display Stream Compression (DSC) Not Supported	Display Stream Compression (DSC) Not Supported
Multi Monitor Support Unknown	Multi Monitor Support Unknown

Video Encoder

Model VCN 2.6	No Encoders
Codec AVC (H.264), HEVC (H.265)	Codec -

Video Decoder

No Decoders	Model 5x NVDEC 4
Codec -	Codec MPEG-1, MPEG-2, MPEG-4, VC-1, VP8, VP9, AVC (H.264), HEVC (H.265)

API Support

OpenCL 3.0	OpenCL 3.0, Vulkan 1.2
GFX 9.4	CUDA 8.0, PureVideo HD VP10, VDPAU Feature Set J

Card

Slots Required -, PCIe Version 4.0, PCIe Lanes 16	Slots Required 2.0, PCIe Version 4.0, PCIe Lanes 16
Multi GPU Support Supported, Type Infinity Fabric	Multi GPU Support Supported, Type NVLink
Height -, Width -, Depth -	Height 111 mm (4.37 in), Width 267 mm (10.51 in), Depth 40 mm (1.57 in)

Competitors

NVIDIA A30 vs NVIDIA GRID RTX T10-2
NVIDIA A30 vs NVIDIA GRID RTX T10-4
NVIDIA A30 vs NVIDIA GRID RTX T10-8
NVIDIA A30 vs NVIDIA GRID RTX T10-16
NVIDIA A30 vs AMD Radeon Pro V520
