GPUs

NVIDIA A40 vs AMD Instinct MI300P Full Specs

10,752 Shaders
1.74GHz
9,728 Shaders
2.1GHz
48GB GDDR6347.9GB/s
64GB HBM32.66TB/s
··
37.42 TFLOPS
··
81.72 TFLOPS
Form Factor
PCIe Card
Form Factor
OAM Module
TDP
300W
TDP
300W
Power Connectors
1x 8-Pin EPS
Power Connectors
-

A40A401.2 POPSINT4 Tensor Sparse
x1
Instinct MI300PInstinct MI300P2.62 POPSINT4 Tensor Sparse
x2.18

Clock Speed
···
Clock Speed
···
Peak OPS
1.2 POPSINT4 Tensor Sparse
Peak OPS
2.62 POPSINT4 Tensor Sparse
-
Tensor FP8-16
1.31 PFLOPS
FP8-16 Tensor Sparse
2.62 PFLOPS
Tensor FP8-32
1.31 PFLOPS
FP8-32 Tensor Sparse
2.62 PFLOPS
Tensor FP16-16
149.7 TFLOPS
FP16-16 Tensor Sparse
299.3 TFLOPS
Tensor FP16-32
74.83 TFLOPS
FP16-32 Tensor Sparse
149.7 TFLOPS
Tensor FP16-16
653.7 TFLOPS
FP16-16 Tensor Sparse
1.31 PFLOPS
Tensor FP16-32
653.7 TFLOPS
FP16-32 Tensor Sparse
1.31 PFLOPS
BF16
37.42 TFLOPS
Tensor BF16
74.83 TFLOPS
BF16 Tensor Sparse
149.7 TFLOPS
BF16
-
Tensor BF16
653.7 TFLOPS
BF16 Tensor Sparse
1.31 PFLOPS
Tensor TF32
37.42 TFLOPS
Tensor TF32
326.9 TFLOPS
FP32
37.42 TFLOPS
Tensor FP32
-
FP32
81.72 TFLOPS
Tensor FP32
81.72 TFLOPS
FP64
584.6 GFLOPS
Tensor FP64
-
FP64
40.86 TFLOPS
Tensor FP64
81.72 TFLOPS
Tensor INT4
598.7 TOPS
Tensor INT4
1.31 POPS
Tensor INT8
299.3 TOPS
Tensor INT8
1.31 POPS
Ray
56.43 TOPS
Ray
-
Pixel Rate
194.9 GPixel/s
Pixel Rate
-
Texture Rate
584.6 GTexel/s
Texture Rate
1.28 TTexel/s

Shaders
10,752 Shaders
Shaders
9,728 Shaders
TMUs
336 TMUs
TMUs
608 TMUs
ROPs
112 ROPs
ROPs
-
Tensor Cores
336 T-Cores
Tensor Cores
608 T-Cores
RT Cores
84 RT-Cores
RT Cores
-
SMs
84 SMs
CUs
152 CUs

Base Clock
1.3GHz
Base Clock
1GHz
Boost Clock
1.74GHz
Boost Clock
2.1GHz

L2 Cache
6.1MB shared
L2 Cache
16.4MB shared
L3 Cache
-
L3 Cache
128MB shared

48GB GDDR6
64GB HBM3
Memory Bus
384-bit
Memory Bus
4096-bit
Memory Speed
7.2GT/s
Memory Speed
5.2GT/s
Memory Bandwidth
347.9GB/s
Memory Bandwidth
2.66TB/s
ECC
-
ECC
Yes

TDP
300W
TDP
300W

Multi-Monitor
3
Multi-Monitor
-
HDCP
HDCP 2.3
HDCP
-

3x DisplayPort 1.3
-

Encoder Model
NVENC 7
Encoder Model
2x VCN 2.6

Decoder Model
NVDEC 5
Decoder Model
2x VCN 2.6

Form Factor
PCIe Card
Form Factor
OAM Module
PCIe
2-Slots
PCIe
-
Height
112 mm (4.41")
Width
267 mm (10.51")
Depth
40 mm (1.57")
-
Cooling
Passive
Cooling
Passive
Power Connectors
1x 8-Pin EPS
Power Connectors
-

Manufacturer
Manufacturer
Chip Designer
Chip Designer
Architecture
Architecture
Family
Family
Branding
A40 branding
Branding
Instinct branding
Codename
NV172
Codename
Aqua Vanjaram
Chip Variant
GA102-200-KD-A1
Chip Variant
Aqua Vanjaram XL
Market Segment
Server
Market Segment
Server
Release Date
Oct 5, 2020
Release Date
Dec 6, 2023

Foundry
Samsung
Foundry
TSMC
Other Foundries
-
Other Foundries
TSMCActive Interposer
Fabrication Node
8N
-
Fabrication Node
N5
N6Active Interposer
Die Size
628mm²
Die Size
740mm²
Transistor Count
28.3 Billion
Transistor Count
73 Billion
Transistor Density
45.04 MTr/mm²
Transistor Density
98.65 MTr/mm²

No images available
No images available