NVIDIA A40 vs AMD Instinct MI300P Full Specs
10,752 Shaders 1.74GHz | 9,728 Shaders 2.1GHz |
48GB GDDR6, 347.9GB/s | 64GB HBM3, 2.66TB/s |
FP32 37.42 TFLOPS | FP32 81.72 TFLOPS |
Form Factor PCIe Card | Form Factor OAM Module |
TDP 300W | TDP 300W |
Power Connectors 1x 8-Pin EPS | Power Connectors - |
Peak OPS 1.2 POPS (INT4 Tensor Sparse) | Peak OPS 2.62 POPS (INT4 Tensor Sparse) |
- | Tensor FP8-16 1.31 PFLOPS; FP8-16 Tensor Sparse 2.62 PFLOPS; Tensor FP8-32 1.31 PFLOPS; FP8-32 Tensor Sparse 2.62 PFLOPS |
Tensor FP16-16 149.7 TFLOPS; FP16-16 Tensor Sparse 299.3 TFLOPS; Tensor FP16-32 74.83 TFLOPS; FP16-32 Tensor Sparse 149.7 TFLOPS | Tensor FP16-16 653.7 TFLOPS; FP16-16 Tensor Sparse 1.31 PFLOPS; Tensor FP16-32 653.7 TFLOPS; FP16-32 Tensor Sparse 1.31 PFLOPS |
BF16 37.42 TFLOPS; Tensor BF16 74.83 TFLOPS; BF16 Tensor Sparse 149.7 TFLOPS | BF16 -; Tensor BF16 653.7 TFLOPS; BF16 Tensor Sparse 1.31 PFLOPS |
Tensor TF32 37.42 TFLOPS | Tensor TF32 326.9 TFLOPS |
FP32 37.42 TFLOPS; Tensor FP32 - | FP32 81.72 TFLOPS; Tensor FP32 81.72 TFLOPS |
FP64 584.6 GFLOPS; Tensor FP64 - | FP64 40.86 TFLOPS; Tensor FP64 81.72 TFLOPS |
Tensor INT4 598.7 TOPS | Tensor INT4 1.31 POPS |
Tensor INT8 299.3 TOPS | Tensor INT8 1.31 POPS |
Ray 56.43 TOPS | Ray - |
Pixel Rate 194.9 GPixel/s | Pixel Rate - |
Texture Rate 584.6 GTexel/s | Texture Rate 1.28 TTexel/s |
Shaders 10,752 Shaders | Shaders 9,728 Shaders |
TMUs 336 TMUs | TMUs 608 TMUs |
ROPs 112 ROPs | ROPs - |
Tensor Cores 336 T-Cores | Tensor Cores 608 T-Cores |
RT Cores 84 RT-Cores | RT Cores - |
SMs 84 SMs | CUs 152 CUs |
Base Clock 1.3GHz | Base Clock 1GHz |
Boost Clock 1.74GHz | Boost Clock 2.1GHz |
L2 Cache 6.1MB shared | L2 Cache 16.4MB shared |
L3 Cache - | L3 Cache 128MB shared |
Memory 48GB GDDR6 | Memory 64GB HBM3 |
Memory Bus 384-bit | Memory Bus 4096-bit |
Memory Speed 7.2GT/s | Memory Speed 5.2GT/s |
Memory Bandwidth 347.9GB/s | Memory Bandwidth 2.66TB/s |
ECC - | ECC Yes |
TDP 300W | TDP 300W |
Multi-Monitor 3 | Multi-Monitor - |
HDCP HDCP 2.3 | HDCP - |
Display Outputs 3x DisplayPort 1.3 | Display Outputs - |
Encoder Model NVENC 7 | Encoder Model 2x VCN 2.6 |
Decoder Model NVDEC 5 | Decoder Model 2x VCN 2.6 |
Form Factor PCIe Card | Form Factor OAM Module |
PCIe 2-Slots | PCIe - |
Height 112 mm (4.41"); Width 267 mm (10.51"); Depth 40 mm (1.57") | - |
Cooling Passive | Cooling Passive |
Power Connectors 1x 8-Pin EPS | Power Connectors - |
Codename NV172 | Codename Aqua Vanjaram |
Chip Variant GA102-200-KD-A1 | Chip Variant Aqua Vanjaram XL |
Market Segment Server | Market Segment Server |
Release Date Oct 5, 2020 | Release Date Dec 6, 2023 |
Foundry Samsung | Foundry TSMC |
Other Foundries - | Other Foundries TSMC (Active Interposer) |
Fabrication Node 8N | Fabrication Node N5; N6 (Active Interposer) |
Die Size 628mm² | Die Size 740mm² |
Transistor Count 28.3 Billion | Transistor Count 73 Billion |
Transistor Density 45.04 MTr/mm² | Transistor Density 98.65 MTr/mm² |
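Several of the peak figures above can be roughly cross-checked from the unit counts and clocks listed in the same table. The sketch below is not how the page derived its numbers. It assumes the usual rules of thumb: 2 FLOPs per shader per clock (one fused multiply-add), bandwidth as bus width times transfer rate divided by 8, and one texel per TMU per clock. The function names and the `flops_per_clock` parameter are illustrative only.

```python
# Rough cross-check of the listed peak figures from the table's own inputs.
# Assumptions (not stated by the source): 2 FLOPs per shader per clock (FMA),
# bandwidth = bus width (bits) * transfer rate (GT/s) / 8,
# texture rate = TMUs * boost clock.

def fp32_tflops(shaders, boost_ghz, flops_per_clock=2):
    """Peak TFLOPS from shader count and boost clock in GHz."""
    return shaders * flops_per_clock * boost_ghz / 1000.0

def bandwidth_gbs(bus_bits, speed_gts):
    """Peak memory bandwidth in GB/s from bus width and transfer rate."""
    return bus_bits * speed_gts / 8.0

def texture_rate_gtexels(tmus, boost_ghz):
    """Peak texture fill rate in GTexel/s."""
    return tmus * boost_ghz

# Inputs copied from the table above.
gpus = {
    "NVIDIA A40": {"shaders": 10752, "boost": 1.74, "bus": 384, "speed": 7.2, "tmus": 336},
    "AMD Instinct MI300P": {"shaders": 9728, "boost": 2.1, "bus": 4096, "speed": 5.2, "tmus": 608},
}

for name, g in gpus.items():
    print(name)
    print(f"  FP32 (2 FLOPs/clock): {fp32_tflops(g['shaders'], g['boost']):.2f} TFLOPS")
    print(f"  Memory bandwidth:     {bandwidth_gbs(g['bus'], g['speed']):.1f} GB/s")
    print(f"  Texture rate:         {texture_rate_gtexels(g['tmus'], g['boost']):.1f} GTexel/s")
```

Under these assumptions the A40 comes out at about 37.42 TFLOPS, matching its listed FP32 figure. The MI300P comes out at about 40.86 TFLOPS, which matches its listed FP64 figure. Its listed FP32 figure of 81.72 TFLOPS is exactly double, so it appears to count 4 FLOPs per shader per clock. The computed bandwidths are 345.6 GB/s and 2662.4 GB/s. The second agrees with the listed 2.66TB/s. The first is slightly below the listed 347.9GB/s, possibly because the listed memory speed is rounded.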



