AMD Instinct MI300P vs NVIDIA A40 Full Specs
9,728 Shaders 2.1GHz | 10,752 Shaders 1.74GHz |
64GB HBM3, 2.66TB/s | 48GB GDDR6, 347.9GB/s |
FP32 81.72 TFLOPS | FP32 37.42 TFLOPS |
Form Factor OAM Module | Form Factor PCIe Card |
TDP 300W | TDP 300W |
Power Connectors - | Power Connectors 1x 8-Pin EPS |
Peak OPS 2.62 POPS (INT4 Tensor Sparse) | Peak OPS 1.2 POPS (INT4 Tensor Sparse) |
Tensor FP8-16 1.31 PFLOPS | Tensor FP8-16 - |
FP8-16 Tensor Sparse 2.62 PFLOPS | FP8-16 Tensor Sparse - |
Tensor FP8-32 1.31 PFLOPS | Tensor FP8-32 - |
FP8-32 Tensor Sparse 2.62 PFLOPS | FP8-32 Tensor Sparse - |
Tensor FP16-16 653.7 TFLOPS | Tensor FP16-16 149.7 TFLOPS |
FP16-16 Tensor Sparse 1.31 PFLOPS | FP16-16 Tensor Sparse 299.3 TFLOPS |
Tensor FP16-32 653.7 TFLOPS | Tensor FP16-32 74.83 TFLOPS |
FP16-32 Tensor Sparse 1.31 PFLOPS | FP16-32 Tensor Sparse 149.7 TFLOPS |
BF16 - | BF16 37.42 TFLOPS |
Tensor BF16 653.7 TFLOPS | Tensor BF16 74.83 TFLOPS |
BF16 Tensor Sparse 1.31 PFLOPS | BF16 Tensor Sparse 149.7 TFLOPS |
Tensor TF32 326.9 TFLOPS | Tensor TF32 37.42 TFLOPS |
FP32 81.72 TFLOPS | FP32 37.42 TFLOPS |
Tensor FP32 81.72 TFLOPS | Tensor FP32 - |
FP64 40.86 TFLOPS | FP64 584.6 GFLOPS |
Tensor FP64 81.72 TFLOPS | Tensor FP64 - |
Tensor INT4 1.31 POPS | Tensor INT4 598.7 TOPS |
Tensor INT8 1.31 POPS | Tensor INT8 299.3 TOPS |
Ray - | Ray 56.43 TOPS |
Pixel Rate - | Pixel Rate 194.9 GPixel/s |
Texture Rate 1.28 TTexel/s | Texture Rate 584.6 GTexel/s |
Shaders 9,728 Shaders | Shaders 10,752 Shaders |
TMUs 608 TMUs | TMUs 336 TMUs |
ROPs - | ROPs 112 ROPs |
Tensor Cores 608 T-Cores | Tensor Cores 336 T-Cores |
RT Cores - | RT Cores 84 RT-Cores |
CUs 152 CUs | SMs 84 SMs |
Base Clock 1GHz | Base Clock 1.3GHz |
Boost Clock 2.1GHz | Boost Clock 1.74GHz |
L2 Cache 16.4MB shared | L2 Cache 6.1MB shared |
L3 Cache 128MB shared | L3 Cache - |
Memory 64GB HBM3 | Memory 48GB GDDR6 |
Memory Bus 4096-bit | Memory Bus 384-bit |
Memory Speed 5.2GT/s | Memory Speed 7.2GT/s |
Memory Bandwidth 2.66TB/s | Memory Bandwidth 347.9GB/s |
ECC Yes | ECC - |
TDP 300W | TDP 300W |
Multi-Monitor - | Multi-Monitor 3 |
HDCP - | HDCP HDCP 2.3 |
Display Outputs - | Display Outputs 3x DisplayPort 1.3 |
Encoder Model 2x VCN 2.6 | Encoder Model NVENC 7 |
Decoder Model 2x VCN 2.6 | Decoder Model NVDEC 5 |
Form Factor OAM Module | Form Factor PCIe Card |
PCIe - | PCIe 2-Slots |
Height - | Height 112 mm (4.41") |
Width - | Width 267 mm (10.51") |
Depth - | Depth 40 mm (1.57") |
Cooling Passive | Cooling Passive |
Power Connectors - | Power Connectors 1x 8-Pin EPS |
Manufacturer AMD | Manufacturer NVIDIA |
Chip Designer AMD | Chip Designer NVIDIA |
Architecture CDNA 3 | Architecture Ampere |
Family | Family |
Codename Aqua Vanjaram | Codename NV172 |
Chip Variant Aqua Vanjaram XL | Chip Variant GA102-200-KD-A1 |
Market Segment Server | Market Segment Server |
Release Date Dec 6, 2023 | Release Date Oct 5, 2020 |
Foundry TSMC | Foundry Samsung |
Other Foundries TSMC (Active Interposer) | Other Foundries - |
Fabrication Node N5, N6 (Active Interposer) | Fabrication Node 8N |
Die Size 740mm² | Die Size 628mm² |
Transistor Count 73 Billion | Transistor Count 28.3 Billion |
Transistor Density 98.65 MTr/mm² | Transistor Density 45.04 MTr/mm² |
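
Several rows in the table are derived from others, so they can be cross-checked with simple arithmetic. The sketch below is not from the source page: the function names are hypothetical, and the formulas are the usual rules of thumb (bandwidth = bus width x transfer rate / 8, texture rate = TMUs x boost clock, density = transistors / die area). All inputs are the values listed in the table above.

```python
# Cross-check derived table rows from the raw ones.
# Function names and formulas are illustrative assumptions, not taken from the source page.

def memory_bandwidth_gbs(bus_width_bits: int, speed_gts: float) -> float:
    """Peak memory bandwidth in GB/s: bits per transfer x GT/s, divided by 8 bits per byte."""
    return bus_width_bits * speed_gts / 8

def texture_rate_gtexels(tmus: int, boost_ghz: float) -> float:
    """Peak texture rate in GTexel/s, assuming one texel per TMU per clock."""
    return tmus * boost_ghz

def transistor_density_mtr_mm2(transistors_billion: float, die_mm2: float) -> float:
    """Transistor density in millions of transistors per square millimetre."""
    return transistors_billion * 1000 / die_mm2

# Raw values copied from the table above.
cards = {
    "MI300P": {"bus": 4096, "gts": 5.2, "tmus": 608, "boost": 2.1, "tr_b": 73.0, "die": 740.0},
    "A40": {"bus": 384, "gts": 7.2, "tmus": 336, "boost": 1.74, "tr_b": 28.3, "die": 628.0},
}

for name, c in cards.items():
    print(
        f"{name}: "
        f"{memory_bandwidth_gbs(c['bus'], c['gts']):.1f} GB/s, "
        f"{texture_rate_gtexels(c['tmus'], c['boost']):.1f} GTexel/s, "
        f"{transistor_density_mtr_mm2(c['tr_b'], c['die']):.2f} MTr/mm²"
    )
```

With the listed inputs, the MI300P figures reproduce the table after rounding (2.66TB/s, 1.28 TTexel/s, 98.65 MTr/mm²). The A40 texture rate also matches, while its computed bandwidth (345.6 GB/s) and density (about 45.06 MTr/mm²) come out slightly different from the listed 347.9GB/s and 45.04 MTr/mm², which may simply reflect rounding in the listed memory speed and die size.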