GPUs

NVIDIA A100 40GB vs NVIDIA RTX PRO 5000 Blackwell Full Specs

6,912 Shaders
1.41GHz
14,080 Shaders
2.62GHz
40GB HBM2e1.55TB/s
48GB GDDR71.34TB/s
··
19.49 TFLOPS
··
73.69 TFLOPS
Form Factor
PCIe Card
Form Factor
PCIe Card
TDP
250W
TDP
300W
Power Connectors
-
1x 8-Pin EPS
Power Connectors
1x 16-Pin 12VHPWR
-

A100 40GBA100 40GB2.5 POPSINT4 Tensor Sparse
x1.06
RTX PRO 5000 BlackwellRTX PRO 5000 Blackwell2.36 PFLOPSFP4 Tensor Sparse
x1

Clock Speed
···
Clock Speed
··
Peak OPS
2.5 POPSINT4 Tensor Sparse
Peak OPS
2.36 PFLOPSFP4 Tensor Sparse
-
Tensor FP4
1.18 PFLOPS
FP4 Tensor Sparse
2.36 PFLOPS
-
Tensor FP8-16
589.6 TFLOPS
FP8-16 Tensor Sparse
1.18 PFLOPS
Tensor FP8-32
294.8 TFLOPS
FP8-32 Tensor Sparse
589.6 TFLOPS
Tensor FP16-16
311.9 TFLOPS
FP16-16 Tensor Sparse
623.7 TFLOPS
Tensor FP16-32
311.9 TFLOPS
FP16-32 Tensor Sparse
623.7 TFLOPS
Tensor FP16-16
294.8 TFLOPS
FP16-16 Tensor Sparse
589.6 TFLOPS
Tensor FP16-32
147.4 TFLOPS
FP16-32 Tensor Sparse
294.8 TFLOPS
BF16
38.98 TFLOPS
Tensor BF16
311.9 TFLOPS
BF16 Tensor Sparse
623.7 TFLOPS
BF16
73.69 TFLOPS
Tensor BF16
147.4 TFLOPS
BF16 Tensor Sparse
294.8 TFLOPS
Tensor TF32
155.9 TFLOPS
Tensor TF32
73.69 TFLOPS
FP32
19.49 TFLOPS
FP32
73.69 TFLOPS
FP64
9.75 TFLOPS
Tensor FP64
19.49 TFLOPS
FP64
1.15 TFLOPS
Tensor FP64
-
Tensor INT4
1.25 POPS
Tensor INT4
-
Tensor INT8
623.7 TOPS
Tensor INT8
589.6 TOPS
Ray
-
Ray
223.5 TOPS
Pixel Rate
225.6 GPixel/s
Pixel Rate
460.6 GPixel/s
Texture Rate
609.1 GTexel/s
Texture Rate
1.15 TTexel/s

Shaders
6,912 Shaders
Shaders
14,080 Shaders
TMUs
432 TMUs
TMUs
440 TMUs
ROPs
160 ROPs
ROPs
176 ROPs
Tensor Cores
432 T-Cores
Tensor Cores
440 T-Cores
RT Cores
-
RT Cores
110 RT-Cores
SMs
108 SMs
SMs
110 SMs

Base Clock
765MHz
Base Clock
-
Boost Clock
1.41GHz
Boost Clock
2.62GHz

L2 Cache
41MB shared
L2 Cache
98.3MB shared

40GB HBM2e
48GB GDDR7
Memory Bus
5120-bit
Memory Bus
384-bit
Memory Speed
2.4GT/s
Memory Speed
28GT/s
Memory Bandwidth
1.55TB/s
Memory Bandwidth
1.34TB/s
ECC
-
ECC
Yes

TDP
250W
TDP
300W

Multi-Monitor
-
Multi-Monitor
4
HDCP
-
HDCP
HDCP 2.3

-
4x DisplayPort 2.1

-
Encoder Model
2x NVENC 9

Decoder Model
5x NVDEC 4
Decoder Model
2x NVDEC 6

Form Factor
PCIe Card
Form Factor
PCIe Card
PCIe
2-Slots
PCIe
2-Slots
Height
111 mm (4.37")
Width
267 mm (10.51")
Depth
40 mm (1.57")
Height
111.8 mm (4.4")
Width
266.7 mm (10.5")
Depth
40 mm (1.57")
Cooling
Passive
-
Cooling
Blower
1x Fan
Power Connectors
1x 8-Pin EPS
Power Connectors
1x 16-Pin 12VHPWR

Manufacturer
Manufacturer
Chip Designer
Chip Designer
Architecture
Architecture
Family
Family
Branding
A100 40GB branding
Branding
RTX branding
Codename
NV170
Codename
NV192
Chip Variant
-
Chip Variant
GB202-250-A1
Market Segment
Server
Market Segment
Workstation
Release Date
Jun 28, 2021
Release Date
Mar 18, 2025

Foundry
TSMC
Foundry
TSMC
Fabrication Node
7N
Fabrication Node
4NP
Die Size
826mm²
Die Size
762mm²
Transistor Count
54.2 Billion
Transistor Count
92.2 Billion
Transistor Density
65.62 MTr/mm²
Transistor Density
121.1 MTr/mm²

No images available
No images available