NVIDIA RTX PRO 4000 Blackwell SFF vs NVIDIA RTX A400

NVIDIA RTX PRO 4000 Blackwell SFF
NVIDIA RTX A400
8960 Shaders
24GB GDDR7
1337MHz
768 Shaders
4GB GDDR6
1760MHz
Peak AI Performance
766.69 TFLOPS
FP4 Tensor Sparse
Peak AI Performance
86.51 TOPS
INT4 Tensor Sparse
FP32
23.96 TFLOPS
FP32
2.7 TFLOPS
FP16
23.96 TFLOPS
FP16
2.7 TFLOPS
Form Factor
PCIe Card
1.0-Slots
Form Factor
PCIe Card
1.0-Slots
TDP
70W
TDP
50W
-
-
-
-
-
-
-
-
-
-
GB6 OpenCL N/A
0%
GB6 Metal N/A
0%
GB6 Metal N/A
0%
GB6 Vulkan N/A
0%
GB5 OpenCL N/A
0%
GB5 OpenCL N/A
0%
GB5 CUDA N/A
0%
GB5 CUDA N/A
0%
GB5 Metal N/A
0%
GB5 Metal N/A
0%
GB5 Vulkan N/A
0%
GB5 Vulkan N/A
0%
OCT 2020.1 N/A
0%
OCT 2020.1 N/A
0%
OCT Metal N/A
0%
OCT Metal N/A
0%
Peak AI Performance
766.69 TFLOPS
FP4 Tensor Sparse
Peak AI Performance
86.51 TOPS
INT4 Tensor Sparse
FP4
383.35 TFLOPS Tensor
766.69 TFLOPS Tensor Sparse
-
-
-
FP8
-
191.67 TFLOPS Tensor (FP16 Accumulate)
383.34 TFLOPS Tensor (FP16 Accumulate) Sparse
191.67 TFLOPS Tensor (FP32 Accumulate)
383.34 TFLOPS Tensor (FP32 Accumulate) Sparse
-
-
-
-
-
-
FP16
23.96 TFLOPS
95.84 TFLOPS Tensor (FP16 Accumulate)
191.67 TFLOPS Tensor (FP16 Accumulate) Sparse
95.84 TFLOPS Tensor (FP32 Accumulate)
191.67 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
2.7 TFLOPS
10.81 TFLOPS Tensor (FP16 Accumulate)
21.63 TFLOPS Tensor (FP16 Accumulate) Sparse
10.81 TFLOPS Tensor (FP32 Accumulate)
21.63 TFLOPS Tensor (FP32 Accumulate) Sparse
FP32
23.96 TFLOPS
-
-
FP32
2.7 TFLOPS
-
-
FP64
370 GFLOPS
-
FP64
40 GFLOPS
-
BF16
23.96 TFLOPS
95.84 TFLOPS Tensor
191.67 TFLOPS Tensor Sparse
BF16
2.7 TFLOPS
10.81 TFLOPS Tensor
21.63 TFLOPS Tensor Sparse
TF32
47.92 TFLOPS Tensor
95.84 TFLOPS Tensor Sparse
TF32
5.41 TFLOPS Tensor
10.81 TFLOPS Tensor Sparse
-
-
-
INT4
43.25 TOPS Tensor
86.51 TOPS Tensor Sparse
INT8
-
191.67 TOPS Tensor
383.34 TOPS Tensor Sparse
INT8
-
21.63 TOPS Tensor
43.25 TOPS Tensor Sparse
INT32
11.98 TOPS
INT32
1.35 TOPS
Ray Tracing
72.7 TOPS
Ray Tracing
5.3 TOPS
Pixel Fillrate
128.352 GPixel/s
Pixel Fillrate
28.16 GPixel/s
-
-
-
-
Texture Fillrate
374.36 GTexel/s
Texture Fillrate
42.24 GTexel/s
Manufacturer
NVIDIA
Manufacturer
NVIDIA
Chip Designer
NVIDIA
Chip Designer
NVIDIA
Architecture
Blackwell
Architecture
Ampere
Family
RTX PRO Blackwell
Family
RTX A
Codename
NV193
GB203
-
-
Codename
NV177
GA107
-
-
Market Segment
Workstation
Market Segment
Workstation
Release Date
8/11/2025
Release Date
4/17/2023
Foundry
TSMC
-
Foundry
Samsung
-
Fabrication Node
4NP
-
Fabrication Node
8N
-
Die Size
378 mm²
-
Die Size
200 mm²
-
Transistor Count
45.6 Billion
-
Transistor Count
8.7 Billion
-
Transistor Density
120.63M/mm²
-
Transistor Density
43.50M/mm²
-
Form
PCIe Card
Form
PCIe Card
Shading Units
8960 Shaders
-
Shading Units
768 Shaders
-
Texture Mapping Units
280 TMUs
Texture Mapping Units
24 TMUs
Render Output Units
96 ROPs
Render Output Units
16 ROPs
Tensor Cores
280 T-Cores
Tensor Cores
24 T-Cores
Ray-Tracing Cores
70 RT-Cores
Ray-Tracing Cores
6 RT-Cores
Streaming Multiprocessors
70 SMs
Streaming Multiprocessors
6 SMs
-
-
-
-
-
-
-
-
-
-
Graphics Processing Clusters
1 GPC
-
-
790MHz Base
1337MHz
-
-
1000MHz Base
1760MHz
-
-
-
-
L1
64KB/SM Tex
128KB/SM
-
-
L1
64KB/SM Tex
128KB/SM
-
-
L2
48MB Shared
L2
1MB Shared
-
-
-
-
-
-
24GB
GDDR7
ECC
4GB
GDDR6
-
Bus Width
192Bit
Bus Width
64Bit
Clock
1125MHz
Transfer Rate
18GT/s
Bandwidth
432GB/s
Clock
1500MHz
Transfer Rate
12GT/s
Bandwidth
96GB/s
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
TDP
70W
TDP
50W
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
4x DisplayPort 2.1
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
4x mini-DisplayPort 1.4
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Max Resolution
7680x4320
Max Resolution
7680x4320
Max Resolution Refresh Rate
165Hz
Max Resolution Refresh Rate
60Hz
Variable Refresh Rate
G-Sync
FreeSync
-
Variable Refresh Rate
G-Sync
FreeSync
-
Display Stream Compression (DSC)
Supported
Display Stream Compression (DSC)
Supported
Multi Monitor Support
4
Multi Monitor Support
3
Content Protection
HDCP 2.3
Content Protection
HDCP 2.3
Model
2x NVENC 9
Model
NVENC 7
Codec
-
-
-
-
-
-
-
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Codec
-
-
-
-
-
-
-
-
AVC (H.264)
HEVC (H.265)
-
-
-
-
Model
2x NVDEC 6
Model
NVDEC 5
Codec
MPEG-1
MPEG-2
MPEG-4
-
VC-1
VP8
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Codec
MPEG-1
MPEG-2
MPEG-4
-
VC-1
VP8
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Direct X
12
Direct 3D
12_3
Direct X
12
Direct 3D
12_2
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
OpenGL
4.6
OpenCL
3.0
Vulkan
1.2
Shader Model
6.8
CUDA
12.8
-
-
PureVideo HD
VP13
VDPAU
Feature Set M
Shader Model
6.6
CUDA
8.6
-
-
PureVideo HD
VP11
VDPAU
Feature Set K
-
-
-
1x Fan
-
-
-
1x Fan
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Slots Required
1.0
PCIe Version
5.0
PCIe Lanes
16
Slots Required
1.0
PCIe Version
4.0
PCIe Lanes
8
Multi GPU Support
Supported
Type
NVLink
-
-
-
-
Height
69 mm (2.72 in)
Width
167 mm (6.57 in)
Depth
20 mm (0.79 in)
Height
69 mm (2.72 in)
Width
163 mm (6.42 in)
Depth
40 mm (1.57 in)
Change Comparison