AMD Radeon AI PRO R9700 vs NVIDIA RTX PRO 4000 Blackwell

AMD Radeon AI PRO R9700
NVIDIA RTX PRO 4000 Blackwell
4096 Shaders
32GB GDDR6
2920MHz
8960 Shaders
24GB GDDR7
2452MHz
Peak AI Performance
1.53 POPS
INT4 Tensor Sparse
Peak AI Performance
1.41 PFLOPS
FP4 Tensor Sparse
FP32
47.84 TFLOPS
FP32
43.94 TFLOPS
FP16
95.68 TFLOPS
FP16
43.94 TFLOPS
Form Factor
PCIe Card
2.0-Slots
Form Factor
PCIe Card
1.0-Slots
TDP
300W
TDP
140W
Power Connectors
-
-
1x 12-Pin
-
Power Connectors
-
-
-
1x 16-Pin 12VHPWR
GB6 OpenCL N/A
0%
GB6 OpenCL N/A
0%
GB6 Metal N/A
0%
GB6 Metal N/A
0%
GB6 Vulkan N/A
0%
GB6 Vulkan N/A
0%
GB5 OpenCL N/A
0%
GB5 OpenCL N/A
0%
GB5 CUDA N/A
0%
GB5 CUDA N/A
0%
GB5 Metal N/A
0%
GB5 Metal N/A
0%
GB5 Vulkan N/A
0%
GB5 Vulkan N/A
0%
OCT 2020.1 N/A
0%
OCT 2020.1 N/A
0%
OCT Metal N/A
0%
OCT Metal N/A
0%
Manufacturer
AMD
Manufacturer
NVIDIA
Chip Designer
AMD
Chip Designer
NVIDIA
Architecture
RDNA 4
Architecture
Blackwell
Family
Radeon AI PRO
Family
RTX PRO Blackwell
Codename
-
Navi 48
Variant
Navi 48 XT
Codename
NV193
GB203
-
-
Market Segment
Workstation
Market Segment
Workstation
Release Date
5/20/2025
Release Date
3/18/2025
Foundry
TSMC
-
Foundry
TSMC
-
Fabrication Node
N4P
-
Fabrication Node
4NP
-
Die Size
357 mm²
-
Die Size
378 mm²
-
Transistor Count
53.9 Billion
-
Transistor Count
45.6 Billion
-
Transistor Density
151.19M/mm²
-
Transistor Density
120.63M/mm²
-
Form
PCIe Card
Form
PCIe Card
Shading Units
4096 Shaders
-
Shading Units
8960 Shaders
-
Texture Mapping Units
256 TMUs
Texture Mapping Units
280 TMUs
Render Output Units
128 ROPs
Render Output Units
96 ROPs
Tensor Cores
128 T-Cores
Tensor Cores
280 T-Cores
Ray-Tracing Cores
64 RT-Cores
Ray-Tracing Cores
70 RT-Cores
-
-
Streaming Multiprocessors
70 SMs
Compute Units
64 CUs
-
-
-
-
-
-
-
-
-
-
-
-
2400MHz Base
2920MHz
-
-
-
2452MHz
Peak AI Performance
1.53 POPS
INT4 Tensor Sparse
Peak AI Performance
1.41 PFLOPS
FP4 Tensor Sparse
-
-
-
FP4
703.04 TFLOPS Tensor
1.41 PFLOPS Tensor Sparse
FP8
191.37 TFLOPS
191.37 TFLOPS Tensor (FP16 Accumulate)
382.73 TFLOPS Tensor (FP16 Accumulate) Sparse
191.37 TFLOPS Tensor (FP32 Accumulate)
382.73 TFLOPS Tensor (FP32 Accumulate) Sparse
FP8
-
351.52 TFLOPS Tensor (FP16 Accumulate)
703.04 TFLOPS Tensor (FP16 Accumulate) Sparse
351.52 TFLOPS Tensor (FP32 Accumulate)
703.04 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
95.68 TFLOPS
95.68 TFLOPS Tensor (FP16 Accumulate)
191.37 TFLOPS Tensor (FP16 Accumulate) Sparse
95.68 TFLOPS Tensor (FP32 Accumulate)
191.37 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
43.94 TFLOPS
175.76 TFLOPS Tensor (FP16 Accumulate)
351.52 TFLOPS Tensor (FP16 Accumulate) Sparse
175.76 TFLOPS Tensor (FP32 Accumulate)
351.52 TFLOPS Tensor (FP32 Accumulate) Sparse
FP32
47.84 TFLOPS
-
-
FP32
43.94 TFLOPS
-
-
FP64
1.5 TFLOPS
-
FP64
690 GFLOPS
-
BF16
95.68 TFLOPS
95.68 TFLOPS Tensor
191.37 TFLOPS Tensor Sparse
BF16
43.94 TFLOPS
175.76 TFLOPS Tensor
351.52 TFLOPS Tensor Sparse
-
-
-
TF32
87.88 TFLOPS Tensor
175.76 TFLOPS Tensor Sparse
INT4
765.46 TOPS Tensor
1.53 POPS Tensor Sparse
-
-
-
INT8
-
382.73 TOPS Tensor
765.46 TOPS Tensor Sparse
INT8
-
351.52 TOPS Tensor
703.04 TOPS Tensor Sparse
INT32
23.92 TOPS
INT32
21.97 TOPS
-
-
Ray Tracing
133.3 TOPS
Pixel Fillrate
373.76 GPixel/s
Pixel Fillrate
235.392 GPixel/s
-
-
-
-
Texture Fillrate
747.52 GTexel/s
Texture Fillrate
686.56 GTexel/s
L0
64KB/WGP
-
-
L1
-
-
-
256KB/Array
L1
64KB/SM Tex
128KB/SM
-
-
L2
8MB Shared
L2
48MB Shared
L3
64MB Shared
2.25TB/s
-
-
-
32GB
GDDR6
-
24GB
GDDR7
ECC
Bus Width
256Bit
Bus Width
192Bit
Clock
2500MHz
Transfer Rate
20GT/s
Bandwidth
640GB/s
Clock
2210MHz
Transfer Rate
28GT/s
Bandwidth
672GB/s
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
TDP
300W
TDP
140W
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
3x DisplayPort 2.1
-
-
-
-
-
-
-
-
-
1x HDMI 2.1
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
4x DisplayPort 2.1
-
-
-
-
-
-
-
-
-
-
-
-
Max Resolution
15360x8640
Max Resolution
7680x4320
Max Resolution Refresh Rate
165Hz
Max Resolution Refresh Rate
165Hz
Variable Refresh Rate
-
FreeSync
-
Variable Refresh Rate
G-Sync
FreeSync
-
Display Stream Compression (DSC)
Supported
Display Stream Compression (DSC)
Supported
Multi Monitor Support
3
Multi Monitor Support
4
Content Protection
HDCP 2.3
Content Protection
HDCP 2.3
Model
VCN 4.0
Model
2x NVENC 9
Codec
-
-
-
-
-
-
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Codec
-
-
-
-
-
-
-
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Model
VCN 4.0
Model
2x NVDEC 6
Codec
MPEG-1
MPEG-2
MPEG-4
JPEG
VC-1
-
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Codec
MPEG-1
MPEG-2
MPEG-4
-
VC-1
VP8
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Direct X
12
Direct 3D
12_2
Direct X
12
Direct 3D
12_3
OpenGL
4.6
OpenCL
2.2
Vulkan
1.3
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
Shader Model
6.8
-
-
GFX
12
-
-
-
-
Shader Model
6.8
CUDA
12.8
-
-
PureVideo HD
VP13
VDPAU
Feature Set M
-
-
-
1x Fan
-
-
-
1x Fan
Power Connectors
-
-
-
-
-
1x 12-Pin
-
Power Connectors
-
-
-
-
-
-
1x 16-Pin 12VHPWR
Slots Required
2.0
PCIe Version
5.0
PCIe Lanes
16
Slots Required
1.0
PCIe Version
5.0
PCIe Lanes
16
Multi GPU Support
Supported
Type
CrossFire XDMA
Multi GPU Support
Supported
Type
NVLink
Height
111 mm (4.37 in)
Width
241 mm (9.49 in)
Depth
40 mm (1.57 in)
Height
111.8 mm (4.4 in)
Width
241.3 mm (9.5 in)
Depth
20 mm (0.79 in)
Change Comparison