AMD Radeon AI PRO R9700 vs NVIDIA RTX A5000
AMD Radeon AI PRO R9700
NVIDIA RTX A5000
|
4096 Shaders
32GB GDDR6
2920MHz
|
8192 Shaders
24GB GDDR6
1695MHz
|
|
Peak AI Performance
1.53 POPS (1,530 TOPS)
INT4 Tensor Sparse
|
Peak AI Performance
888.67 TOPS
INT4 Tensor Sparse
|
|
FP32
47.84 TFLOPS
|
FP32
27.77 TFLOPS
|
|
FP16
95.68 TFLOPS
|
FP16
27.77 TFLOPS
|
|
Form Factor
PCIe Card
2.0-Slots
|
Form Factor
PCIe Card
2.0-Slots
|
|
TDP
300W
|
TDP
230W
|
|
Power Connectors
1x 12-Pin
|
Power Connectors
1x 8-Pin
|
|
FP8
191.37 TFLOPS
191.37 TFLOPS Tensor (FP16 Accumulate)
382.73 TFLOPS Tensor (FP16 Accumulate) Sparse
191.37 TFLOPS Tensor (FP32 Accumulate)
382.73 TFLOPS Tensor (FP32 Accumulate) Sparse
|
-
-
-
-
-
-
|
|
FP16
95.68 TFLOPS
95.68 TFLOPS Tensor (FP16 Accumulate)
191.37 TFLOPS Tensor (FP16 Accumulate) Sparse
95.68 TFLOPS Tensor (FP32 Accumulate)
191.37 TFLOPS Tensor (FP32 Accumulate) Sparse
|
FP16
27.77 TFLOPS
111.08 TFLOPS Tensor (FP16 Accumulate)
222.17 TFLOPS Tensor (FP16 Accumulate) Sparse
111.08 TFLOPS Tensor (FP32 Accumulate)
222.17 TFLOPS Tensor (FP32 Accumulate) Sparse
|
|
FP32
47.84 TFLOPS
-
-
|
FP32
27.77 TFLOPS
-
-
|
|
FP64
1.5 TFLOPS
-
|
FP64
430 GFLOPS
-
|
|
BF16
95.68 TFLOPS
95.68 TFLOPS Tensor
191.37 TFLOPS Tensor Sparse
|
BF16
27.77 TFLOPS
111.08 TFLOPS Tensor
222.17 TFLOPS Tensor Sparse
|
|
-
-
-
|
TF32
55.54 TFLOPS Tensor
111.08 TFLOPS Tensor Sparse
|
|
INT4
765.46 TOPS Tensor
1.53 POPS Tensor Sparse
|
INT4
444.33 TOPS Tensor
888.67 TOPS Tensor Sparse
|
|
INT8
-
382.73 TOPS Tensor
765.46 TOPS Tensor Sparse
|
INT8
-
222.17 TOPS Tensor
444.33 TOPS Tensor Sparse
|
|
INT32
23.92 TOPS
|
INT32
13.89 TOPS
|
|
-
-
|
Ray Tracing
54.2 TFLOPS
|
|
Pixel Fillrate
373.76 GPixel/s
|
Pixel Fillrate
162.72 GPixel/s
|
|
Texture Fillrate
747.52 GTexel/s
|
Texture Fillrate
433.92 GTexel/s
|
|
Manufacturer
AMD
|
Manufacturer
NVIDIA
|
|
Chip Designer
AMD
|
Chip Designer
NVIDIA
|
|
Architecture
RDNA 4
|
Architecture
Ampere
|
|
Family
Radeon AI PRO
|
Family
RTX A
|
|
Codename
-
Navi 48
Variant
Navi 48 XT
|
Codename
NV172
GA102
Variant
GA102-850-A1
|
|
Market Segment
Workstation
|
Market Segment
Workstation
|
|
Release Date
5/20/2025
|
Release Date
4/12/2021
|
|
Foundry
TSMC
-
|
Foundry
Samsung
-
|
|
Fabrication Node
N4P
-
|
Fabrication Node
8N
-
|
|
Die Size
357 mm²
-
|
Die Size
628 mm²
-
|
|
Transistor Count
53.9 Billion
-
|
Transistor Count
28.3 Billion
-
|
|
Transistor Density
151.19M/mm²
-
|
Transistor Density
45.04M/mm²
-
|
|
Form
PCIe Card
|
Form
PCIe Card
|
|
Shading Units
4096 Shaders
-
|
Shading Units
8192 Shaders
-
|
|
Texture Mapping Units
256 TMUs
|
Texture Mapping Units
256 TMUs
|
|
Render Output Units
128 ROPs
|
Render Output Units
96 ROPs
|
|
Tensor Cores
128 T-Cores
|
Tensor Cores
256 T-Cores
|
|
Ray-Tracing Cores
64 RT-Cores
|
Ray-Tracing Cores
64 RT-Cores
|
|
-
-
|
Streaming Multiprocessors
64 SMs
|
|
Compute Units
64 CUs
|
-
-
|
|
GPU Clock
2400MHz Base
2920MHz Boost
|
GPU Clock
1170MHz Base
1695MHz Boost
|
|
L0
64KB/WGP
|
-
-
|
|
L1
-
-
-
256KB/Array
|
L1
64KB/SM Tex
128KB/SM
-
-
|
|
L2
8MB Shared
|
L2
6MB Shared
|
|
L3
64MB Shared
2.25TB/s
|
-
-
-
|
|
Memory
32GB GDDR6
-
|
Memory
24GB GDDR6
ECC
|
|
Bus Width
256-bit
|
Bus Width
384-bit
|
|
Clock
2500MHz
Transfer Rate
20GT/s
Bandwidth
640GB/s
|
Clock
2000MHz
Transfer Rate
16GT/s
Bandwidth
768GB/s
|
|
TDP
300W
|
TDP
230W
|
|
Display Outputs
3x DisplayPort 2.1
1x HDMI 2.1
|
Display Outputs
4x DisplayPort 1.4
|
|
Max Resolution
15360x8640
|
Max Resolution
7680x4320
|
|
Max Resolution Refresh Rate
165Hz
|
Max Resolution Refresh Rate
60Hz
|
|
Variable Refresh Rate
-
FreeSync
-
|
Variable Refresh Rate
G-Sync
FreeSync
-
|
|
Display Stream Compression (DSC)
Supported
|
Display Stream Compression (DSC)
Supported
|
|
Multi Monitor Support
3
|
Multi Monitor Support
3
|
|
Content Protection
HDCP 2.3
|
Content Protection
HDCP 2.3
|
|
Video Encoder Model
VCN 4.0
|
Video Encoder Model
NVENC 7
|
|
Encode Codecs
VP9
AVC (H.264)
HEVC (H.265)
AV1
|
Encode Codecs
AVC (H.264)
HEVC (H.265)
|
|
Video Decoder Model
VCN 4.0
|
Video Decoder Model
NVDEC 5
|
|
Decode Codecs
MPEG-1
MPEG-2
MPEG-4
JPEG
VC-1
VP9
AVC (H.264)
HEVC (H.265)
AV1
|
Decode Codecs
MPEG-1
MPEG-2
MPEG-4
VC-1
VP8
VP9
AVC (H.264)
HEVC (H.265)
AV1
|
|
DirectX
12
Direct3D Feature Level
12_2
|
DirectX
12
Direct3D Feature Level
12_2
|
|
OpenGL
4.6
OpenCL
2.2
Vulkan
1.3
|
OpenGL
4.6
OpenCL
3.0
Vulkan
1.2
|
|
Shader Model
6.8
GFX
12
|
Shader Model
6.6
CUDA Compute Capability
8.6
PureVideo HD
VP11
VDPAU
Feature Set K
|
|
Cooling
1x Fan
|
Cooling
1x Fan
|
|
Power Connectors
1x 12-Pin
|
Power Connectors
1x 8-Pin
|
|
Slots Required
2.0
PCIe Version
5.0
PCIe Lanes
16
|
Slots Required
2.0
PCIe Version
4.0
PCIe Lanes
16
|
|
Multi GPU Support
Supported
Type
CrossFire XDMA
|
Multi GPU Support
Supported
Type
2-way NVLink
|
|
Height
111 mm (4.37 in)
Width
241 mm (9.49 in)
Depth
40 mm (1.57 in)
|
Height
112 mm (4.41 in)
Width
267 mm (10.51 in)
Depth
40 mm (1.57 in)
|
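Several of the figures listed above appear to be derived from the unit counts and clocks rather than measured. The sketch below recomputes FP32 throughput, pixel and texture fillrate, and memory bandwidth from the listed values, as a rough cross-check of the table. The `GpuSpec` class and the helper functions are hypothetical names introduced here for illustration. The `fp32_ops_per_shader_clock` values (4 for the R9700, 2 for the A5000) are an assumption, not part of the listing; they were chosen because they reproduce the listed TFLOPS figures.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GpuSpec:
    """Subset of the listed specifications needed for the derived figures."""
    name: str
    shaders: int
    boost_mhz: float
    rops: int
    tmus: int
    bus_width_bits: int
    transfer_rate_gtps: float
    # ASSUMPTION (not stated in the listing): FP32 operations counted per
    # shader per clock, picked so the result matches the listed TFLOPS.
    fp32_ops_per_shader_clock: int


def fp32_tflops(gpu: GpuSpec) -> float:
    """Shaders x ops per clock x boost clock, converted from MFLOPS to TFLOPS."""
    return gpu.shaders * gpu.fp32_ops_per_shader_clock * gpu.boost_mhz / 1e6


def pixel_fillrate_gpixels(gpu: GpuSpec) -> float:
    """One pixel per ROP per clock, in GPixel/s."""
    return gpu.rops * gpu.boost_mhz / 1e3


def texture_fillrate_gtexels(gpu: GpuSpec) -> float:
    """One texel per TMU per clock, in GTexel/s."""
    return gpu.tmus * gpu.boost_mhz / 1e3


def memory_bandwidth_gbs(gpu: GpuSpec) -> float:
    """Bus width in bytes x per-pin transfer rate, in GB/s."""
    return gpu.bus_width_bits / 8 * gpu.transfer_rate_gtps


R9700 = GpuSpec(
    name="AMD Radeon AI PRO R9700", shaders=4096, boost_mhz=2920,
    rops=128, tmus=256, bus_width_bits=256, transfer_rate_gtps=20,
    fp32_ops_per_shader_clock=4,  # assumption, see lead-in
)
A5000 = GpuSpec(
    name="NVIDIA RTX A5000", shaders=8192, boost_mhz=1695,
    rops=96, tmus=256, bus_width_bits=384, transfer_rate_gtps=16,
    fp32_ops_per_shader_clock=2,  # assumption, see lead-in
)

if __name__ == "__main__":
    for gpu in (R9700, A5000):
        print(gpu.name)
        print(f"  FP32:             {fp32_tflops(gpu):.2f} TFLOPS")
        print(f"  Pixel fillrate:   {pixel_fillrate_gpixels(gpu):.2f} GPixel/s")
        print(f"  Texture fillrate: {texture_fillrate_gtexels(gpu):.2f} GTexel/s")
        print(f"  Memory bandwidth: {memory_bandwidth_gbs(gpu):.0f} GB/s")
```

Under these assumptions the computed values line up with the listed FP32, fillrate, and bandwidth rows for both cards. These are theoretical peaks at boost clock, so they say little about sustained performance in a real workload.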