AMD Radeon RX 9070 XT vs NVIDIA GeForce RTX 4070 Super

AMD Radeon RX 9070 XT $599
NVIDIA GeForce RTX 4070 Super $599
4096 Shaders
16GB GDDR6
2970MHz
7168 Shaders
12GB GDDR6X
2475MHz
Peak AI Performance
1.56 POPS
INT4 Tensor Sparse
Peak AI Performance
1.14 POPS
INT4 Tensor Sparse
FP32
48.66 TFLOPS
FP32
35.48 TFLOPS
FP16
97.32 TFLOPS
FP16
35.48 TFLOPS
Form Factor
PCIe Card
2.5-Slots
Form Factor
PCIe Card
2.1-Slots
TDP
304W
TDP
225W
Power Connectors
-
2x 8-Pin
-
-
Power Connectors
-
-
-
1x 16-Pin 12VHPWR
GB6 OpenCL N/A
0%
GB6 Metal N/A
0%
GB6 Metal N/A
0%
GB6 Vulkan N/A
0%
GB5 OpenCL N/A
0%
GB5 OpenCL N/A
0%
GB5 CUDA N/A
0%
GB5 CUDA N/A
0%
GB5 Metal N/A
0%
GB5 Metal N/A
0%
GB5 Vulkan N/A
0%
GB5 Vulkan N/A
0%
OCT 2020.1 N/A
0%
OCT 2020.1 N/A
0%
OCT Metal N/A
0%
OCT Metal N/A
0%
Manufacturer
AMD
Manufacturer
NVIDIA
Chip Designer
AMD
Chip Designer
NVIDIA
Architecture
RDNA 4
Architecture
Ada Lovelace
Family
Radeon RX 9000
Family
GeForce 40
Codename
-
Navi 48
Variant
Navi 48 XT
Codename
NV184
AD104
Variant
AD104-350-A1
Market Segment
Desktop
Market Segment
Desktop
Release Date
1/6/2025
Release Date
1/8/2024
Foundry
TSMC
-
Foundry
TSMC
-
Fabrication Node
N4P
-
Fabrication Node
4N
-
Die Size
357 mm²
-
Die Size
295 mm²
-
Transistor Count
53.9 Billion
-
Transistor Count
35.8 Billion
-
Transistor Density
151.19M/mm²
-
Transistor Density
121.56M/mm²
-
Form
PCIe Card
Form
PCIe Card
Shading Units
4096 Shaders
-
Shading Units
7168 Shaders
-
Texture Mapping Units
512 TMUs
Texture Mapping Units
224 TMUs
Render Output Units
128 ROPs
Render Output Units
80 ROPs
Tensor Cores
128 T-Cores
Tensor Cores
224 T-Cores
Ray-Tracing Cores
64 RT-Cores
Ray-Tracing Cores
56 RT-Cores
-
-
Streaming Multiprocessors
56 SMs
Compute Units
64 CUs
-
-
-
-
-
-
-
-
Graphics Processing Clusters
5 GPCs
-
-
2400MHz Base
2970MHz
-
-
1980MHz Base
2475MHz
Peak AI Performance
1.56 POPS
INT4 Tensor Sparse
Peak AI Performance
1.14 POPS
INT4 Tensor Sparse
-
-
-
-
-
-
FP8
194.64 TFLOPS
194.64 TFLOPS Tensor (FP16 Accumulate)
389.28 TFLOPS Tensor (FP16 Accumulate) Sparse
194.64 TFLOPS Tensor (FP32 Accumulate)
389.28 TFLOPS Tensor (FP32 Accumulate) Sparse
FP8
-
283.85 TFLOPS Tensor (FP16 Accumulate)
567.71 TFLOPS Tensor (FP16 Accumulate) Sparse
141.93 TFLOPS Tensor (FP32 Accumulate)
283.85 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
97.32 TFLOPS
97.32 TFLOPS Tensor (FP16 Accumulate)
194.64 TFLOPS Tensor (FP16 Accumulate) Sparse
97.32 TFLOPS Tensor (FP32 Accumulate)
194.64 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
35.48 TFLOPS
141.93 TFLOPS Tensor (FP16 Accumulate)
283.85 TFLOPS Tensor (FP16 Accumulate) Sparse
70.96 TFLOPS Tensor (FP32 Accumulate)
141.93 TFLOPS Tensor (FP32 Accumulate) Sparse
FP32
48.66 TFLOPS
-
-
FP32
35.48 TFLOPS
-
-
FP64
1.52 TFLOPS
-
FP64
550 GFLOPS
-
BF16
97.32 TFLOPS
97.32 TFLOPS Tensor
194.64 TFLOPS Tensor Sparse
BF16
35.48 TFLOPS
70.96 TFLOPS Tensor
141.93 TFLOPS Tensor Sparse
-
-
-
TF32
35.48 TFLOPS Tensor
70.96 TFLOPS Tensor Sparse
INT4
778.57 TOPS Tensor
1.56 POPS Tensor Sparse
INT4
567.71 TOPS Tensor
1.14 POPS Tensor Sparse
INT8
-
389.28 TOPS Tensor
778.57 TOPS Tensor Sparse
INT8
-
283.85 TOPS Tensor
567.71 TOPS Tensor Sparse
INT32
24.33 TOPS
INT32
17.74 TOPS
-
-
Ray Tracing
82 TOPS
Pixel Fillrate
380.16 GPixel/s
Pixel Fillrate
198 GPixel/s
-
-
-
-
Texture Fillrate
1520.64 GTexel/s
Texture Fillrate
554.4 GTexel/s
L0
64KB/WGP
-
-
L1
-
-
-
256KB/Array
L1
64KB/SM Tex
128KB/SM
-
-
L2
6MB Shared
L2
36MB Shared
L3
64MB Shared
2.25TB/s
-
-
-
16GB
GDDR6
-
12GB
GDDR6X
-
Bus Width
256Bit
Bus Width
192Bit
Clock
2438MHz
Transfer Rate
19.5GT/s
Bandwidth
624GB/s
Clock
1313MHz
Transfer Rate
21GT/s
Bandwidth
504GB/s
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
TDP
304W
TDP
225W
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
3x DisplayPort 2.1
-
-
-
-
-
-
-
-
-
1x HDMI 2.1
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
3x DisplayPort 1.4
-
-
-
-
-
-
-
-
-
-
-
1x HDMI 2.1
-
-
Max Resolution
15360x8640
Max Resolution
7680x4320
Max Resolution Refresh Rate
165Hz
Max Resolution Refresh Rate
60Hz
Variable Refresh Rate
-
FreeSync
-
Variable Refresh Rate
G-Sync
FreeSync
-
Display Stream Compression (DSC)
Supported
Display Stream Compression (DSC)
Supported
Multi Monitor Support
3
Multi Monitor Support
4
Content Protection
HDCP 2.3
Content Protection
HDCP 2.3
Model
VCN 4.0
Model
2x NVENC 8
Codec
-
-
-
-
-
-
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Codec
-
-
-
-
-
-
-
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Model
VCN 4.0
Model
NVDEC 5
Codec
MPEG-1
MPEG-2
MPEG-4
JPEG
VC-1
-
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Codec
MPEG-1
MPEG-2
MPEG-4
-
VC-1
VP8
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Direct X
12
Direct 3D
12_2
Direct X
12
Direct 3D
12_3
OpenGL
4.6
OpenCL
2.2
Vulkan
1.3
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
Shader Model
6.8
-
-
GFX
12
-
-
-
-
Shader Model
6.7
CUDA
8.9
-
-
PureVideo HD
VP12
VDPAU
Feature Set L
-
-
-
3x Fans
-
-
-
2x Fans
Power Connectors
-
-
-
2x 8-Pin
-
-
-
Power Connectors
-
-
-
-
-
-
1x 16-Pin 12VHPWR
Slots Required
2.5
PCIe Version
5.0
PCIe Lanes
16
Slots Required
2.1
PCIe Version
4.0
PCIe Lanes
16
Multi GPU Support
Supported
Type
CrossFire XDMA
-
-
-
-
Height
111 mm (4.37 in)
Width
267 mm (10.51 in)
Depth
50 mm (1.97 in)
Height
112 mm (4.41 in)
Width
267 mm (10.51 in)
Depth
42 mm (1.65 in)
Change Comparison