NVIDIA GeForce RTX 4070 Super vs AMD Radeon RX 9060 XT 16GB

NVIDIA GeForce RTX 4070 Super $599
AMD Radeon RX 9060 XT 16GB $349
7168 Shaders
12GB GDDR6X
2475MHz
2048 Shaders
16GB GDDR6
3130MHz
Peak AI Performance
1.14 POPS
INT4 Tensor Sparse
Peak AI Performance
820.51 TOPS
INT4 Tensor Sparse
FP32
35.48 TFLOPS
FP32
25.64 TFLOPS
FP16
35.48 TFLOPS
FP16
51.28 TFLOPS
Form Factor
PCIe Card
2.1-Slots
Form Factor
PCIe Card
2.0-Slots
TDP
225W
TDP
160W
Power Connectors
-
-
-
1x 16-Pin 12VHPWR
Power Connectors
-
1x 8-Pin
-
-
GB6 OpenCL N/A
0%
GB6 Metal N/A
0%
GB6 Metal N/A
0%
GB6 Vulkan N/A
0%
GB5 OpenCL N/A
0%
GB5 OpenCL N/A
0%
GB5 CUDA N/A
0%
GB5 CUDA N/A
0%
GB5 Metal N/A
0%
GB5 Metal N/A
0%
GB5 Vulkan N/A
0%
GB5 Vulkan N/A
0%
OCT 2020.1 N/A
0%
OCT 2020.1 N/A
0%
OCT Metal N/A
0%
OCT Metal N/A
0%
Manufacturer
NVIDIA
Manufacturer
AMD
Chip Designer
NVIDIA
Chip Designer
AMD
Architecture
Ada Lovelace
Architecture
RDNA 4
Family
GeForce 40
Family
Radeon RX 9000
Codename
NV184
AD104
Variant
AD104-350-A1
Codename
-
Navi 44
Variant
Navi 44 XT
Market Segment
Desktop
Market Segment
Desktop
Release Date
1/8/2024
Release Date
5/20/2025
Foundry
TSMC
-
Foundry
TSMC
-
Fabrication Node
4N
-
Fabrication Node
N4P
-
Die Size
295 mm²
-
Die Size
153 mm²
-
Transistor Count
35.8 Billion
-
Transistor Count
29.7 Billion
-
Transistor Density
121.56M/mm²
-
Transistor Density
194.12M/mm²
-
Form
PCIe Card
Form
PCIe Card
Shading Units
7168 Shaders
-
Shading Units
2048 Shaders
-
Texture Mapping Units
224 TMUs
Texture Mapping Units
128 TMUs
Render Output Units
80 ROPs
Render Output Units
64 ROPs
Tensor Cores
224 T-Cores
Tensor Cores
64 T-Cores
Ray-Tracing Cores
56 RT-Cores
Ray-Tracing Cores
32 RT-Cores
Streaming Multiprocessors
56 SMs
-
-
-
-
Compute Units
32 CUs
-
-
-
-
Graphics Processing Clusters
5 GPCs
-
-
-
-
1980MHz Base
2475MHz
-
-
2530MHz Base
3130MHz
Peak AI Performance
1.14 POPS
INT4 Tensor Sparse
Peak AI Performance
820.51 TOPS
INT4 Tensor Sparse
-
-
-
-
-
-
FP8
-
283.85 TFLOPS Tensor (FP16 Accumulate)
567.71 TFLOPS Tensor (FP16 Accumulate) Sparse
141.93 TFLOPS Tensor (FP32 Accumulate)
283.85 TFLOPS Tensor (FP32 Accumulate) Sparse
FP8
102.56 TFLOPS
102.56 TFLOPS Tensor (FP16 Accumulate)
205.13 TFLOPS Tensor (FP16 Accumulate) Sparse
102.56 TFLOPS Tensor (FP32 Accumulate)
205.13 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
35.48 TFLOPS
141.93 TFLOPS Tensor (FP16 Accumulate)
283.85 TFLOPS Tensor (FP16 Accumulate) Sparse
70.96 TFLOPS Tensor (FP32 Accumulate)
141.93 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
51.28 TFLOPS
51.28 TFLOPS Tensor (FP16 Accumulate)
102.56 TFLOPS Tensor (FP16 Accumulate) Sparse
51.28 TFLOPS Tensor (FP32 Accumulate)
102.56 TFLOPS Tensor (FP32 Accumulate) Sparse
FP32
35.48 TFLOPS
-
-
FP32
25.64 TFLOPS
-
-
FP64
550 GFLOPS
-
FP64
800 GFLOPS
-
BF16
35.48 TFLOPS
70.96 TFLOPS Tensor
141.93 TFLOPS Tensor Sparse
BF16
51.28 TFLOPS
51.28 TFLOPS Tensor
102.56 TFLOPS Tensor Sparse
TF32
35.48 TFLOPS Tensor
70.96 TFLOPS Tensor Sparse
-
-
-
INT4
567.71 TOPS Tensor
1.14 POPS Tensor Sparse
INT4
410.26 TOPS Tensor
820.51 TOPS Tensor Sparse
INT8
-
283.85 TOPS Tensor
567.71 TOPS Tensor Sparse
INT8
-
205.13 TOPS Tensor
410.26 TOPS Tensor Sparse
INT32
17.74 TOPS
INT32
12.82 TOPS
Ray Tracing
82 TOPS
-
-
Pixel Fillrate
198 GPixel/s
Pixel Fillrate
200.32 GPixel/s
-
-
-
-
Texture Fillrate
554.4 GTexel/s
Texture Fillrate
400.64 GTexel/s
-
-
L0
64KB/WGP
L1
64KB/SM Tex
128KB/SM
-
-
L1
-
-
-
256KB/Array
L2
36MB Shared
L2
4MB Shared
-
-
-
L3
32MB Shared
1.13TB/s
12GB
GDDR6X
-
16GB
GDDR6
-
Bus Width
192Bit
Bus Width
128Bit
Clock
1313MHz
Transfer Rate
21GT/s
Bandwidth
504GB/s
Clock
2500MHz
Transfer Rate
20GT/s
Bandwidth
320GB/s
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
TDP
225W
TDP
160W
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
3x DisplayPort 1.4
-
-
-
-
-
-
-
-
-
-
-
1x HDMI 2.1
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
2x DisplayPort 2.1
-
-
-
-
-
-
-
-
-
1x HDMI 2.1
-
-
Max Resolution
7680x4320
Max Resolution
15360x8640
Max Resolution Refresh Rate
60Hz
Max Resolution Refresh Rate
165Hz
Variable Refresh Rate
G-Sync
FreeSync
-
Variable Refresh Rate
-
FreeSync
-
Display Stream Compression (DSC)
Supported
Display Stream Compression (DSC)
Supported
Multi Monitor Support
4
Multi Monitor Support
3
Content Protection
HDCP 2.3
Content Protection
HDCP 2.3
Model
2x NVENC 8
Model
VCN 4.0
Codec
-
-
-
-
-
-
-
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Codec
-
-
-
-
-
-
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Model
NVDEC 5
Model
VCN 4.0
Codec
MPEG-1
MPEG-2
MPEG-4
-
VC-1
VP8
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Codec
MPEG-1
MPEG-2
MPEG-4
JPEG
VC-1
-
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Direct X
12
Direct 3D
12_3
Direct X
12
Direct 3D
12_2
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
OpenGL
4.6
OpenCL
2.2
Vulkan
1.3
Shader Model
6.7
CUDA
8.9
-
-
PureVideo HD
VP12
VDPAU
Feature Set L
Shader Model
6.8
-
-
GFX
12
-
-
-
-
-
-
-
2x Fans
-
-
-
3x Fans
Power Connectors
-
-
-
-
-
-
1x 16-Pin 12VHPWR
Power Connectors
-
-
-
1x 8-Pin
-
-
-
Slots Required
2.1
PCIe Version
4.0
PCIe Lanes
16
Slots Required
2.0
PCIe Version
5.0
PCIe Lanes
8
-
-
-
-
Multi GPU Support
Supported
Type
CrossFire XDMA
Height
112 mm (4.41 in)
Width
267 mm (10.51 in)
Depth
42 mm (1.65 in)
Height
111 mm (4.37 in)
Width
204 mm (8.03 in)
Depth
40 mm (1.57 in)
Change Comparison