NVIDIA GeForce RTX 4070 Super vs AMD Radeon RX 9070

NVIDIA GeForce RTX 4070 Super $599
AMD Radeon RX 9070 $549
7168 Shaders
12GB GDDR6X
2475MHz
3584 Shaders
16GB GDDR6
2520MHz
Peak Performance
1.14 POPS
INT4 Tensor Sparse
Peak Performance
1.16 POPS
INT4 Tensor Sparse
FP32
35.48 TFLOPS
FP32
36.13 TFLOPS
FP16
35.48 TFLOPS
FP16
72.25 TFLOPS
Form Factor
PCIe Card
2.1-Slots
Form Factor
PCIe Card
2.5-Slots
TDP
225W
TDP
220W
Power Connectors
-
-
-
1x 16-Pin 12VHPWR
Power Connectors
-
2x 8-Pin
-
-
GB6 OpenCL N/A
0%
GB6 Metal N/A
0%
GB6 Metal N/A
0%
GB6 Vulkan N/A
0%
GB5 OpenCL N/A
0%
GB5 OpenCL N/A
0%
GB5 CUDA N/A
0%
GB5 CUDA N/A
0%
GB5 Metal N/A
0%
GB5 Metal N/A
0%
GB5 Vulkan N/A
0%
GB5 Vulkan N/A
0%
OCT 2020.1 N/A
0%
OCT 2020.1 N/A
0%
OCT Metal N/A
0%
OCT Metal N/A
0%
Manufacturer
NVIDIA
Manufacturer
AMD
Chip Designer
NVIDIA
Chip Designer
AMD
Architecture
Ada Lovelace
Architecture
RDNA 4
Family
GeForce 40
Family
Radeon RX 9000
Codename
NV184
AD104
Variant
AD104-350-A1
Codename
-
Navi 48
Variant
Navi 48 XL
Market Segment
Desktop
Market Segment
Desktop
Release Date
1/8/2024
Release Date
1/6/2025
Foundry
TSMC
-
Foundry
TSMC
-
Fabrication Node
4N
-
Fabrication Node
N4P
-
Die Size
295 mm²
-
Die Size
357 mm²
-
Transistor Count
35.8 Billion
-
Transistor Count
53.9 Billion
-
Transistor Density
121.56M/mm²
-
Transistor Density
151.19M/mm²
-
Form
PCIe Card
Form
PCIe Card
Shading Units
7168 Shaders
-
Shading Units
3584 Shaders
-
Texture Mapping Units
224 TMUs
Texture Mapping Units
448 TMUs
Render Output Units
80 ROPs
Render Output Units
128 ROPs
Tensor Cores
224 T-Cores
Tensor Cores
112 T-Cores
Ray-Tracing Cores
56 RT-Cores
Ray-Tracing Cores
56 RT-Cores
Streaming Multiprocessors
56 SMs
-
-
-
-
Compute Units
56 CUs
-
-
-
-
Graphics Processing Clusters
5 GPCs
-
-
-
-
1980MHz Base
2475MHz
-
-
2210MHz Base
2520MHz
Peak Performance
1.14 POPS
INT4 Tensor Sparse
Peak Performance
1.16 POPS
INT4 Tensor Sparse
-
-
-
-
-
-
FP8
-
283.85 TFLOPS Tensor (FP16 Accumulate)
567.71 TFLOPS Tensor (FP16 Accumulate) Sparse
141.93 TFLOPS Tensor (FP32 Accumulate)
283.85 TFLOPS Tensor (FP32 Accumulate) Sparse
FP8
144.51 TFLOPS
144.51 TFLOPS Tensor (FP16 Accumulate)
289.01 TFLOPS Tensor (FP16 Accumulate) Sparse
144.51 TFLOPS Tensor (FP32 Accumulate)
289.01 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
35.48 TFLOPS
141.93 TFLOPS Tensor (FP16 Accumulate)
283.85 TFLOPS Tensor (FP16 Accumulate) Sparse
70.96 TFLOPS Tensor (FP32 Accumulate)
141.93 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
72.25 TFLOPS
72.25 TFLOPS Tensor (FP16 Accumulate)
144.51 TFLOPS Tensor (FP16 Accumulate) Sparse
72.25 TFLOPS Tensor (FP32 Accumulate)
144.51 TFLOPS Tensor (FP32 Accumulate) Sparse
FP32
35.48 TFLOPS
-
-
FP32
36.13 TFLOPS
-
-
FP64
550 GFLOPS
-
FP64
1.13 TFLOPS
-
BF16
35.48 TFLOPS
70.96 TFLOPS Tensor
141.93 TFLOPS Tensor Sparse
BF16
72.25 TFLOPS
72.25 TFLOPS Tensor
144.51 TFLOPS Tensor Sparse
TF32
35.48 TFLOPS Tensor
70.96 TFLOPS Tensor Sparse
-
-
-
INT4
567.71 TOPS Tensor
1.14 POPS Tensor Sparse
INT4
578.03 TOPS Tensor
1.16 POPS Tensor Sparse
INT8
-
283.85 TOPS Tensor
567.71 TOPS Tensor Sparse
INT8
-
289.01 TOPS Tensor
578.03 TOPS Tensor Sparse
INT32
17.74 TOPS
INT32
18.06 TOPS
Ray Tracing
82 TOPS
-
-
Pixel Fillrate
198 GPixel/s
Pixel Fillrate
322.56 GPixel/s
-
-
-
-
Texture Fillrate
554.4 GTexel/s
Texture Fillrate
1128.96 GTexel/s
-
-
L0
64KB/WGP
L1
64KB/SM Tex
128KB/SM
-
-
L1
-
-
-
256KB/Array
L2
36MB Shared
L2
6MB Shared
-
-
-
L3
64MB Shared
2.25TB/s
12GB
GDDR6X
-
16GB
GDDR6
-
Bus Width
192Bit
Bus Width
256Bit
Clock
1313MHz
Transfer Rate
21GT/s
Bandwidth
504GB/s
Clock
2438MHz
Transfer Rate
19.5GT/s
Bandwidth
624GB/s
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
TDP
225W
TDP
220W
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
3x DisplayPort 1.4
-
-
-
-
-
-
-
-
-
-
-
1x HDMI 2.1
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
3x DisplayPort 2.1
-
-
-
-
-
-
-
-
-
1x HDMI 2.1
-
-
Max Resolution
7680x4320
Max Resolution
15360x8640
Max Resolution Refresh Rate
60Hz
Max Resolution Refresh Rate
165Hz
Variable Refresh Rate
G-Sync
FreeSync
-
Variable Refresh Rate
-
FreeSync
-
Display Stream Compression (DSC)
Supported
Display Stream Compression (DSC)
Supported
Multi Monitor Support
4
Multi Monitor Support
3
Content Protection
HDCP 2.3
Content Protection
HDCP 2.3
Model
2x NVENC 8
Model
VCN 4.0
Codec
-
-
-
-
-
-
-
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Codec
-
-
-
-
-
-
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Model
NVDEC 5
Model
VCN 4.0
Codec
MPEG-1
MPEG-2
MPEG-4
-
VC-1
VP8
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Codec
MPEG-1
MPEG-2
MPEG-4
JPEG
VC-1
-
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Direct X
12
Direct 3D
12_3
Direct X
12
Direct 3D
12_2
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
OpenGL
4.6
OpenCL
2.2
Vulkan
1.3
Shader Model
6.7
CUDA
8.9
-
-
PureVideo HD
VP12
VDPAU
Feature Set L
Shader Model
6.8
-
-
GFX
12
-
-
-
-
-
-
-
2x Fans
-
-
-
3x Fans
Power Connectors
-
-
-
-
-
-
1x 16-Pin 12VHPWR
Power Connectors
-
-
-
2x 8-Pin
-
-
-
Slots Required
2.1
PCIe Version
4.0
PCIe Lanes
16
Slots Required
2.5
PCIe Version
5.0
PCIe Lanes
16
-
-
-
-
Multi GPU Support
Supported
Type
CrossFire XDMA
Height
112 mm (4.41 in)
Width
267 mm (10.51 in)
Depth
42 mm (1.65 in)
Height
111 mm (4.37 in)
Width
267 mm (10.51 in)
Depth
50 mm (1.97 in)
Change Comparison