NVIDIA GeForce RTX 4070 Super vs Radeon Pro 580X MPX

| Summary | NVIDIA GeForce RTX 4070 Super | Radeon Pro 580X MPX |
| --- | --- | --- |
| Price | $599 | - |
| Shaders | 7168 | 2304 |
| Memory | 12GB GDDR6X | 8GB GDDR5 |
| Clock | 2475MHz | 1220MHz |
| Peak AI Performance | 1.14 POPS (INT4 Tensor Sparse) | 11.24 TFLOPS (FP16) |
| FP32 | 35.48 TFLOPS | 5.62 TFLOPS |
| FP16 | 35.48 TFLOPS | 11.24 TFLOPS |
| Form Factor | PCIe Card, 2.1 slots | MPX, 2.0 slots |
| TDP | 225W | 185W |
| Power Connectors | 1x 16-Pin 12VHPWR | - |


| Benchmark | NVIDIA GeForce RTX 4070 Super | Radeon Pro 580X MPX |
| --- | --- | --- |
| Geekbench 6 OpenCL | N/A | N/A |
| Geekbench 6 Metal | N/A | N/A |
| Geekbench 6 Vulkan | N/A | N/A |
| Geekbench 5 OpenCL | N/A | 41,950 (14%) |
| Geekbench 5 CUDA | N/A | N/A |
| Geekbench 5 Metal | N/A | 38,490 (21%) |
| Geekbench 5 Vulkan | N/A | N/A |
| OctaneBench 2020.1 | N/A | N/A |
| OctaneBench Metal | N/A | N/A |


| General | NVIDIA GeForce RTX 4070 Super | Radeon Pro 580X MPX |
| --- | --- | --- |
| Manufacturer | NVIDIA | Apple |
| Chip Designer | NVIDIA | AMD |
| Architecture | Ada Lovelace | GCN 4 |
| Family | GeForce 40 | Radeon Pro |
| Codename | NV184, AD104 | Ellesmere, Polaris 20 |
| Variant | AD104-350-A1 | Polaris 20 XTA |
| Market Segment | Desktop | Workstation |
| Release Date | 1/8/2024 | 12/11/2019 |
| Foundry | TSMC | GlobalFoundries |
| Fabrication Node | 4N | 14LPP |
| Die Size | 295 mm² | 232 mm² |
| Transistor Count | 35.8 Billion | 5.7 Billion |
| Transistor Density | 121.56M/mm² | 24.57M/mm² |


| Core Configuration | NVIDIA GeForce RTX 4070 Super | Radeon Pro 580X MPX |
| --- | --- | --- |
| Form | PCIe Card | MPX |
| Shading Units | 7168 Shaders | 2304 Shaders |
| Texture Mapping Units | 224 TMUs | 144 TMUs |
| Render Output Units | 80 ROPs | 32 ROPs |
| Tensor Cores | 224 T-Cores | - |
| Ray-Tracing Cores | 56 RT-Cores | - |
| Streaming Multiprocessors | 56 SMs | - |
| Compute Units | - | 36 CUs |
| Graphics Processing Clusters | 5 GPCs | 4 GPCs |
| Base Clock | 1980MHz | 1100MHz |
| Boost Clock | 2475MHz | 1220MHz |


| Compute Performance | NVIDIA GeForce RTX 4070 Super | Radeon Pro 580X MPX |
| --- | --- | --- |
| Peak AI Performance | 1.14 POPS (INT4 Tensor Sparse) | 11.24 TFLOPS (FP16) |
| FP8 Tensor (FP16 Accumulate) | 283.85 TFLOPS | - |
| FP8 Tensor (FP16 Accumulate), Sparse | 567.71 TFLOPS | - |
| FP8 Tensor (FP32 Accumulate) | 141.93 TFLOPS | - |
| FP8 Tensor (FP32 Accumulate), Sparse | 283.85 TFLOPS | - |
| FP16 | 35.48 TFLOPS | 11.24 TFLOPS |
| FP16 Tensor (FP16 Accumulate) | 141.93 TFLOPS | - |
| FP16 Tensor (FP16 Accumulate), Sparse | 283.85 TFLOPS | - |
| FP16 Tensor (FP32 Accumulate) | 70.96 TFLOPS | - |
| FP16 Tensor (FP32 Accumulate), Sparse | 141.93 TFLOPS | - |
| FP32 | 35.48 TFLOPS | 5.62 TFLOPS |
| FP64 | 550 GFLOPS | 350 GFLOPS |
| BF16 | 35.48 TFLOPS | - |
| BF16 Tensor | 70.96 TFLOPS | - |
| BF16 Tensor, Sparse | 141.93 TFLOPS | - |
| TF32 Tensor | 35.48 TFLOPS | - |
| TF32 Tensor, Sparse | 70.96 TFLOPS | - |
| INT4 Tensor | 567.71 TOPS | - |
| INT4 Tensor, Sparse | 1.14 POPS | - |
| INT8 Tensor | 283.85 TOPS | - |
| INT8 Tensor, Sparse | 567.71 TOPS | - |
| INT32 | 17.74 TOPS | - |
| Ray Tracing | 82 TOPS | - |

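
The FP32 figures listed for both cards are consistent with the usual peak-throughput estimate: shading units times two operations per clock times the peak clock. The sketch below reproduces them under that assumption (one fused multiply-add per shading unit per cycle, counted as two operations). The function name `fp32_tflops` is illustrative and not taken from the source.

```python
def fp32_tflops(shading_units: int, boost_mhz: float) -> float:
    """Estimate peak FP32 throughput in TFLOPS.

    Assumption: each shading unit retires one fused multiply-add
    (counted as 2 floating-point operations) per clock cycle.
    """
    flops_per_second = shading_units * 2 * boost_mhz * 1e6
    return flops_per_second / 1e12


# Shader counts and peak clocks as listed in this comparison.
cards = {
    "GeForce RTX 4070 Super": (7168, 2475),
    "Radeon Pro 580X MPX": (2304, 1220),
}

for name, (units, mhz) in cards.items():
    print(f"{name}: {fp32_tflops(units, mhz):.2f} TFLOPS")
```

Rounded to two decimals, this gives 35.48 TFLOPS and 5.62 TFLOPS, matching the listed values. These are theoretical peaks, not measured performance.
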

| Fillrate | NVIDIA GeForce RTX 4070 Super | Radeon Pro 580X MPX |
| --- | --- | --- |
| Pixel Fillrate | 198 GPixel/s | 39.04 GPixel/s |
| Texture Fillrate | 554.4 GTexel/s | 175.68 GTexel/s |

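
The fillrate figures can be cross-checked against the unit counts and peak clocks given elsewhere in this comparison. A minimal sketch, assuming each ROP writes one pixel and each TMU samples one texel per clock; the helper name `fillrate_g_per_s` is hypothetical.

```python
def fillrate_g_per_s(units: int, clock_mhz: int) -> float:
    """Peak fillrate in billions of pixels or texels per second.

    Assumption: every unit (ROP or TMU) completes one operation per clock.
    """
    return units * clock_mhz / 1000


# (ROPs, TMUs, peak clock in MHz) as listed in this comparison.
cards = {
    "GeForce RTX 4070 Super": (80, 224, 2475),
    "Radeon Pro 580X MPX": (32, 144, 1220),
}

for name, (rops, tmus, mhz) in cards.items():
    pixel = fillrate_g_per_s(rops, mhz)
    texel = fillrate_g_per_s(tmus, mhz)
    print(f"{name}: {pixel:.2f} GPixel/s, {texel:.2f} GTexel/s")
```

Under that assumption the results agree with the listed pixel and texture fillrates for both cards.
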

| Cache | NVIDIA GeForce RTX 4070 Super | Radeon Pro 580X MPX |
| --- | --- | --- |
| L1 (Texture) | 64KB/SM | - |
| L1 | 128KB/SM | 16KB/CU |
| L2 | 36MB Shared | 2MB Shared |


| Memory | NVIDIA GeForce RTX 4070 Super | Radeon Pro 580X MPX |
| --- | --- | --- |
| Capacity | 12GB | 8GB |
| Type | GDDR6X | GDDR5 |
| Bus Width | 192-bit | 256-bit |
| Clock | 1313MHz | 1710MHz |
| Transfer Rate | 21GT/s | 6.8GT/s |
| Bandwidth | 504GB/s | 218.9GB/s |

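
The bandwidth figures follow from bus width and transfer rate: bytes per transfer (bus width divided by 8) times transfers per second. The sketch below checks both cards. One assumption to flag: the listed 6.8GT/s for the Radeon Pro 580X MPX looks rounded, so the sketch uses 6.84 GT/s, derived by assuming GDDR5 performs four transfers per listed 1710MHz clock. The function name `bandwidth_gb_per_s` is illustrative.

```python
def bandwidth_gb_per_s(bus_width_bits: int, transfer_rate_gt_s: float) -> float:
    """Peak memory bandwidth in GB/s.

    bus_width_bits / 8 is the number of bytes moved per transfer;
    transfer_rate_gt_s is billions of transfers per second per pin.
    """
    return bus_width_bits / 8 * transfer_rate_gt_s


# RTX 4070 Super: 192-bit bus at the listed 21 GT/s.
rtx_4070_super = bandwidth_gb_per_s(192, 21.0)

# Radeon Pro 580X MPX: 256-bit bus. Assumed rate: 1710 MHz * 4 = 6.84 GT/s
# (the comparison lists the rounded value 6.8 GT/s).
pro_580x = bandwidth_gb_per_s(256, 1710 * 4 / 1000)

print(f"GeForce RTX 4070 Super: {rtx_4070_super:.1f} GB/s")
print(f"Radeon Pro 580X MPX: {pro_580x:.1f} GB/s")
```

With these inputs the results round to 504.0 GB/s and 218.9 GB/s, in line with the listed bandwidths. Plugging in the rounded 6.8 GT/s instead would give 217.6 GB/s, which is why the unrounded rate is used here.
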

| Power | NVIDIA GeForce RTX 4070 Super | Radeon Pro 580X MPX |
| --- | --- | --- |
| TDP | 225W | 185W |

Temp
100°C Max

| Display Outputs | NVIDIA GeForce RTX 4070 Super | Radeon Pro 580X MPX |
| --- | --- | --- |
| DisplayPort | 3x DisplayPort 1.4 | - |
| HDMI | 1x HDMI 2.1 | 2x HDMI 2.0 |


| Display Features | NVIDIA GeForce RTX 4070 Super | Radeon Pro 580X MPX |
| --- | --- | --- |
| Max Resolution | 7680x4320 | 6016x3384 |
| Max Resolution Refresh Rate | 60Hz | 60Hz |
| Variable Refresh Rate | G-Sync, FreeSync | FreeSync |
| Display Stream Compression (DSC) | Supported | Not Supported |
| Multi Monitor Support | 4 | 6 |
| Content Protection | HDCP 2.3 | HDCP 2.2 |


| Video Engine | NVIDIA GeForce RTX 4070 Super | Radeon Pro 580X MPX |
| --- | --- | --- |
| Encoder Model | 2x NVENC 8 | VCE 3.4 |
| Encode Codecs | AVC (H.264), HEVC (H.265), AV1 | AVC (H.264), HEVC (H.265) |
| Decoder Model | NVDEC 5 | UVD 6.3 |
| Decode Codecs | MPEG-1, MPEG-2, MPEG-4, VC-1, VP8, VP9, AVC (H.264), HEVC (H.265), AV1 | MPEG-1, MPEG-2, MPEG-4, JPEG, VC-1, AVC (H.264), HEVC (H.265) |

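
| API Support | NVIDIA GeForce RTX 4070 Super | Radeon Pro 580X MPX |
| --- | --- | --- |
| DirectX | 12 | 12 |
| Direct3D Feature Level | 12_3 | 12_0 |
| OpenGL | 4.6 | 4.6 |
| OpenCL | 3.0 | 2.1 |
| Vulkan | 1.3 | 1.2 |
| Shader Model | 6.7 | 6.4 |
| CUDA | 8.9 | - |
| PureVideo HD | VP12 | - |
| VDPAU | Feature Set L | - |
| GFX | - | 8 |
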

| Board | NVIDIA GeForce RTX 4070 Super | Radeon Pro 580X MPX |
| --- | --- | --- |
| Cooling | 2x Fans | - |
| Power Connectors | 1x 16-Pin 12VHPWR | - |
| Slots Required | 2.1 | 2.0 |
| PCIe Version | 4.0 | 3.0 |
| PCIe Lanes | 16 | 16 |

Multi GPU Support
Supported
Type
Bridgeless
Height
112 mm (4.41 in)
Width
267 mm (10.51 in)
Depth
42 mm (1.65 in)