AMD Radeon RX 7650 GRE vs NVIDIA GeForce RTX 4070 Super

AMD Radeon RX 7650 GRE $279
2048 Shaders
8GB GDDR6
2695MHz
NVIDIA GeForce RTX 4070 Super $599
7168 Shaders
12GB GDDR6X
2475MHz
Peak AI Performance
88.31 TOPS
INT4 Tensor
Peak AI Performance
1.14 POPS
INT4 Tensor Sparse
FP32
22.08 TFLOPS
FP32
35.48 TFLOPS
FP16
44.15 TFLOPS
FP16
35.48 TFLOPS
Form Factor
PCIe Card
2.0-Slots
Form Factor
PCIe Card
2.1-Slots
TDP
165W
TDP
225W
Power Connectors
1x 8-Pin
Power Connectors
1x 16-Pin 12VHPWR
Manufacturer
AMD
Manufacturer
NVIDIA
Chip Designer
AMD
Chip Designer
NVIDIA
Architecture
RDNA 3
Architecture
Ada Lovelace
Family
Radeon RX 7000
Family
GeForce 40
Codename
Hotpink Bonefish
Navi 33
Variant
Navi 33 XL
Codename
NV184
AD104
Variant
AD104-350-A1
Market Segment
Desktop
Market Segment
Desktop
Release Date
2/26/2025
Release Date
1/8/2024
Foundry
TSMC
Foundry
TSMC
Fabrication Node
N6
Fabrication Node
4N
Die Size
204 mm²
Die Size
295 mm²
Transistor Count
13.3 Billion
Transistor Count
35.8 Billion
Transistor Density
65.20M/mm²
Transistor Density
121.56M/mm²
Form
PCIe Card
Form
PCIe Card
Shading Units
2048 Shaders
Shading Units
7168 Shaders
Texture Mapping Units
128 TMUs
Texture Mapping Units
224 TMUs
Render Output Units
64 ROPs
Render Output Units
80 ROPs
Tensor Cores
64 T-Cores
Tensor Cores
224 T-Cores
Ray-Tracing Cores
32 RT-Cores
Ray-Tracing Cores
56 RT-Cores
Compute Units
32 CUs
Streaming Multiprocessors
56 SMs
Graphics Processing Clusters
5 GPCs
1720MHz Base
2695MHz Boost
1980MHz Base
2475MHz Boost
Peak AI Performance
88.31 TOPS
INT4 Tensor
Peak AI Performance
1.14 POPS
INT4 Tensor Sparse
FP8
283.85 TFLOPS Tensor (FP16 Accumulate)
567.71 TFLOPS Tensor (FP16 Accumulate) Sparse
141.93 TFLOPS Tensor (FP32 Accumulate)
283.85 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
44.15 TFLOPS
22.08 TFLOPS Tensor (FP16 Accumulate)
22.08 TFLOPS Tensor (FP32 Accumulate)
FP16
35.48 TFLOPS
141.93 TFLOPS Tensor (FP16 Accumulate)
283.85 TFLOPS Tensor (FP16 Accumulate) Sparse
70.96 TFLOPS Tensor (FP32 Accumulate)
141.93 TFLOPS Tensor (FP32 Accumulate) Sparse
FP32
22.08 TFLOPS
FP32
35.48 TFLOPS
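The FP32 figures listed for both cards are consistent with the usual peak-throughput estimate: shader count, times 2 operations per fused multiply-add, times boost clock. The sketch below reproduces them under one assumption that this page does not state: the RDNA 3 figure appears to include a further factor of 2 for dual-issue. The function name and the `issue_width` parameter are illustrative, not taken from any vendor tool.

```python
def peak_fp32_tflops(shaders: int, boost_mhz: float, issue_width: int = 1) -> float:
    """Theoretical peak FP32 throughput in TFLOPS.

    Assumes each shader retires one fused multiply-add (2 FLOPs) per clock.
    issue_width is an assumed extra multiplier (2 for RDNA 3 dual-issue).
    """
    flops_per_clock = shaders * 2 * issue_width
    return flops_per_clock * boost_mhz * 1e6 / 1e12


# Shader counts and boost clocks as listed on this page.
rx_7650_gre = peak_fp32_tflops(2048, 2695, issue_width=2)
rtx_4070_super = peak_fp32_tflops(7168, 2475)

print(f"RX 7650 GRE:    {rx_7650_gre:.2f} TFLOPS")
print(f"RTX 4070 Super: {rtx_4070_super:.2f} TFLOPS")
```

These are theoretical ceilings; sustained throughput in real workloads depends on occupancy, memory bandwidth, and how often the dual-issue path can actually be used.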
FP64
690 GFLOPS
FP64
550 GFLOPS
BF16
44.15 TFLOPS
22.08 TFLOPS Tensor
BF16
35.48 TFLOPS
70.96 TFLOPS Tensor
141.93 TFLOPS Tensor Sparse
TF32
35.48 TFLOPS Tensor
70.96 TFLOPS Tensor Sparse
INT4
88.31 TOPS Tensor
INT4
567.71 TOPS Tensor
1.14 POPS Tensor Sparse
INT8
22.08 TOPS Tensor
INT8
283.85 TOPS Tensor
567.71 TOPS Tensor Sparse
INT32
11.04 TOPS
INT32
17.74 TOPS
Ray Tracing
82 TOPS
Pixel Fillrate
172.48 GPixel/s
Pixel Fillrate
198 GPixel/s
Texture Fillrate
344.96 GTexel/s
Texture Fillrate
554.4 GTexel/s
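The pixel and texture fillrates listed here follow directly from the unit counts and boost clocks given elsewhere on this page: ROPs times clock for pixels, TMUs times clock for texels. A minimal sketch, with a hypothetical helper name, that reproduces the four listed values:

```python
def fillrate_g_per_s(units: int, boost_mhz: float) -> float:
    """Peak fillrate in giga-elements per second: unit count x clock in GHz."""
    return units * boost_mhz / 1000.0


# (ROPs, TMUs, boost clock in MHz) as listed on this page.
cards = {
    "RX 7650 GRE": (64, 128, 2695),
    "RTX 4070 Super": (80, 224, 2475),
}

for name, (rops, tmus, mhz) in cards.items():
    pixel = fillrate_g_per_s(rops, mhz)
    texel = fillrate_g_per_s(tmus, mhz)
    print(f"{name}: {pixel:.2f} GPixel/s, {texel:.2f} GTexel/s")
```

Both numbers assume every unit produces one element per clock at the boost frequency, so they are upper bounds rather than measured results.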
L0
64KB/WGP
L1
256KB/Array
L1
64KB/SM Tex
128KB/SM
L2
2MB Shared
L2
36MB Shared
L3
32MB Shared
1.16TB/s
8GB GDDR6
12GB GDDR6X
Bus Width
128-bit
Bus Width
192-bit
Clock
2250MHz
Transfer Rate
18GT/s
Bandwidth
288GB/s
Clock
1313MHz
Transfer Rate
21GT/s
Bandwidth
504GB/s
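The bandwidth figures can be checked from the bus width and transfer rate listed above: bandwidth in GB/s is the bus width in bytes multiplied by the transfer rate in GT/s. The helper below is an illustrative sketch, not part of any vendor tooling.

```python
def memory_bandwidth_gb_s(bus_width_bits: int, transfer_rate_gt_s: float) -> float:
    """Peak memory bandwidth in GB/s: bytes per transfer x transfers per second."""
    bytes_per_transfer = bus_width_bits / 8
    return bytes_per_transfer * transfer_rate_gt_s


# Bus widths and transfer rates as listed on this page.
rx_7650_gre_bw = memory_bandwidth_gb_s(128, 18)      # GDDR6
rtx_4070_super_bw = memory_bandwidth_gb_s(192, 21)   # GDDR6X

print(f"RX 7650 GRE:    {rx_7650_gre_bw:.0f} GB/s")
print(f"RTX 4070 Super: {rtx_4070_super_bw:.0f} GB/s")
```

The wider bus and the faster GDDR6X transfer rate both contribute to the RTX 4070 Super's advantage: 1.5x from bus width and about 1.17x from transfer rate, for 1.75x overall.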
TDP
165W
TDP
225W
Display Outputs
3x DisplayPort 2.1
1x HDMI 2.1
Display Outputs
3x DisplayPort 1.4
1x HDMI 2.1
Max Resolution
15360x8640
Max Resolution
7680x4320
Max Resolution Refresh Rate
165Hz
Max Resolution Refresh Rate
60Hz
Variable Refresh Rate
FreeSync
Variable Refresh Rate
G-Sync
FreeSync
Display Stream Compression (DSC)
Supported
Display Stream Compression (DSC)
Supported
Multi Monitor Support
3
Multi Monitor Support
4
Content Protection
HDCP 2.3
Content Protection
HDCP 2.3
Model
VCN 4.0
Model
2x NVENC 8
Codec
VP9
AVC (H.264)
HEVC (H.265)
AV1
Codec
AVC (H.264)
HEVC (H.265)
AV1
Model
VCN 4.0
Model
NVDEC 5
Codec
MPEG-1
MPEG-2
MPEG-4
JPEG
VC-1
VP9
AVC (H.264)
HEVC (H.265)
AV1
Codec
MPEG-1
MPEG-2
MPEG-4
VC-1
VP8
VP9
AVC (H.264)
HEVC (H.265)
AV1
DirectX
12
Direct3D
12_2
DirectX
12
Direct3D
12_2
OpenGL
4.6
OpenCL
2.2
Vulkan
1.3
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
Shader Model
6.7
GFX
11
Shader Model
6.7
CUDA
8.9
PureVideo HD
VP12
VDPAU
Feature Set L
2x Fans
2x Fans
Power Connectors
1x 8-Pin
Power Connectors
1x 16-Pin 12VHPWR
Slots Required
2.0
PCIe Version
4.0
PCIe Lanes
8
Slots Required
2.1
PCIe Version
4.0
PCIe Lanes
16
Multi GPU Support
Supported
Type
CrossFire XDMA
Height
115 mm (4.53 in)
Width
204 mm (8.03 in)
Depth
40 mm (1.57 in)
Height
112 mm (4.41 in)
Width
267 mm (10.51 in)
Depth
42 mm (1.65 in)