NVIDIA GeForce RTX 4080 Super vs AMD Radeon RX 7900 XTX

NVIDIA GeForce RTX 4080 Super $999
AMD Radeon RX 7900 XTX $999
10240 Shaders
16GB GDDR6X
2550MHz
6144 Shaders
24GB GDDR6
2500MHz
Peak Performance
1.67 POPS
INT4 Tensor Sparse
Peak Performance
245.76 TOPS
INT4 Tensor
FP32
52.22 TFLOPS
FP32
61.44 TFLOPS
FP16
52.22 TFLOPS
FP16
122.88 TFLOPS
Form Factor
PCIe Card
3.0-Slots
Form Factor
PCIe Card
2.5-Slots
TDP
320W
TDP
355W
Power Connectors
-
1x 16-Pin 12VHPWR
Power Connectors
2x 8-Pin
-

Peak Performance

  • 6.8x faster vs Radeon RX 7900 XTX
  • 85% slower vs GeForce RTX 4080 Super
GeForce RTX 4080 Super - 1.67 POPS INT4 Tensor Sparse
x6.8
Radeon RX 7900 XTX - 245.76 TOPS INT4 Tensor
x1

FP32

  • 15% slower vs Radeon RX 7900 XTX
  • 18% faster vs GeForce RTX 4080 Super
GeForce RTX 4080 Super - 52.22 TFLOPS FP32
x1
Radeon RX 7900 XTX - 61.44 TFLOPS FP32
x1.18

FP16

  • 58% slower vs Radeon RX 7900 XTX
  • 2.35x faster vs GeForce RTX 4080 Super
GeForce RTX 4080 Super - 52.22 TFLOPS FP16
x1
Radeon RX 7900 XTX - 122.88 TFLOPS FP16
x2.35
  • 25% slower vs Radeon RX 7900 XTX
  • 34% faster vs GeForce RTX 4080 Super
GeForce RTX 4080 Super - 716.8GB/s
x1
Radeon RX 7900 XTX - 960GB/s
x1.34
  • 10% lower vs Radeon RX 7900 XTX
  • 11% higher vs GeForce RTX 4080 Super
GeForce RTX 4080 Super - 320W
x1
Radeon RX 7900 XTX - 355W
x1.11
  • 12% faster vs Radeon RX 7900 XTX
  • 11% slower vs GeForce RTX 4080 Super
GeForce RTX 4080 Super - GB6 OpenCL 247,660
x1.12
Radeon RX 7900 XTX - GB6 OpenCL 220,795
x1
Manufacturer
NVIDIA
Manufacturer
AMD
Chip Designer
NVIDIA
Chip Designer
AMD
Architecture
Ada Lovelace
Architecture
RDNA 3
Family
GeForce 40
Family
Radeon RX 7000
Codename
NV183
AD103
Variant
AD103-400-A1
Codename
Plum Bonito
Navi 31
Variant
Navi 31 XTX
Market Segment
Desktop
Market Segment
Desktop
Release Date
1/8/2024
Release Date
12/13/2022
Foundry
TSMC
-
Foundry
TSMC
TSMC Memory Cache Die
Fabrication Node
4N
-
Fabrication Node
N5
N6 Memory Cache Die
Die Size
379 mm²
-
Die Size
304 mm²
6x 38 mm² Memory Cache Die
Transistor Count
45.9 Billion
-
Transistor Count
45.4 Billion
6x 2.1 Billion Memory Cache Die
Transistor Density
121.24M/mm²
-
Transistor Density
149.17M/mm²
54.64M/mm² Memory Cache Die
Form
PCIe Card
Form
PCIe Card
Shading Units
10240 Shaders
Shading Units
6144 Shaders
Texture Mapping Units
320 TMUs
Texture Mapping Units
384 TMUs
Render Output Units
112 ROPs
Render Output Units
192 ROPs
Tensor Cores
320 T-Cores
Tensor Cores
192 T-Cores
Ray-Tracing Cores
80 RT-Cores
Ray-Tracing Cores
96 RT-Cores
Streaming Multiprocessors
80 SMs
-
-
-
-
Compute Units
96 CUs
Graphics Processing Clusters
7 GPCs
-
-
2295MHz Base
2550MHz
1900MHz Base
2500MHz
Peak Performance
1.67 POPS
INT4 Tensor Sparse
Peak Performance
245.76 TOPS
INT4 Tensor
FP8
417.79 TFLOPS Tensor (FP16 Accumulate)
835.58 TFLOPS Tensor (FP16 Accumulate) Sparse
208.9 TFLOPS Tensor (FP32 Accumulate)
417.79 TFLOPS Tensor (FP32 Accumulate) Sparse
-
-
-
-
-
FP16
52.22 TFLOPS
208.9 TFLOPS Tensor (FP16 Accumulate)
417.79 TFLOPS Tensor (FP16 Accumulate) Sparse
104.45 TFLOPS Tensor (FP32 Accumulate)
208.9 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
122.88 TFLOPS
61.44 TFLOPS Tensor (FP16 Accumulate)
-
61.44 TFLOPS Tensor (FP32 Accumulate)
-
FP32
52.22 TFLOPS
FP32
61.44 TFLOPS
FP64
820 GFLOPS
FP64
1.92 TFLOPS
BF16
52.22 TFLOPS
104.45 TFLOPS Tensor
208.9 TFLOPS Tensor Sparse
BF16
122.88 TFLOPS
61.44 TFLOPS Tensor
-
TF32
52.22 TFLOPS Tensor
104.45 TFLOPS Tensor Sparse
-
-
-
INT4
835.58 TOPS Tensor
1.67 POPS Tensor Sparse
INT4
245.76 TOPS Tensor
-
INT8
417.79 TOPS Tensor
835.58 TOPS Tensor Sparse
INT8
61.44 TOPS Tensor
-
INT32
26.11 TOPS
INT32
30.72 TOPS
Ray Tracing
120.7 TOPS
-
-
Pixel Fillrate
285.6 GPixel/s
Pixel Fillrate
480 GPixel/s
Texture Fillrate
816 GTexel/s
Texture Fillrate
960 GTexel/s
-
-
L0
64KB/WGP
L1
64KB/SM Tex
128KB/SM
-
L1
-
-
256KB/Array
L2
64MB Shared
L2
6MB Shared
-
-
-
L3
96MB Shared
3.5TB/s
16GB
GDDR6X
24GB
GDDR6
Bus Width
256Bit
Bus Width
384Bit
Clock
1400MHz
Transfer Rate
22.4GT/s
Bandwidth
716.8GB/s
Clock
2500MHz
Transfer Rate
20GT/s
Bandwidth
960GB/s
TDP
320W
TDP
355W
Temp
90°C Max
-
-
3x DisplayPort 1.4
-
1x HDMI 2.1
-
-
2x DisplayPort 2.1
1x HDMI 2.1
1x USB-C + DP
Max Resolution
7680x4320
Max Resolution
15360x8640
Max Resolution Refresh Rate
60Hz
Max Resolution Refresh Rate
165Hz
Variable Refresh Rate
G-Sync
FreeSync
Variable Refresh Rate
-
FreeSync
Display Stream Compression (DSC)
Supported
Display Stream Compression (DSC)
Supported
Multi Monitor Support
4
Multi Monitor Support
3
Content Protection
HDCP 2.3
Content Protection
HDCP 2.3
Model
2x NVENC 8
Model
VCN 4.0
Codec
-
AVC (H.264)
HEVC (H.265)
AV1
Codec
VP9
AVC (H.264)
HEVC (H.265)
AV1
Model
NVDEC 5
Model
VCN 4.0
Codec
MPEG-1
MPEG-2
MPEG-4
-
VC-1
VP8
VP9
AVC (H.264)
HEVC (H.265)
AV1
Codec
MPEG-1
MPEG-2
MPEG-4
JPEG
VC-1
-
VP9
AVC (H.264)
HEVC (H.265)
AV1
Direct X
12
Direct 3D
12_3
Direct X
12
Direct 3D
12_2
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
OpenGL
4.6
OpenCL
2.2
Vulkan
1.3
Shader Model
6.6
CUDA
8.9
-
-
PureVideo HD
VP12
VDPAU
Feature Set L
Shader Model
6.7
-
-
GFX
11
-
-
-
-
2x Fans
3x Fans
Power Connectors
-
1x 16-Pin 12VHPWR
Power Connectors
2x 8-Pin
-
Slots Required
3.0
PCIe Version
4.0
PCIe Lanes
16
Slots Required
2.5
PCIe Version
4.0
PCIe Lanes
16
-
-
-
-
Multi GPU Support
Supported
Type
CrossFire XDMA
Height
137 mm (5.39 in)
Width
304 mm (11.97 in)
Depth
61 mm (2.4 in)
Height
135 mm (5.31 in)
Width
287 mm (11.3 in)
Depth
51 mm (2.01 in)
Change Comparison