NVIDIA GeForce RTX 5090 vs NVIDIA GeForce RTX 4080 Super

NVIDIA GeForce RTX 5090 $1,999
NVIDIA GeForce RTX 4080 Super $999
21760 Shaders
32GB GDDR7
2407MHz
10240 Shaders
16GB GDDR6X
2550MHz
Peak AI Performance
3.35 PFLOPS
FP4 Tensor Sparse
Peak AI Performance
1.67 POPS
INT4 Tensor Sparse
FP32
104.75 TFLOPS
FP32
52.22 TFLOPS
FP16
104.75 TFLOPS
FP16
52.22 TFLOPS
Form Factor
PCIe Card
2.1-Slots
Form Factor
PCIe Card
3.0-Slots
TDP
575W
TDP
320W
Power Connectors
1x 16-Pin 12VHPWR
Power Connectors
1x 16-Pin 12VHPWR

Peak AI Performance

  • 2.01x faster vs GeForce RTX 4080 Super
  • 50% slower vs GeForce RTX 5090
GeForce RTX 5090 - 3.35 PFLOPS FP4 Tensor Sparse
x2.01
GeForce RTX 4080 Super - 1.67 POPS INT4 Tensor Sparse
x1

FP32

  • 2.01x faster vs GeForce RTX 4080 Super
  • 50% slower vs GeForce RTX 5090
GeForce RTX 5090 - 104.75 TFLOPS FP32
x2.01
GeForce RTX 4080 Super - 52.22 TFLOPS FP32
x1

FP16

  • 2.01x faster vs GeForce RTX 4080 Super
  • 50% slower vs GeForce RTX 5090
GeForce RTX 5090 - 104.75 TFLOPS FP16
x2.01
GeForce RTX 4080 Super - 52.22 TFLOPS FP16
x1
  • 2.5x faster vs GeForce RTX 4080 Super
  • 60% slower vs GeForce RTX 5090
GeForce RTX 5090 - 1792GB/s
x2.5
GeForce RTX 4080 Super - 716.8GB/s
x1
  • 80% higher vs GeForce RTX 4080 Super
  • 44% lower vs GeForce RTX 5090
GeForce RTX 5090 - 575W
x1.8
GeForce RTX 4080 Super - 320W
x1
  • 56% faster vs GeForce RTX 4080 Super
  • 36% slower vs GeForce RTX 5090
GeForce RTX 5090 - GB6 OpenCL 385,770
x1.56
GeForce RTX 4080 Super - GB6 OpenCL 247,660
x1
Manufacturer
NVIDIA
Manufacturer
NVIDIA
Chip Designer
NVIDIA
Chip Designer
NVIDIA
Architecture
Blackwell
Architecture
Ada Lovelace
Family
GeForce 50
Family
GeForce 40
Codename
NV192
GB202
Variant
GB202-300-A1
Codename
NV183
AD103
Variant
AD103-400-A1
Market Segment
Desktop
Market Segment
Desktop
Release Date
1/6/2025
Release Date
1/8/2024
Foundry
TSMC
Foundry
TSMC
Fabrication Node
4NP
Fabrication Node
4N
Die Size
762 mm²
Die Size
379 mm²
Transistor Count
92.2 Billion
Transistor Count
45.9 Billion
Transistor Density
121.07M/mm²
Transistor Density
121.24M/mm²
Form
PCIe Card
Form
PCIe Card
Shading Units
21760 Shaders
Shading Units
10240 Shaders
Texture Mapping Units
680 TMUs
Texture Mapping Units
320 TMUs
Render Output Units
176 ROPs
Render Output Units
112 ROPs
Tensor Cores
680 T-Cores
Tensor Cores
320 T-Cores
Ray-Tracing Cores
170 RT-Cores
Ray-Tracing Cores
80 RT-Cores
Streaming Multiprocessors
170 SMs
Streaming Multiprocessors
80 SMs
Graphics Processing Clusters
11 GPCs
Graphics Processing Clusters
7 GPCs
2010MHz Base
2407MHz
2295MHz Base
2550MHz
Peak AI Performance
3.35 PFLOPS
FP4 Tensor Sparse
Peak AI Performance
1.67 POPS
INT4 Tensor Sparse
FP4
1.68 PFLOPS Tensor
3.35 PFLOPS Tensor Sparse
-
-
-
FP8
838.02 TFLOPS Tensor (FP16 Accumulate)
1.68 PFLOPS Tensor (FP16 Accumulate) Sparse
419.01 TFLOPS Tensor (FP32 Accumulate)
838.02 TFLOPS Tensor (FP32 Accumulate) Sparse
FP8
417.79 TFLOPS Tensor (FP16 Accumulate)
835.58 TFLOPS Tensor (FP16 Accumulate) Sparse
208.9 TFLOPS Tensor (FP32 Accumulate)
417.79 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
104.75 TFLOPS
419.01 TFLOPS Tensor (FP16 Accumulate)
838.02 TFLOPS Tensor (FP16 Accumulate) Sparse
209.51 TFLOPS Tensor (FP32 Accumulate)
419.01 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
52.22 TFLOPS
208.9 TFLOPS Tensor (FP16 Accumulate)
417.79 TFLOPS Tensor (FP16 Accumulate) Sparse
104.45 TFLOPS Tensor (FP32 Accumulate)
208.9 TFLOPS Tensor (FP32 Accumulate) Sparse
FP32
104.75 TFLOPS
FP32
52.22 TFLOPS
FP64
1.64 TFLOPS
FP64
820 GFLOPS
BF16
104.75 TFLOPS
209.51 TFLOPS Tensor
419.01 TFLOPS Tensor Sparse
BF16
52.22 TFLOPS
104.45 TFLOPS Tensor
208.9 TFLOPS Tensor Sparse
TF32
104.75 TFLOPS Tensor
209.51 TFLOPS Tensor Sparse
TF32
52.22 TFLOPS Tensor
104.45 TFLOPS Tensor Sparse
-
-
-
INT4
835.58 TOPS Tensor
1.67 POPS Tensor Sparse
INT8
838.02 TOPS Tensor
1.68 POPS Tensor Sparse
INT8
417.79 TOPS Tensor
835.58 TOPS Tensor Sparse
INT32
104.75 TOPS
INT32
26.11 TOPS
Ray Tracing
317.7 TOPS
Ray Tracing
120.7 TOPS
Pixel Fillrate
423.632 GPixel/s
Pixel Fillrate
285.6 GPixel/s
Texture Fillrate
1636.76 GTexel/s
Texture Fillrate
816 GTexel/s
L1
64KB/SM Tex
128KB/SM
L1
64KB/SM Tex
128KB/SM
L2
96MB Shared
L2
64MB Shared
32GB
GDDR7
16GB
GDDR6X
Bus Width
512Bit
Bus Width
256Bit
Clock
2210MHz
Transfer Rate
28GT/s
Bandwidth
1792GB/s
Clock
1400MHz
Transfer Rate
22.4GT/s
Bandwidth
716.8GB/s
TDP
575W
TDP
320W
Temp
90°C Max
Temp
90°C Max
-
3x DisplayPort 2.1
1x HDMI 2.1
3x DisplayPort 1.4
-
1x HDMI 2.1
Max Resolution
7680x4320
Max Resolution
7680x4320
Max Resolution Refresh Rate
165Hz
Max Resolution Refresh Rate
60Hz
Variable Refresh Rate
G-Sync
FreeSync
Variable Refresh Rate
G-Sync
FreeSync
Display Stream Compression (DSC)
Supported
Display Stream Compression (DSC)
Supported
Multi Monitor Support
4
Multi Monitor Support
4
Content Protection
HDCP 2.3
Content Protection
HDCP 2.3
Model
3x NVENC 9
Model
2x NVENC 8
Codec
AVC (H.264)
HEVC (H.265)
AV1
Codec
AVC (H.264)
HEVC (H.265)
AV1
Model
2x NVDEC 6
Model
NVDEC 5
Codec
MPEG-1
MPEG-2
MPEG-4
VC-1
VP8
VP9
AVC (H.264)
HEVC (H.265)
AV1
Codec
MPEG-1
MPEG-2
MPEG-4
VC-1
VP8
VP9
AVC (H.264)
HEVC (H.265)
AV1
Direct X
12
Direct 3D
12_3
Direct X
12
Direct 3D
12_3
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
Shader Model
6.8
CUDA
12.8
PureVideo HD
VP13
VDPAU
Feature Set M
Shader Model
6.6
CUDA
8.9
PureVideo HD
VP12
VDPAU
Feature Set L
2x Fans
2x Fans
Power Connectors
1x 16-Pin 12VHPWR
Power Connectors
1x 16-Pin 12VHPWR
Slots Required
2.1
PCIe Version
5.0
PCIe Lanes
16
Slots Required
3.0
PCIe Version
4.0
PCIe Lanes
16
Height
137 mm (5.39 in)
Width
304 mm (11.97 in)
Depth
42 mm (1.65 in)
Height
137 mm (5.39 in)
Width
304 mm (11.97 in)
Depth
61 mm (2.4 in)
Change Comparison