NVIDIA GeForce RTX 4080 vs GIGABYTE GeForce RTX 3090 Ti Gaming OC

NVIDIA GeForce RTX 4080 $1,199
GIGABYTE GeForce RTX 3090 Ti Gaming OC $1,999
9728 Shaders
16GB GDDR6X
2505MHz
10752 Shaders
24GB GDDR6X
1905MHz
Peak AI Performance
1.56 POPS
INT4 Tensor Sparse
Peak AI Performance
1.31 POPS
INT4 Tensor Sparse
FP32
48.74 TFLOPS
FP32
40.97 TFLOPS
FP16
48.74 TFLOPS
FP16
40.97 TFLOPS
Form Factor
PCIe Card
3.0-Slots
Form Factor
PCIe Card
3.4-Slots
TDP
320W
TDP
450W
Power Connectors
1x 16-Pin 12VHPWR
Power Connectors
1x 16-Pin 12VHPWR

Peak AI Performance

  • GeForce RTX 4080: 19% faster vs GeForce RTX 3090 Ti Gaming OC
  • GeForce RTX 3090 Ti Gaming OC: 16% slower vs GeForce RTX 4080
GeForce RTX 4080 - 1.56 POPS INT4 Tensor Sparse
x1.19
GeForce RTX 3090 Ti Gaming OC - 1.31 POPS INT4 Tensor Sparse
x1

FP32

  • GeForce RTX 4080: 19% faster vs GeForce RTX 3090 Ti Gaming OC
  • GeForce RTX 3090 Ti Gaming OC: 16% slower vs GeForce RTX 4080
GeForce RTX 4080 - 48.74 TFLOPS FP32
x1.19
GeForce RTX 3090 Ti Gaming OC - 40.97 TFLOPS FP32
x1

FP16

  • GeForce RTX 4080: 19% faster vs GeForce RTX 3090 Ti Gaming OC
  • GeForce RTX 3090 Ti Gaming OC: 16% slower vs GeForce RTX 4080
GeForce RTX 4080 - 48.74 TFLOPS FP16
x1.19
GeForce RTX 3090 Ti Gaming OC - 40.97 TFLOPS FP16
x1

Memory Bandwidth

  • GeForce RTX 4080: 29% lower vs GeForce RTX 3090 Ti Gaming OC
  • GeForce RTX 3090 Ti Gaming OC: 41% higher vs GeForce RTX 4080
GeForce RTX 4080 - 716.8GB/s
x1
GeForce RTX 3090 Ti Gaming OC - 1008GB/s
x1.41

TDP

  • GeForce RTX 4080: 29% lower vs GeForce RTX 3090 Ti Gaming OC
  • GeForce RTX 3090 Ti Gaming OC: 41% higher vs GeForce RTX 4080
GeForce RTX 4080 - 320W
x1
GeForce RTX 3090 Ti Gaming OC - 450W
x1.41
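
The multipliers and percentages in the comparison above are plain ratios of the two cards' figures. The following is a minimal Python sketch of that arithmetic, using only numbers quoted on this page; the helper name `relative` is hypothetical and not part of any tool referenced here.

```python
def relative(higher: float, lower: float) -> tuple[float, float]:
    """Return (multiplier of higher over lower, percent by which lower trails higher)."""
    multiplier = higher / lower
    deficit_pct = (1 - lower / higher) * 100
    return multiplier, deficit_pct


# FP32 throughput in TFLOPS: RTX 4080 vs RTX 3090 Ti Gaming OC.
fp32_mult, fp32_deficit = relative(48.74, 40.97)

# Memory bandwidth in GB/s: RTX 3090 Ti Gaming OC vs RTX 4080.
bw_mult, bw_deficit = relative(1008.0, 716.8)

print(f"FP32: x{fp32_mult:.2f}, the other card trails by {fp32_deficit:.0f}%")
print(f"Bandwidth: x{bw_mult:.2f}, the other card trails by {bw_deficit:.0f}%")
```

Note that the two percentages in each pair are not symmetric: a 19% lead for one card corresponds to a 16% deficit for the other, because each percentage is taken relative to a different baseline.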
GB6 OpenCL N/A
0%
GB6 Vulkan N/A
0%
GB5 OpenCL N/A
0%
GB5 OpenCL 230,890
76%
GB5 CUDA N/A
0%
GB5 CUDA 268,250
76%
GB5 Vulkan N/A
0%
GB5 Vulkan 155,325
76%
OCT 2020.1 N/A
0%
OCT 2020.1 755
99%
Manufacturer
NVIDIA
Manufacturer
GIGABYTE
Chip Designer
NVIDIA
Chip Designer
NVIDIA
Architecture
Ada Lovelace
Architecture
Ampere
Family
GeForce 40
Family
GeForce 30
Codename
NV183
AD103
Variant
AD103-300-A1
Codename
NV172
GA102
Variant
GA102-350-A1
Market Segment
Desktop
Market Segment
Desktop
Release Date
11/16/2022
Release Date
3/29/2022
Foundry
TSMC
Foundry
Samsung
Fabrication Node
4N
Fabrication Node
8N
Die Size
379 mm²
Die Size
628 mm²
Transistor Count
45.9 Billion
Transistor Count
28.3 Billion
Transistor Density
121.24M/mm²
Transistor Density
45.04M/mm²
Form
PCIe Card
Form
PCIe Card
Shading Units
9728 Shaders
Shading Units
10752 Shaders
Texture Mapping Units
304 TMUs
Texture Mapping Units
336 TMUs
Render Output Units
112 ROPs
Render Output Units
112 ROPs
Tensor Cores
304 T-Cores
Tensor Cores
336 T-Cores
Ray-Tracing Cores
76 RT-Cores
Ray-Tracing Cores
84 RT-Cores
Streaming Multiprocessors
76 SMs
Streaming Multiprocessors
84 SMs
Graphics Processing Clusters
6 GPCs
Graphics Processing Clusters
7 GPCs
2205MHz Base
2505MHz Boost
1560MHz Base
1905MHz Boost
Peak AI Performance
1.56 POPS
INT4 Tensor Sparse
Peak AI Performance
1.31 POPS
INT4 Tensor Sparse
FP8
389.9 TFLOPS Tensor (FP16 Accumulate)
779.8 TFLOPS Tensor (FP16 Accumulate) Sparse
194.95 TFLOPS Tensor (FP32 Accumulate)
389.9 TFLOPS Tensor (FP32 Accumulate) Sparse
-
-
-
-
-
FP16
48.74 TFLOPS
194.95 TFLOPS Tensor (FP16 Accumulate)
389.9 TFLOPS Tensor (FP16 Accumulate) Sparse
97.48 TFLOPS Tensor (FP32 Accumulate)
194.95 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
40.97 TFLOPS
163.86 TFLOPS Tensor (FP16 Accumulate)
327.72 TFLOPS Tensor (FP16 Accumulate) Sparse
81.93 TFLOPS Tensor (FP32 Accumulate)
163.86 TFLOPS Tensor (FP32 Accumulate) Sparse
FP32
48.74 TFLOPS
FP32
40.97 TFLOPS
FP64
760 GFLOPS
FP64
640 GFLOPS
BF16
48.74 TFLOPS
97.48 TFLOPS Tensor
194.95 TFLOPS Tensor Sparse
BF16
40.97 TFLOPS
81.93 TFLOPS Tensor
163.86 TFLOPS Tensor Sparse
TF32
48.74 TFLOPS Tensor
97.47 TFLOPS Tensor Sparse
TF32
40.97 TFLOPS Tensor
81.93 TFLOPS Tensor Sparse
INT4
779.8 TOPS Tensor
1.56 POPS Tensor Sparse
INT4
655.44 TOPS Tensor
1.31 POPS Tensor Sparse
INT8
389.9 TOPS Tensor
779.8 TOPS Tensor Sparse
INT8
327.72 TOPS Tensor
655.44 TOPS Tensor Sparse
INT32
24.37 TOPS
INT32
20.48 TOPS
Ray Tracing
112.7 TOPS
Ray Tracing
80 TOPS
Pixel Fillrate
280.56 GPixel/s
Pixel Fillrate
213.36 GPixel/s
Texture Fillrate
761.52 GTexel/s
Texture Fillrate
640.08 GTexel/s
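
The FP32 and fillrate figures listed above are consistent with the usual theoretical-peak formulas: shaders x 2 x boost clock for FP32, ROPs x boost clock for pixel fillrate, and TMUs x boost clock for texture fillrate. The page does not state how its numbers were derived, so the sketch below is an assumption that happens to reproduce them; the `Gpu` class and its method names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Gpu:
    name: str
    shaders: int
    tmus: int
    rops: int
    boost_mhz: int

    def fp32_tflops(self) -> float:
        # Assumes 2 FLOPs per shader per clock (one fused multiply-add).
        return self.shaders * 2 * self.boost_mhz / 1e6

    def pixel_fillrate_gpixel(self) -> float:
        # Assumes one pixel per ROP per clock.
        return self.rops * self.boost_mhz / 1e3

    def texture_fillrate_gtexel(self) -> float:
        # Assumes one texel per TMU per clock.
        return self.tmus * self.boost_mhz / 1e3


# Unit counts and boost clocks as listed on this page.
rtx4080 = Gpu("GeForce RTX 4080", 9728, 304, 112, 2505)
rtx3090ti = Gpu("GeForce RTX 3090 Ti Gaming OC", 10752, 336, 112, 1905)

for gpu in (rtx4080, rtx3090ti):
    print(
        f"{gpu.name}: {gpu.fp32_tflops():.2f} TFLOPS FP32, "
        f"{gpu.pixel_fillrate_gpixel():.2f} GPixel/s, "
        f"{gpu.texture_fillrate_gtexel():.2f} GTexel/s"
    )
```

These are theoretical peaks at the rated boost clock; sustained clocks and real workloads will differ.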
L1
64KB/SM Tex
128KB/SM
L1
64KB/SM Tex
128KB/SM
L2
64MB Shared
L2
6MB Shared
16GB
GDDR6X
24GB
GDDR6X
Bus Width
256Bit
Bus Width
384Bit
Clock
1400MHz
Transfer Rate
22.4GT/s
Bandwidth
716.8GB/s
Clock
1313MHz
Transfer Rate
21GT/s
Bandwidth
1008GB/s
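
The bandwidth figures above follow from bus width and transfer rate: bytes per transfer (bus width divided by 8) multiplied by transfers per second. A minimal Python sketch, assuming that relationship; the function name `memory_bandwidth_gb_per_s` is hypothetical.

```python
def memory_bandwidth_gb_per_s(bus_width_bits: int, transfer_rate_gt_per_s: float) -> float:
    """Theoretical peak memory bandwidth in GB/s."""
    bytes_per_transfer = bus_width_bits / 8
    return bytes_per_transfer * transfer_rate_gt_per_s


# Bus widths and transfer rates as listed on this page.
rtx4080_bw = memory_bandwidth_gb_per_s(256, 22.4)
rtx3090ti_bw = memory_bandwidth_gb_per_s(384, 21.0)

print(f"GeForce RTX 4080: {rtx4080_bw:.1f} GB/s")
print(f"GeForce RTX 3090 Ti Gaming OC: {rtx3090ti_bw:.1f} GB/s")
```

The wider 384-bit bus is what gives the RTX 3090 Ti its bandwidth lead despite the RTX 4080's higher transfer rate.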
TDP
320W
TDP
450W
Temp
90°C Max
Temp
92°C Max
3x DisplayPort 1.4
1x HDMI 2.1
3x DisplayPort 1.4
1x HDMI 2.1
Max Resolution
7680x4320
Max Resolution
7680x4320
Max Resolution Refresh Rate
60Hz
Max Resolution Refresh Rate
60Hz
Variable Refresh Rate
G-Sync
FreeSync
Variable Refresh Rate
G-Sync
FreeSync
Display Stream Compression (DSC)
Supported
Display Stream Compression (DSC)
Supported
Multi Monitor Support
4
Multi Monitor Support
4
Content Protection
HDCP 2.3
Content Protection
HDCP 2.3
Model
2x NVENC 8
Model
NVENC 7
Codec
AVC (H.264)
HEVC (H.265)
AV1
Codec
AVC (H.264)
HEVC (H.265)
-
Model
NVDEC 5
Model
NVDEC 5
Codec
MPEG-1
MPEG-2
MPEG-4
VC-1
VP8
VP9
AVC (H.264)
HEVC (H.265)
AV1
Codec
MPEG-1
MPEG-2
MPEG-4
VC-1
VP8
VP9
AVC (H.264)
HEVC (H.265)
AV1
Direct X
12
Direct 3D
12_3
Direct X
12
Direct 3D
12_2
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
OpenGL
4.6
OpenCL
3.0
Vulkan
1.2
Shader Model
6.6
CUDA
8.9
PureVideo HD
VP12
VDPAU
Feature Set L
Shader Model
6.6
CUDA
8.6
PureVideo HD
VP11
VDPAU
Feature Set K
2x Fans
3x Fans
Power Connectors
1x 16-Pin 12VHPWR
Power Connectors
1x 16-Pin 12VHPWR
Slots Required
3.0
PCIe Version
4.0
PCIe Lanes
16
Slots Required
3.4
PCIe Version
4.0
PCIe Lanes
16
-
-
-
-
Multi GPU Support
Supported
Type
2-way NVLink
Height
137 mm (5.39 in)
Width
304 mm (11.97 in)
Depth
61 mm (2.4 in)
Height
150 mm (5.91 in)
Width
331 mm (13.03 in)
Depth
70 mm (2.76 in)