GIGABYTE GeForce RTX 3090 Ti Gaming OC vs NVIDIA GeForce RTX 4080 Super

GIGABYTE GeForce RTX 3090 Ti Gaming OC $1,999
10752 Shaders
24GB GDDR6X
1905MHz
NVIDIA GeForce RTX 4080 Super $999
10240 Shaders
16GB GDDR6X
2550MHz
Peak AI Performance
1.31 POPS
INT4 Tensor Sparse
Peak AI Performance
1.67 POPS
INT4 Tensor Sparse
FP32
40.97 TFLOPS
FP32
52.22 TFLOPS
FP16
40.97 TFLOPS
FP16
52.22 TFLOPS
Form Factor
PCIe Card
3.4-Slots
Form Factor
PCIe Card
3.0-Slots
TDP
450W
TDP
320W
Power Connectors
1x 16-Pin 12VHPWR
Power Connectors
1x 16-Pin 12VHPWR

Peak AI Performance

  • 22% slower vs GeForce RTX 4080 Super
  • 27% faster vs GeForce RTX 3090 Ti Gaming OC
GeForce RTX 3090 Ti Gaming OC - 1.31 POPS INT4 Tensor Sparse
x1
GeForce RTX 4080 Super - 1.67 POPS INT4 Tensor Sparse
x1.27

FP32

  • 22% slower vs GeForce RTX 4080 Super
  • 27% faster vs GeForce RTX 3090 Ti Gaming OC
GeForce RTX 3090 Ti Gaming OC - 40.97 TFLOPS FP32
x1
GeForce RTX 4080 Super - 52.22 TFLOPS FP32
x1.27

FP16

  • 22% slower vs GeForce RTX 4080 Super
  • 27% faster vs GeForce RTX 3090 Ti Gaming OC
GeForce RTX 3090 Ti Gaming OC - 40.97 TFLOPS FP16
x1
GeForce RTX 4080 Super - 52.22 TFLOPS FP16
x1.27

Memory Bandwidth

  • 41% higher vs GeForce RTX 4080 Super
  • 29% lower vs GeForce RTX 3090 Ti Gaming OC
GeForce RTX 3090 Ti Gaming OC - 1008GB/s
x1.41
GeForce RTX 4080 Super - 716.8GB/s
x1

TDP

  • 41% higher vs GeForce RTX 4080 Super
  • 29% lower vs GeForce RTX 3090 Ti Gaming OC
GeForce RTX 3090 Ti Gaming OC - 450W
x1.41
GeForce RTX 4080 Super - 320W
x1

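The "faster", "slower", "higher" and "lower" percentages in the comparisons above appear to be plain ratios of the two listed figures. The sketch below recomputes them from the values quoted on this page. The helper name relative_change is hypothetical, and rounding to whole percent is an assumption about how the page presents its numbers.

```python
def relative_change(value: float, baseline: float) -> float:
    """Signed percentage difference of `value` relative to `baseline`."""
    return (value / baseline - 1.0) * 100.0


# Figures copied from the comparison above, ordered as
# (RTX 3090 Ti Gaming OC, RTX 4080 Super).
metrics = {
    "Peak AI (POPS, INT4 Tensor Sparse)": (1.31, 1.67),
    "FP32 (TFLOPS)": (40.97, 52.22),
    "Memory bandwidth (GB/s)": (1008.0, 716.8),
    "TDP (W)": (450.0, 320.0),
}

for name, (rtx_3090_ti, rtx_4080_super) in metrics.items():
    ti_vs_super = relative_change(rtx_3090_ti, rtx_4080_super)
    super_vs_ti = relative_change(rtx_4080_super, rtx_3090_ti)
    print(
        f"{name}: 3090 Ti {ti_vs_super:+.0f}% vs 4080 Super, "
        f"4080 Super {super_vs_ti:+.0f}% vs 3090 Ti"
    )
```

The two percentages in each pair are not symmetric because each uses a different baseline: 27% faster in one direction corresponds to 22% slower in the other.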
GB6 OpenCL N/A
GB6 Vulkan N/A
GB5 OpenCL 230,890
GB5 OpenCL N/A
GB5 CUDA 268,250
GB5 CUDA N/A
GB5 Vulkan 155,325
GB5 Vulkan N/A
OCT 2020.1 755
OCT 2020.1 N/A
Manufacturer
GIGABYTE
Manufacturer
NVIDIA
Chip Designer
NVIDIA
Chip Designer
NVIDIA
Architecture
Ampere
Architecture
Ada Lovelace
Family
GeForce 30
Family
GeForce 40
Codename
NV172
GA102
Variant
GA102-350-A1
Codename
NV183
AD103
Variant
AD103-400-A1
Market Segment
Desktop
Market Segment
Desktop
Release Date
3/29/2022
Release Date
1/8/2024
Foundry
Samsung
Foundry
TSMC
Fabrication Node
8N
Fabrication Node
4N
Die Size
628 mm²
Die Size
379 mm²
Transistor Count
28.3 Billion
Transistor Count
45.9 Billion
Transistor Density
45.04M/mm²
Transistor Density
121.24M/mm²
Form
PCIe Card
Form
PCIe Card
Shading Units
10752 Shaders
Shading Units
10240 Shaders
Texture Mapping Units
336 TMUs
Texture Mapping Units
320 TMUs
Render Output Units
112 ROPs
Render Output Units
112 ROPs
Tensor Cores
336 T-Cores
Tensor Cores
320 T-Cores
Ray-Tracing Cores
84 RT-Cores
Ray-Tracing Cores
80 RT-Cores
Streaming Multiprocessors
84 SMs
Streaming Multiprocessors
80 SMs
Graphics Processing Clusters
7 GPCs
Graphics Processing Clusters
7 GPCs
1560MHz Base
1905MHz Boost
2295MHz Base
2550MHz Boost
Peak AI Performance
1.31 POPS
INT4 Tensor Sparse
Peak AI Performance
1.67 POPS
INT4 Tensor Sparse
-
-
-
-
-
FP8
417.79 TFLOPS Tensor (FP16 Accumulate)
835.58 TFLOPS Tensor (FP16 Accumulate) Sparse
208.9 TFLOPS Tensor (FP32 Accumulate)
417.79 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
40.97 TFLOPS
163.86 TFLOPS Tensor (FP16 Accumulate)
327.72 TFLOPS Tensor (FP16 Accumulate) Sparse
81.93 TFLOPS Tensor (FP32 Accumulate)
163.86 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
52.22 TFLOPS
208.9 TFLOPS Tensor (FP16 Accumulate)
417.79 TFLOPS Tensor (FP16 Accumulate) Sparse
104.45 TFLOPS Tensor (FP32 Accumulate)
208.9 TFLOPS Tensor (FP32 Accumulate) Sparse
FP32
40.97 TFLOPS
FP32
52.22 TFLOPS
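The FP32 figures above are consistent with the usual theoretical-peak estimate: shader count times two floating-point operations per clock (one fused multiply-add) times the clock speed. The sketch below reproduces both values under that assumption, treating 1905MHz and 2550MHz as the boost clocks. The function name is hypothetical, and real workloads rarely sustain this peak.

```python
def peak_fp32_tflops(shaders: int, clock_mhz: float) -> float:
    """Theoretical FP32 peak, assuming one FMA (2 FLOPs) per shader per clock."""
    flops_per_second = shaders * 2 * clock_mhz * 1e6
    return flops_per_second / 1e12


# Shader counts and clocks as listed on this page.
print(f"RTX 3090 Ti Gaming OC: {peak_fp32_tflops(10752, 1905):.2f} TFLOPS")
print(f"RTX 4080 Super:        {peak_fp32_tflops(10240, 2550):.2f} TFLOPS")
```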
FP64
640 GFLOPS
FP64
820 GFLOPS
BF16
40.97 TFLOPS
81.93 TFLOPS Tensor
163.86 TFLOPS Tensor Sparse
BF16
52.22 TFLOPS
104.45 TFLOPS Tensor
208.9 TFLOPS Tensor Sparse
TF32
40.97 TFLOPS Tensor
81.93 TFLOPS Tensor Sparse
TF32
52.22 TFLOPS Tensor
104.45 TFLOPS Tensor Sparse
INT4
655.44 TOPS Tensor
1.31 POPS Tensor Sparse
INT4
835.58 TOPS Tensor
1.67 POPS Tensor Sparse
INT8
327.72 TOPS Tensor
655.44 TOPS Tensor Sparse
INT8
417.79 TOPS Tensor
835.58 TOPS Tensor Sparse
INT32
20.48 TOPS
INT32
26.11 TOPS
Ray Tracing
80 TOPS
Ray Tracing
120.7 TOPS
Pixel Fillrate
213.36 GPixel/s
Pixel Fillrate
285.6 GPixel/s
Texture Fillrate
640.08 GTexel/s
Texture Fillrate
816 GTexel/s
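The fill rates above match a simple product of unit count and clock speed: ROPs times clock for pixels, TMUs times clock for texels. The sketch below checks this against the listed unit counts, assuming one pixel or texel per unit per clock at the 1905MHz and 2550MHz clocks. The function name is hypothetical.

```python
def fillrate_giga_per_s(units: int, clock_mhz: float) -> float:
    """Theoretical fill rate in G(pixel|texel)/s, assuming one output per unit per clock."""
    return units * clock_mhz / 1000.0


# (ROPs, TMUs, clock in MHz) as listed on this page.
cards = {
    "RTX 3090 Ti Gaming OC": (112, 336, 1905),
    "RTX 4080 Super": (112, 320, 2550),
}

for name, (rops, tmus, clock_mhz) in cards.items():
    pixel = fillrate_giga_per_s(rops, clock_mhz)
    texel = fillrate_giga_per_s(tmus, clock_mhz)
    print(f"{name}: {pixel:.2f} GPixel/s, {texel:.2f} GTexel/s")
```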
L1
64KB/SM Tex
128KB/SM
L1
64KB/SM Tex
128KB/SM
L2
6MB Shared
L2
64MB Shared
24GB
GDDR6X
16GB
GDDR6X
Bus Width
384Bit
Bus Width
256Bit
Clock
1313MHz
Transfer Rate
21GT/s
Bandwidth
1008GB/s
Clock
1400MHz
Transfer Rate
22.4GT/s
Bandwidth
716.8GB/s
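The bandwidth figures above follow from the bus width and transfer rate: each transfer moves one bit per bus line, so bandwidth in GB/s is the bus width in bytes times the transfer rate in GT/s. A minimal sketch using the values listed on this page (the function name is hypothetical):

```python
def memory_bandwidth_gb_s(bus_width_bits: int, transfer_rate_gt_s: float) -> float:
    """Theoretical memory bandwidth: bytes per transfer times transfers per second."""
    bytes_per_transfer = bus_width_bits / 8
    return bytes_per_transfer * transfer_rate_gt_s


# Bus widths and transfer rates as listed on this page.
print(f"RTX 3090 Ti Gaming OC: {memory_bandwidth_gb_s(384, 21.0):.1f} GB/s")
print(f"RTX 4080 Super:        {memory_bandwidth_gb_s(256, 22.4):.1f} GB/s")
```

The 3090 Ti's wider 384-bit bus outweighs the 4080 Super's slightly higher transfer rate, which is where its bandwidth advantage comes from.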
TDP
450W
TDP
320W
Temp
92°C Max
Temp
90°C Max
3x DisplayPort 1.4
1x HDMI 2.1
3x DisplayPort 1.4
1x HDMI 2.1
Max Resolution
7680x4320
Max Resolution
7680x4320
Max Resolution Refresh Rate
60Hz
Max Resolution Refresh Rate
60Hz
Variable Refresh Rate
G-Sync
FreeSync
Variable Refresh Rate
G-Sync
FreeSync
Display Stream Compression (DSC)
Supported
Display Stream Compression (DSC)
Supported
Multi Monitor Support
4
Multi Monitor Support
4
Content Protection
HDCP 2.3
Content Protection
HDCP 2.3
Model
NVENC 7
Model
2x NVENC 8
Codec
AVC (H.264)
HEVC (H.265)
-
Codec
AVC (H.264)
HEVC (H.265)
AV1
Model
NVDEC 5
Model
NVDEC 5
Codec
MPEG-1
MPEG-2
MPEG-4
VC-1
VP8
VP9
AVC (H.264)
HEVC (H.265)
AV1
Codec
MPEG-1
MPEG-2
MPEG-4
VC-1
VP8
VP9
AVC (H.264)
HEVC (H.265)
AV1
DirectX
12
Direct3D
12_2
DirectX
12
Direct3D
12_3
OpenGL
4.6
OpenCL
3.0
Vulkan
1.2
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
Shader Model
6.6
CUDA
8.6
PureVideo HD
VP11
VDPAU
Feature Set K
Shader Model
6.6
CUDA
8.9
PureVideo HD
VP12
VDPAU
Feature Set L
3x Fans
2x Fans
Power Connectors
1x 16-Pin 12VHPWR
Power Connectors
1x 16-Pin 12VHPWR
Slots Required
3.4
PCIe Version
4.0
PCIe Lanes
16
Slots Required
3.0
PCIe Version
4.0
PCIe Lanes
16
Multi GPU Support
Supported
Type
2-way NVLink
-
-
-
-
Height
150 mm (5.91 in)
Width
331 mm (13.03 in)
Depth
70 mm (2.76 in)
Height
137 mm (5.39 in)
Width
304 mm (11.97 in)
Depth
61 mm (2.4 in)