NVIDIA 8x GeForce RTX 4090 vs NVIDIA GeForce RTX 3070 Ti

NVIDIA 8x GeForce RTX 4090 $12,792
NVIDIA GeForce RTX 3070 Ti $600
8x 16384 Shaders
192GB (8x 24GB) GDDR6X
2520MHz
6144 Shaders
8GB GDDR6X
1770MHz
Peak AI Performance
21.14 POPS
INT4 Tensor Sparse
Peak AI Performance
695.99 TOPS
INT4 Tensor Sparse
FP32
660.6 TFLOPS
FP32
21.75 TFLOPS
FP16
660.6 TFLOPS
FP16
21.75 TFLOPS
Form Factor
PCIe Card
8x 3.0-Slots
Form Factor
PCIe Card
2.1-Slots
TDP
3600W
TDP
290W
Power Connectors
-
-
-
1x 16-Pin 12VHPWR
Power Connectors
-
-
1x 12-Pin
-
GB6 OpenCL N/A
0%
GB6 Metal N/A
0%
GB6 Metal N/A
0%
GB6 Vulkan N/A
0%
GB5 OpenCL N/A
0%
GB5 OpenCL 153,225
50%
GB5 CUDA N/A
0%
GB5 CUDA 166,375
47%
GB5 Metal N/A
0%
GB5 Metal N/A
0%
GB5 Vulkan N/A
0%
GB5 Vulkan 95,755
47%
OCT 2020.1 N/A
0%
OCT 2020.1 455
60%
OCT Metal N/A
0%
OCT Metal N/A
0%
Manufacturer
NVIDIA
Manufacturer
NVIDIA
Chip Designer
NVIDIA
Chip Designer
NVIDIA
Architecture
Ada Lovelace
Architecture
Ampere
Family
GeForce 40
Family
GeForce 30
Codename
NV182
AD102
Variant
AD102-300-A1
Codename
NV174
GA104
Variant
GA104-400-A1
Market Segment
Desktop
Market Segment
Desktop
Release Date
10/12/2022
Release Date
6/10/2021
Foundry
TSMC
-
Foundry
Samsung
-
Fabrication Node
4N
-
Fabrication Node
8N
-
Die Size
8x 608 mm²
-
Die Size
393 mm²
-
Transistor Count
8x 76.3 Billion
-
Transistor Count
17.4 Billion
-
Transistor Density
125.41M/mm²
-
Transistor Density
44.33M/mm²
-
Form
PCIe Card
Form
PCIe Card
Shading Units
8x 16384 Shaders
-
Shading Units
6144 Shaders
-
Texture Mapping Units
8x 512 TMUs
Texture Mapping Units
192 TMUs
Render Output Units
8x 176 ROPs
Render Output Units
96 ROPs
Tensor Cores
8x 512 T-Cores
Tensor Cores
192 T-Cores
Ray-Tracing Cores
8x 128 RT-Cores
Ray-Tracing Cores
48 RT-Cores
Streaming Multiprocessors
8x 128 SMs
Streaming Multiprocessors
48 SMs
-
-
-
-
-
-
-
-
Graphics Processing Clusters
8x 11 GPCs
Graphics Processing Clusters
6 GPCs
-
-
2230MHz Base
2520MHz
-
-
1575MHz Base
1770MHz
Peak AI Performance
21.14 POPS
INT4 Tensor Sparse
Peak AI Performance
695.99 TOPS
INT4 Tensor Sparse
-
-
-
-
-
-
FP8
-
5.28 PFLOPS Tensor (FP16 Accumulate)
10.57 PFLOPS Tensor (FP16 Accumulate) Sparse
2.64 PFLOPS Tensor (FP32 Accumulate)
5.28 PFLOPS Tensor (FP32 Accumulate) Sparse
-
-
-
-
-
-
FP16
660.6 TFLOPS
2.64 PFLOPS Tensor (FP16 Accumulate)
5.28 PFLOPS Tensor (FP16 Accumulate) Sparse
1.32 PFLOPS Tensor (FP32 Accumulate)
2.64 PFLOPS Tensor (FP32 Accumulate) Sparse
FP16
21.75 TFLOPS
87 TFLOPS Tensor (FP16 Accumulate)
174 TFLOPS Tensor (FP16 Accumulate) Sparse
43.5 TFLOPS Tensor (FP32 Accumulate)
87 TFLOPS Tensor (FP32 Accumulate) Sparse
FP32
660.6 TFLOPS
-
-
FP32
21.75 TFLOPS
-
-
FP64
10.32 TFLOPS
-
FP64
340 GFLOPS
-
BF16
660.6 TFLOPS
1.32 PFLOPS Tensor
2.64 PFLOPS Tensor Sparse
BF16
21.75 TFLOPS
43.5 TFLOPS Tensor
87 TFLOPS Tensor Sparse
TF32
660.6 TFLOPS Tensor
1.32 PFLOPS Tensor Sparse
TF32
21.75 TFLOPS Tensor
43.5 TFLOPS Tensor Sparse
INT4
10.57 POPS Tensor
21.14 POPS Tensor Sparse
INT4
348 TOPS Tensor
695.99 TOPS Tensor Sparse
INT8
-
5.28 POPS Tensor
10.57 POPS Tensor Sparse
INT8
-
174 TOPS Tensor
348 TOPS Tensor Sparse
INT32
330.3 TOPS
INT32
10.88 TOPS
Ray Tracing
190.9 TOPS
Ray Tracing
42.5 TOPS
Pixel Fillrate
443.52 GPixel/s
Pixel Fillrate
169.92 GPixel/s
-
-
-
-
Texture Fillrate
1290.24 GTexel/s
Texture Fillrate
339.84 GTexel/s
-
-
-
-
L1
64KB/SM Tex
128KB/SM
-
-
L1
64KB/SM Tex
128KB/SM
-
-
L2
72MB Shared
L2
4MB Shared
-
-
-
-
-
-
192GB (8x 24GB)
GDDR6X
-
8GB
GDDR6X
-
Bus Width
384Bit
Bus Width
256Bit
Clock
1313MHz
Transfer Rate
21GT/s
Bandwidth
1008GB/s
Clock
1188MHz
Transfer Rate
19GT/s
Bandwidth
608GB/s
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
TDP
3600W
TDP
290W
Temp
90°C Max
Temp
93°C Max
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
24x DisplayPort 1.4
-
-
-
-
-
-
-
-
-
-
-
8x HDMI 2.1
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
3x DisplayPort 1.4
-
-
-
-
-
-
-
-
-
-
-
1x HDMI 2.1
-
-
Max Resolution
7680x4320
Max Resolution
7680x4320
Max Resolution Refresh Rate
60Hz
Max Resolution Refresh Rate
60Hz
Variable Refresh Rate
G-Sync
FreeSync
-
Variable Refresh Rate
G-Sync
FreeSync
-
Display Stream Compression (DSC)
Supported
Display Stream Compression (DSC)
Supported
Multi Monitor Support
4
Multi Monitor Support
4
Content Protection
HDCP 2.3
Content Protection
HDCP 2.3
Model
2x NVENC 8
Model
NVENC 7
Codec
-
-
-
-
-
-
-
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Codec
-
-
-
-
-
-
-
-
AVC (H.264)
HEVC (H.265)
-
-
-
-
Model
NVDEC 5
Model
NVDEC 5
Codec
MPEG-1
MPEG-2
MPEG-4
-
VC-1
VP8
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Codec
MPEG-1
MPEG-2
MPEG-4
-
VC-1
VP8
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Direct X
12
Direct 3D
12_3
Direct X
12
Direct 3D
12_2
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
OpenGL
4.6
OpenCL
3.0
Vulkan
1.2
Shader Model
6.6
CUDA
8.9
-
-
PureVideo HD
VP12
VDPAU
Feature Set L
Shader Model
6.6
CUDA
8.6
-
-
PureVideo HD
VP11
VDPAU
Feature Set K
-
-
-
2x Fans
-
-
-
2x Fans
Power Connectors
-
-
-
-
-
-
1x 16-Pin 12VHPWR
Power Connectors
-
-
-
-
-
1x 12-Pin
-
Slots Required
3.0
PCIe Version
4.0
PCIe Lanes
16
Slots Required
2.1
PCIe Version
4.0
PCIe Lanes
16
-
-
-
-
-
-
-
-
Height
137 mm (5.39 in)
Width
304 mm (11.97 in)
Depth
61 mm (2.4 in)
Height
112 mm (4.41 in)
Width
267 mm (10.51 in)
Depth
43 mm (1.69 in)
Change Comparison