NVIDIA GeForce RTX 4090 D vs NVIDIA GeForce RTX 3090 Ti
NVIDIA GeForce RTX 4090 D
$1,599
NVIDIA GeForce RTX 3090 Ti
$1,999
|
14592 Shaders
24GB GDDR6X
2520MHz
|
10752 Shaders
24GB GDDR6X
1860MHz
|
|
Peak AI Performance
2.35 POPS
INT4 Tensor Sparse
|
Peak AI Performance
1.28 POPS
INT4 Tensor Sparse
|
|
FP32
73.54 TFLOPS
|
FP32
40 TFLOPS
|
|
FP16
73.54 TFLOPS
|
FP16
40 TFLOPS
|
|
Form Factor
PCIe Card
3.0-Slots
|
Form Factor
PCIe Card
3.0-Slots
|
|
TDP
450W
|
TDP
450W
|
|
Power Connectors
-
-
-
1x 16-Pin 12VHPWR
|
Power Connectors
-
-
1x 12-Pin
-
|
|
GB6 OpenCL N/A
0%
|
|
|
GB6 Metal N/A
0%
|
GB6 Metal N/A
0%
|
|
GB6 Vulkan N/A
0%
|
|
GB5 OpenCL N/A
0%
|
GB5 OpenCL 225,435
74%
|
|
GB5 CUDA N/A
0%
|
GB5 CUDA 261,915
74%
|
|
GB5 Metal N/A
0%
|
GB5 Metal N/A
0%
|
|
GB5 Vulkan N/A
0%
|
GB5 Vulkan 151,655
74%
|
|
OCT 2020.1 N/A
0%
|
OCT 2020.1 735
97%
|
|
OCT Metal N/A
0%
|
OCT Metal N/A
0%
|
|
Peak AI
Performance
2.35 POPS
INT4 Tensor Sparse
|
Peak AI
Performance
1.28 POPS
INT4 Tensor Sparse
|
|
-
-
-
|
-
-
-
|
|
FP8
-
588.35 TFLOPS
Tensor (FP16 Accumulate)
1.18 PFLOPS
Tensor (FP16 Accumulate) Sparse
294.18 TFLOPS
Tensor (FP32 Accumulate)
588.35 TFLOPS
Tensor (FP32 Accumulate) Sparse
|
-
-
-
-
-
-
|
|
FP16
73.54 TFLOPS
294.18 TFLOPS
Tensor (FP16 Accumulate)
588.35 TFLOPS
Tensor (FP16 Accumulate) Sparse
147.09 TFLOPS
Tensor (FP32 Accumulate)
294.17 TFLOPS
Tensor (FP32 Accumulate) Sparse
|
FP16
40 TFLOPS
159.99 TFLOPS
Tensor (FP16 Accumulate)
319.98 TFLOPS
Tensor (FP16 Accumulate) Sparse
80 TFLOPS
Tensor (FP32 Accumulate)
159.99 TFLOPS
Tensor (FP32 Accumulate) Sparse
|
|
FP32
73.54 TFLOPS
-
-
|
FP32
40 TFLOPS
-
-
|
|
FP64
1.15 TFLOPS
-
|
FP64
630 GFLOPS
-
|
|
BF16
73.54 TFLOPS
147.09 TFLOPS
Tensor
294.17 TFLOPS
Tensor Sparse
|
BF16
40 TFLOPS
80 TFLOPS
Tensor
159.99 TFLOPS
Tensor Sparse
|
|
TF32
73.54 TFLOPS
Tensor
147.09 TFLOPS
Tensor Sparse
|
TF32
40 TFLOPS
Tensor
79.99 TFLOPS
Tensor Sparse
|
|
INT4
1.18 POPS
Tensor
2.35 POPS
Tensor Sparse
|
INT4
639.96 TOPS
Tensor
1.28 POPS
Tensor Sparse
|
|
INT8
-
588.35 TOPS
Tensor
1.18 POPS
Tensor Sparse
|
INT8
-
319.98 TOPS
Tensor
639.96 TOPS
Tensor Sparse
|
|
INT32
36.77 TOPS
|
INT32
20 TOPS
|
|
Ray Tracing
170 TOPS
|
Ray Tracing
78.1 TOPS
|
|
Pixel
Fillrate
443.52 GPixel/s
|
Pixel
Fillrate
208.32 GPixel/s
|
|
-
-
|
-
-
|
|
Texture
Fillrate
1149.12 GTexel/s
|
Texture
Fillrate
624.96 GTexel/s
|
|
Manufacturer
NVIDIA
|
Manufacturer
NVIDIA
|
|
Chip Designer
NVIDIA
|
Chip Designer
NVIDIA
|
|
Architecture
Ada Lovelace
|
Architecture
Ampere
|
|
Family
GeForce 40
|
Family
GeForce 30
|
|
Codename
NV182
AD102
Variant
AD102-250-A1
|
Codename
NV172
GA102
Variant
GA102-350-A1
|
|
Market Segment
Desktop
|
Market Segment
Desktop
|
|
Release Date
12/28/2023
|
Release Date
3/29/2022
|
|
Foundry
TSMC
-
|
Foundry
Samsung
-
|
|
Fabrication Node
4N
-
|
Fabrication Node
8N
-
|
|
Die Size
608 mm²
-
|
Die Size
628 mm²
-
|
|
Transistor Count
76.3 Billion
-
|
Transistor Count
28.3 Billion
-
|
|
Transistor Density
125.41M/mm²
-
|
Transistor Density
45.04M/mm²
-
|
|
Form
PCIe Card
|
Form
PCIe Card
|
|
Shading Units
14592 Shaders
-
|
Shading Units
10752 Shaders
-
|
|
Texture Mapping Units
456 TMUs
|
Texture Mapping Units
336 TMUs
|
|
Render Output Units
176 ROPs
|
Render Output Units
112 ROPs
|
|
Tensor Cores
456 T-Cores
|
Tensor Cores
336 T-Cores
|
|
Ray-Tracing Cores
114 RT-Cores
|
Ray-Tracing Cores
84 RT-Cores
|
|
Streaming Multiprocessors
114 SMs
|
Streaming Multiprocessors
84 SMs
|
|
-
-
|
-
-
|
|
-
-
|
-
-
|
|
Graphics Processing Clusters
10 GPCs
|
Graphics Processing Clusters
7 GPCs
|
|
-
-
2230MHz Base
2520MHz
|
-
-
1560MHz Base
1860MHz
|
|
-
-
|
-
-
|
|
L1
64KB/SM Tex
128KB/SM
-
-
|
L1
64KB/SM Tex
128KB/SM
-
-
|
|
L2
72MB Shared
|
L2
6MB Shared
|
|
-
-
-
|
-
-
-
|
|
24GB
GDDR6X
-
|
24GB
GDDR6X
-
|
|
Bus Width
384Bit
|
Bus Width
384Bit
|
|
Clock
1313MHz
Transfer Rate
21GT/s
Bandwidth
1008GB/s
|
Clock
1313MHz
Transfer Rate
21GT/s
Bandwidth
1008GB/s
|
|
-
-
-
-
-
-
-
-
-
|
-
-
-
-
-
-
-
-
-
|
|
TDP
450W
|
TDP
450W
|
|
Temp
90°C Max
|
Temp
92°C Max
|
|
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
3x DisplayPort 1.4
-
-
-
-
-
-
-
-
-
-
-
1x HDMI 2.1
-
-
|
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
3x DisplayPort 1.4
-
-
-
-
-
-
-
-
-
-
-
1x HDMI 2.1
-
-
|
|
Max Resolution
7680x4320
|
Max Resolution
7680x4320
|
|
Max Resolution Refresh Rate
60Hz
|
Max Resolution Refresh Rate
60Hz
|
|
Variable Refresh Rate
G-Sync
FreeSync
-
|
Variable Refresh Rate
G-Sync
FreeSync
-
|
|
Display Stream Compression (DSC)
Supported
|
Display Stream Compression (DSC)
Supported
|
|
Multi Monitor Support
4
|
Multi Monitor Support
4
|
|
Content Protection
HDCP 2.3
|
Content Protection
HDCP 2.3
|
|
Model
2x NVENC 8
|
Model
NVENC 7
|
|
Codec
-
-
-
-
-
-
-
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
|
Codec
-
-
-
-
-
-
-
-
AVC (H.264)
HEVC (H.265)
-
-
-
-
|
|
Model
NVDEC 5
|
Model
NVDEC 5
|
|
Codec
MPEG-1
MPEG-2
MPEG-4
-
VC-1
VP8
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
|
Codec
MPEG-1
MPEG-2
MPEG-4
-
VC-1
VP8
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
|
|
Direct X
12
Direct 3D
12_3
|
Direct X
12
Direct 3D
12_2
|
|
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
|
OpenGL
4.6
OpenCL
3.0
Vulkan
1.2
|
|
Shader Model
6.6
CUDA
8.9
-
-
PureVideo HD
VP12
VDPAU
Feature Set L
|
Shader Model
6.6
CUDA
8.6
-
-
PureVideo HD
VP11
VDPAU
Feature Set K
|
|
-
-
-
2x Fans
|
-
-
-
2x Fans
|
|
Power Connectors
-
-
-
-
-
-
1x 16-Pin 12VHPWR
|
Power Connectors
-
-
-
-
-
1x 12-Pin
-
|
|
Slots Required
3.0
PCIe Version
4.0
PCIe Lanes
16
|
Slots Required
3.0
PCIe Version
4.0
PCIe Lanes
16
|
|
-
-
-
-
|
Multi GPU Support
Supported
Type
2-way NVLink
|
|
Height
137 mm (5.39 in)
Width
304 mm (11.97 in)
Depth
61 mm (2.4 in)
|
Height
140 mm (5.51 in)
Width
336 mm (13.23 in)
Depth
61 mm (2.4 in)
|
Copy Link