NVIDIA GeForce RTX 4060 Ti vs AMD Radeon RX 6950 XT

NVIDIA GeForce RTX 4060 Ti $399
AMD Radeon RX 6950 XT $1,099
4352 Shaders
10GB GDDR6
2540MHz
5120 Shaders
16GB GDDR6
2324MHz
Peak AI Performance
707.46 TOPS
INT4 Tensor Sparse
Peak AI Performance
47.6 TFLOPS
FP16
FP32
22.11 TFLOPS
FP32
23.8 TFLOPS
FP16
22.11 TFLOPS
FP16
47.6 TFLOPS
Form Factor
PCIe Card
2.1-Slots
Form Factor
PCIe Card
2.5-Slots
TDP
160W
TDP
335W
Power Connectors
-
1x 16-Pin 12VHPWR
Power Connectors
2x 8-Pin
-

Peak AI Performance

  • 14.86x faster vs Radeon RX 6950 XT
  • 93% slower vs GeForce RTX 4060 Ti
GeForce RTX 4060 Ti - 707.46 TOPS INT4 Tensor Sparse
x14.86
Radeon RX 6950 XT - 47.6 TFLOPS FP16
x1

FP32

  • 7% slower vs Radeon RX 6950 XT
  • 8% faster vs GeForce RTX 4060 Ti
GeForce RTX 4060 Ti - 22.11 TFLOPS FP32
x1
Radeon RX 6950 XT - 23.8 TFLOPS FP32
x1.08

FP16

  • 54% slower vs Radeon RX 6950 XT
  • 2.15x faster vs GeForce RTX 4060 Ti
GeForce RTX 4060 Ti - 22.11 TFLOPS FP16
x1
Radeon RX 6950 XT - 47.6 TFLOPS FP16
x2.15
  • 38% slower vs Radeon RX 6950 XT
  • 60% faster vs GeForce RTX 4060 Ti
GeForce RTX 4060 Ti - 360GB/s
x1
Radeon RX 6950 XT - 576GB/s
x1.6
  • 52% lower vs Radeon RX 6950 XT
  • 109% higher vs GeForce RTX 4060 Ti
GeForce RTX 4060 Ti - 160W
x1
Radeon RX 6950 XT - 335W
x2.09
  • 30% slower vs Radeon RX 6950 XT
  • 42% faster vs GeForce RTX 4060 Ti
GeForce RTX 4060 Ti - GB6 OpenCL 132,650
x1
Radeon RX 6950 XT - GB6 OpenCL 188,500
x1.42
GB5 OpenCL N/A
0%
GB5 OpenCL 170,545
56%
GB5 Metal N/A
0%
GB5 Metal 167,895
93%
GB5 Vulkan N/A
0%
GB5 Vulkan 117,795
57%
OCT Metal N/A
0%
OCT Metal 345
59%
Manufacturer
NVIDIA
Manufacturer
AMD
Chip Designer
NVIDIA
Chip Designer
AMD
Architecture
Ada Lovelace
Architecture
RDNA 2
Family
GeForce 40
Family
Radeon RX 6000
Codename
NV186
AD106
Variant
AD106-350-A1
Codename
Sienna Cichlid
Navi 21
Variant
Navi 21 KXTX
Market Segment
Desktop
Market Segment
Desktop
Release Date
5/18/2023
Release Date
5/10/2022
Foundry
TSMC
Foundry
TSMC
Fabrication Node
4N
Fabrication Node
N7
Die Size
188 mm²
Die Size
520 mm²
Transistor Count
22.9 Billion
Transistor Count
26.8 Billion
Transistor Density
121.81M/mm²
Transistor Density
51.56M/mm²
Form
PCIe Card
Form
PCIe Card
Shading Units
4352 Shaders
Shading Units
5120 Shaders
Texture Mapping Units
136 TMUs
Texture Mapping Units
320 TMUs
Render Output Units
48 ROPs
Render Output Units
128 ROPs
Tensor Cores
136 T-Cores
-
-
Ray-Tracing Cores
34 RT-Cores
Ray-Tracing Cores
80 RT-Cores
Streaming Multiprocessors
34 SMs
-
-
-
-
Compute Units
80 CUs
Graphics Processing Clusters
3 GPCs
Graphics Processing Clusters
4 GPCs
2310MHz Base
2540MHz
1925MHz Base
2324MHz
Peak AI Performance
707.46 TOPS
INT4 Tensor Sparse
Peak AI Performance
47.6 TFLOPS
FP16
FP8
176.87 TFLOPS Tensor (FP16 Accumulate)
353.73 TFLOPS Tensor (FP16 Accumulate) Sparse
88.43 TFLOPS Tensor (FP32 Accumulate)
176.87 TFLOPS Tensor (FP32 Accumulate) Sparse
-
-
-
-
-
FP16
22.11 TFLOPS
88.43 TFLOPS Tensor (FP16 Accumulate)
176.87 TFLOPS Tensor (FP16 Accumulate) Sparse
44.22 TFLOPS Tensor (FP32 Accumulate)
88.43 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
47.6 TFLOPS
-
-
-
-
FP32
22.11 TFLOPS
FP32
23.8 TFLOPS
FP64
350 GFLOPS
FP64
1.49 TFLOPS
BF16
22.11 TFLOPS
44.22 TFLOPS Tensor
88.43 TFLOPS Tensor Sparse
-
-
-
-
TF32
22.11 TFLOPS Tensor
44.22 TFLOPS Tensor Sparse
-
-
-
INT4
353.73 TOPS Tensor
707.46 TOPS Tensor Sparse
-
-
-
INT8
176.87 TOPS Tensor
353.73 TOPS Tensor Sparse
-
-
-
INT32
11.05 TOPS
-
-
Ray Tracing
51.1 TOPS
-
-
Pixel Fillrate
121.92 GPixel/s
Pixel Fillrate
297.472 GPixel/s
Texture Fillrate
345.44 GTexel/s
Texture Fillrate
743.68 GTexel/s
-
-
L0
32KB/WGP
L1
64KB/SM Tex
128KB/SM
-
L1
-
-
128KB/Array
L2
48MB Shared
L2
4MB Shared
-
-
-
L3
128MB Shared
1.66TB/s
10GB
GDDR6
16GB
GDDR6
Bus Width
160Bit
Bus Width
256Bit
Clock
2250MHz
Transfer Rate
18GT/s
Bandwidth
360GB/s
Clock
2250MHz
Transfer Rate
18GT/s
Bandwidth
576GB/s
TDP
160W
TDP
335W
Temp
90°C Max
Temp
110°C Max
3x DisplayPort 1.4
1x HDMI 2.1
-
2x DisplayPort 1.4
1x HDMI 2.1
1x USB-C + DP
Max Resolution
7680x4320
Max Resolution
7680x4320
Max Resolution Refresh Rate
60Hz
Max Resolution Refresh Rate
120Hz
Variable Refresh Rate
G-Sync
FreeSync
Variable Refresh Rate
-
FreeSync
Display Stream Compression (DSC)
Supported
Display Stream Compression (DSC)
Supported
Multi Monitor Support
4
Multi Monitor Support
4
Content Protection
HDCP 2.3
Content Protection
HDCP 2.3
Model
NVENC 8
Model
VCN 3.0
Codec
AVC (H.264)
HEVC (H.265)
AV1
Codec
AVC (H.264)
HEVC (H.265)
-
Model
NVDEC 5
Model
VCN 3.0
Codec
MPEG-1
MPEG-2
MPEG-4
-
VC-1
VP8
VP9
AVC (H.264)
HEVC (H.265)
AV1
Codec
MPEG-1
MPEG-2
MPEG-4
JPEG
VC-1
-
VP9
AVC (H.264)
HEVC (H.265)
AV1
Direct X
12
Direct 3D
12_3
Direct X
12
Direct 3D
12_2
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
OpenGL
4.6
OpenCL
2.1
Vulkan
1.2
Shader Model
6.6
CUDA
8.9
-
-
PureVideo HD
VP12
VDPAU
Feature Set L
Shader Model
6.5
-
-
GFX
10.3
-
-
-
-
2x Fans
3x Fans
Power Connectors
-
1x 16-Pin 12VHPWR
Power Connectors
2x 8-Pin
-
Slots Required
2.1
PCIe Version
4.0
PCIe Lanes
16
Slots Required
2.5
PCIe Version
4.0
PCIe Lanes
16
-
-
-
-
Multi GPU Support
Supported
Type
Bridgeless
Height
98 mm (3.86 in)
Width
244 mm (9.61 in)
Depth
42 mm (1.65 in)
Height
120 mm (4.72 in)
Width
267 mm (10.51 in)
Depth
50 mm (1.97 in)
Change Comparison