AMD Radeon RX 6950 XT vs NVIDIA GeForce RTX 4060 Ti

AMD Radeon RX 6950 XT $1,099
5120 Shaders
16GB GDDR6
2324MHz
NVIDIA GeForce RTX 4060 Ti $399
4352 Shaders
10GB GDDR6
2540MHz
Peak AI Performance
47.6 TFLOPS
FP16
Peak AI Performance
707.46 TOPS
INT4 Tensor Sparse
FP32
23.8 TFLOPS
FP32
22.11 TFLOPS
FP16
47.6 TFLOPS
FP16
22.11 TFLOPS
Form Factor
PCIe Card
2.5-Slots
Form Factor
PCIe Card
2.1-Slots
TDP
335W
TDP
160W
Power Connectors
2x 8-Pin
-
Power Connectors
-
1x 16-Pin 12VHPWR

Peak AI Performance

  • 93% slower vs GeForce RTX 4060 Ti
  • 14.86x faster vs Radeon RX 6950 XT
Radeon RX 6950 XT - 47.6 TFLOPS FP16
x1
GeForce RTX 4060 Ti - 707.46 TOPS INT4 Tensor Sparse
x14.86

FP32

  • 8% faster vs GeForce RTX 4060 Ti
  • 7% slower vs Radeon RX 6950 XT
Radeon RX 6950 XT - 23.8 TFLOPS FP32
x1.08
GeForce RTX 4060 Ti - 22.11 TFLOPS FP32
x1

FP16

  • 2.15x faster vs GeForce RTX 4060 Ti
  • 54% slower vs Radeon RX 6950 XT
Radeon RX 6950 XT - 47.6 TFLOPS FP16
x2.15
GeForce RTX 4060 Ti - 22.11 TFLOPS FP16
x1

Memory Bandwidth

  • 60% faster vs GeForce RTX 4060 Ti
  • 38% slower vs Radeon RX 6950 XT
Radeon RX 6950 XT - 576GB/s
x1.6
GeForce RTX 4060 Ti - 360GB/s
x1

TDP

  • 109% higher vs GeForce RTX 4060 Ti
  • 52% lower vs Radeon RX 6950 XT
Radeon RX 6950 XT - 335W
x2.09
GeForce RTX 4060 Ti - 160W
x1

GB6 OpenCL

  • 42% faster vs GeForce RTX 4060 Ti
  • 30% slower vs Radeon RX 6950 XT
Radeon RX 6950 XT - GB6 OpenCL 188,500
x1.42
GeForce RTX 4060 Ti - GB6 OpenCL 132,650
x1
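
The multipliers and percentages in the comparison blocks above appear to follow from simple ratios of the two listed figures. The sketch below reproduces them under that assumption; the function name `compare` and its return layout are illustrative and not taken from the source page.

```python
def compare(a: float, b: float) -> tuple[float, int, int]:
    """Compare two positive figures a and b.

    Returns (multiplier, percent_a_above_b, percent_b_below_a):
      multiplier        a / b, rounded to two decimals (the "x1.08" style value)
      percent_a_above_b how much higher a is than b, in whole percent
      percent_b_below_a how much lower b is than a, in whole percent
    """
    multiplier = round(a / b, 2)
    percent_above = round((a / b - 1) * 100)
    percent_below = round((1 - b / a) * 100)
    return multiplier, percent_above, percent_below


if __name__ == "__main__":
    # Figures copied from the listing: RX 6950 XT first, RTX 4060 Ti second.
    print("FP32     ", compare(23.8, 22.11))
    print("FP16     ", compare(47.6, 22.11))
    print("TDP      ", compare(335, 160))
    print("GB6 OpenCL", compare(188_500, 132_650))
```

Note that the "Peak AI Performance" comparison divides INT4 sparse tensor TOPS by FP16 TFLOPS, so its x14.86 multiplier compares different precisions rather than like-for-like throughput.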
GB5 OpenCL 170,545 (56%)
GB5 OpenCL N/A
GB5 Metal 167,895 (93%)
GB5 Metal N/A
GB5 Vulkan 117,795 (57%)
GB5 Vulkan N/A
OCT Metal 345 (59%)
OCT Metal N/A
Manufacturer
AMD
Manufacturer
NVIDIA
Chip Designer
AMD
Chip Designer
NVIDIA
Architecture
RDNA 2
Architecture
Ada Lovelace
Family
Radeon RX 6000
Family
GeForce 40
Codename
Sienna Cichlid
Navi 21
Variant
Navi 21 KXTX
Codename
NV186
AD106
Variant
AD106-350-A1
Market Segment
Desktop
Market Segment
Desktop
Release Date
5/10/2022
Release Date
5/18/2023
Foundry
TSMC
Foundry
TSMC
Fabrication Node
N7
Fabrication Node
4N
Die Size
520 mm²
Die Size
188 mm²
Transistor Count
26.8 Billion
Transistor Count
22.9 Billion
Transistor Density
51.56M/mm²
Transistor Density
121.81M/mm²
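
Transistor density is the transistor count divided by the die area. A minimal sketch, assuming the rounded counts and die sizes listed above are the inputs (the helper name `density_m_per_mm2` is illustrative):

```python
def density_m_per_mm2(transistors: float, die_area_mm2: float) -> float:
    """Return transistor density in millions of transistors per mm^2."""
    return transistors / die_area_mm2 / 1e6


if __name__ == "__main__":
    # Inputs as listed: 26.8 billion on 520 mm^2, 22.9 billion on 188 mm^2.
    print(f"RX 6950 XT:  {density_m_per_mm2(26.8e9, 520):.2f}M/mm^2")
    print(f"RTX 4060 Ti: {density_m_per_mm2(22.9e9, 188):.2f}M/mm^2")
```

With the rounded 26.8 billion figure the Radeon result lands slightly below the listed 51.56M/mm², which suggests the page computed it from an unrounded transistor count.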
Form
PCIe Card
Form
PCIe Card
Shading Units
5120 Shaders
Shading Units
4352 Shaders
Texture Mapping Units
320 TMUs
Texture Mapping Units
136 TMUs
Render Output Units
128 ROPs
Render Output Units
48 ROPs
Tensor Cores (GeForce RTX 4060 Ti only)
136 T-Cores
Ray-Tracing Cores
80 RT-Cores
Ray-Tracing Cores
34 RT-Cores
Compute Units (Radeon RX 6950 XT only)
80 CUs
Streaming Multiprocessors (GeForce RTX 4060 Ti only)
34 SMs
Graphics Processing Clusters
4 GPCs
Graphics Processing Clusters
3 GPCs
1925MHz Base
2324MHz
2310MHz Base
2540MHz
Peak AI Performance
47.6 TFLOPS
FP16
Peak AI Performance
707.46 TOPS
INT4 Tensor Sparse
FP8 (GeForce RTX 4060 Ti only)
176.87 TFLOPS Tensor (FP16 Accumulate)
353.73 TFLOPS Tensor (FP16 Accumulate) Sparse
88.43 TFLOPS Tensor (FP32 Accumulate)
176.87 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
47.6 TFLOPS
FP16
22.11 TFLOPS
88.43 TFLOPS Tensor (FP16 Accumulate)
176.87 TFLOPS Tensor (FP16 Accumulate) Sparse
44.22 TFLOPS Tensor (FP32 Accumulate)
88.43 TFLOPS Tensor (FP32 Accumulate) Sparse
FP32
23.8 TFLOPS
FP32
22.11 TFLOPS
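
The listed FP32 figures are consistent with the usual peak-throughput estimate: shading units times two floating-point operations per clock (one fused multiply-add) times the higher of the two listed clocks. The sketch below assumes that convention; the function name and the factor of 2 are assumptions, not something the page states.

```python
def peak_fp32_tflops(shaders: int, clock_mhz: float, flops_per_clock: int = 2) -> float:
    """Estimate peak FP32 throughput in TFLOPS.

    flops_per_clock=2 assumes one fused multiply-add per shader per cycle.
    """
    return shaders * flops_per_clock * clock_mhz / 1e6


if __name__ == "__main__":
    # Shader counts and clocks as listed for each card.
    print(f"RX 6950 XT:  {peak_fp32_tflops(5120, 2324):.2f} TFLOPS")
    print(f"RTX 4060 Ti: {peak_fp32_tflops(4352, 2540):.2f} TFLOPS")
```

These are theoretical ceilings; sustained throughput depends on clocks under load, memory bandwidth, and the workload.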
FP64
1.49 TFLOPS
FP64
350 GFLOPS
BF16 (GeForce RTX 4060 Ti only)
22.11 TFLOPS
44.22 TFLOPS Tensor
88.43 TFLOPS Tensor Sparse
TF32 (GeForce RTX 4060 Ti only)
22.11 TFLOPS Tensor
44.22 TFLOPS Tensor Sparse
INT4 (GeForce RTX 4060 Ti only)
353.73 TOPS Tensor
707.46 TOPS Tensor Sparse
INT8 (GeForce RTX 4060 Ti only)
176.87 TOPS Tensor
353.73 TOPS Tensor Sparse
INT32 (GeForce RTX 4060 Ti only)
11.05 TOPS
Ray Tracing (GeForce RTX 4060 Ti only)
51.1 TOPS
Pixel Fillrate
297.472 GPixel/s
Pixel Fillrate
121.92 GPixel/s
Texture Fillrate
743.68 GTexel/s
Texture Fillrate
345.44 GTexel/s
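
The fillrate figures match the product of unit count and the higher listed clock: ROPs for pixel fillrate, TMUs for texture fillrate. A short sketch under that assumption (the function name `fillrate_g_per_s` is illustrative):

```python
def fillrate_g_per_s(units: int, clock_mhz: float) -> float:
    """Return a peak fillrate in giga-operations per second.

    Assumes each unit (ROP or TMU) completes one pixel or texel per clock.
    """
    return units * clock_mhz / 1000


if __name__ == "__main__":
    # (ROPs, TMUs, clock in MHz) as listed for each card.
    cards = {
        "RX 6950 XT": (128, 320, 2324),
        "RTX 4060 Ti": (48, 136, 2540),
    }
    for name, (rops, tmus, clock) in cards.items():
        pixel = fillrate_g_per_s(rops, clock)
        texel = fillrate_g_per_s(tmus, clock)
        print(f"{name}: {pixel:.3f} GPixel/s, {texel:.2f} GTexel/s")
```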
L0 (Radeon RX 6950 XT only)
32KB/WGP
L1
128KB/Array
L1
64KB/SM Tex
128KB/SM
L2
4MB Shared
L2
48MB Shared
L3 (Radeon RX 6950 XT only)
128MB Shared
1.66TB/s
16GB GDDR6
10GB GDDR6
Bus Width
256Bit
Bus Width
160Bit
Clock
2250MHz
Transfer Rate
18GT/s
Bandwidth
576GB/s
Clock
2250MHz
Transfer Rate
18GT/s
Bandwidth
360GB/s
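
The bandwidth figures follow from the bus width and transfer rate: bytes moved per transfer (bus width in bits divided by 8) times transfers per second. A minimal sketch of that arithmetic, with an illustrative function name:

```python
def memory_bandwidth_gb_s(bus_width_bits: int, transfer_rate_gt_s: float) -> float:
    """Return peak memory bandwidth in GB/s.

    bus_width_bits / 8 gives the bytes moved per transfer across the whole bus.
    """
    return bus_width_bits / 8 * transfer_rate_gt_s


if __name__ == "__main__":
    # Bus widths and transfer rates as listed for each card.
    print(f"RX 6950 XT:  {memory_bandwidth_gb_s(256, 18):.0f} GB/s")
    print(f"RTX 4060 Ti: {memory_bandwidth_gb_s(160, 18):.0f} GB/s")
```

Both cards are listed at the same 18GT/s, so the 60% bandwidth gap comes entirely from the wider 256-bit bus.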
TDP
335W
TDP
160W
Temp
110°C Max
Temp
90°C Max
2x DisplayPort 1.4
1x HDMI 2.1
1x USB-C + DP
3x DisplayPort 1.4
1x HDMI 2.1
-
Max Resolution
7680x4320
Max Resolution
7680x4320
Max Resolution Refresh Rate
120Hz
Max Resolution Refresh Rate
60Hz
Variable Refresh Rate
-
FreeSync
Variable Refresh Rate
G-Sync
FreeSync
Display Stream Compression (DSC)
Supported
Display Stream Compression (DSC)
Supported
Multi Monitor Support
4
Multi Monitor Support
4
Content Protection
HDCP 2.3
Content Protection
HDCP 2.3
Model
VCN 3.0
Model
NVENC 8
Codec
AVC (H.264)
HEVC (H.265)
-
Codec
AVC (H.264)
HEVC (H.265)
AV1
Model
VCN 3.0
Model
NVDEC 5
Codec
MPEG-1
MPEG-2
MPEG-4
JPEG
VC-1
-
VP9
AVC (H.264)
HEVC (H.265)
AV1
Codec
MPEG-1
MPEG-2
MPEG-4
-
VC-1
VP8
VP9
AVC (H.264)
HEVC (H.265)
AV1
Direct X
12
Direct 3D
12_2
Direct X
12
Direct 3D
12_3
OpenGL
4.6
OpenCL
2.1
Vulkan
1.2
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
Shader Model
6.5
-
-
GFX
10.3
-
-
-
-
Shader Model
6.6
CUDA
8.9
-
-
PureVideo HD
VP12
VDPAU
Feature Set L
3x Fans
2x Fans
Power Connectors
2x 8-Pin
-
Power Connectors
-
1x 16-Pin 12VHPWR
Slots Required
2.5
PCIe Version
4.0
PCIe Lanes
16
Slots Required
2.1
PCIe Version
4.0
PCIe Lanes
16
Multi GPU Support
Supported
Type
Bridgeless
-
-
-
-
Height
120 mm (4.72 in)
Width
267 mm (10.51 in)
Depth
50 mm (1.97 in)
Height
98 mm (3.86 in)
Width
244 mm (9.61 in)
Depth
42 mm (1.65 in)