NVIDIA GeForce RTX 4070 Super vs 60-Core M2 Ultra

| Overview | NVIDIA GeForce RTX 4070 Super | 60-Core M2 Ultra |
| --- | --- | --- |
| Price | $599 | - |
| Shaders | 7168 | 7680 |
| Memory | 12GB GDDR6X | Shared Memory |
| Clock | 2475MHz | 1398MHz |
| Peak AI Performance | 1.14 POPS (INT4 Tensor Sparse) | 21.47 TFLOPS (FP16) |
| FP32 | 35.48 TFLOPS | 21.47 TFLOPS |
| FP16 | 35.48 TFLOPS | 21.47 TFLOPS |
| Form Factor | PCIe Card, 2.1-Slots | iGPU |
| TDP | 225W | Shared |
| Power Connectors | 1x 16-Pin 12VHPWR | - |

| Benchmark | RTX 4070 Super | 60-Core M2 Ultra |
| --- | --- | --- |
| GB6 OpenCL | N/A | N/A |
| GB6 Metal | N/A | N/A |
| GB6 Vulkan | N/A | N/A |
| GB5 OpenCL | N/A | 105,745 |
| GB5 CUDA | N/A | N/A |
| GB5 Metal | N/A | 112,540 |
| GB5 Vulkan | N/A | N/A |
| OCT 2020.1 | N/A | N/A |
| OCT Metal | N/A | N/A |

| General | RTX 4070 Super | 60-Core M2 Ultra |
| --- | --- | --- |
| Manufacturer | NVIDIA | Apple |
| Chip Designer | NVIDIA | Apple |
| Architecture | Ada Lovelace | Apple Gen 5 |
| Family | GeForce 40 | M Series |
| Codename | NV184, AD104 | G14D, M2 Ultra |
| Variant | AD104-350-A1 | - |
| Market Segment | Desktop | Desktop |
| Release Date | 1/8/2024 | 6/5/2023 |
| Foundry | TSMC | - |
| Fabrication Node | 4N | - |
| Die Size | 295 mm² | - |
| Transistor Count | 35.8 Billion | - |
| Transistor Density | 121.56M/mm² | - |

| Core Configuration | RTX 4070 Super | 60-Core M2 Ultra |
| --- | --- | --- |
| Form | PCIe Card | iGPU |
| Shading Units | 7168 Shaders | 7680 Shaders |
| Texture Mapping Units | 224 TMUs | 480 TMUs |
| Render Output Units | 80 ROPs | 240 ROPs |
| Tensor Cores | 224 T-Cores | - |
| Ray-Tracing Cores | 56 RT-Cores | - |
| Streaming Multiprocessors | 56 SMs | - |
| Execution Units | - | 960 EUs |
| Graphics Processing Clusters | 5 GPCs | 60 GPCs |
| Clock | 1980MHz Base, 2475MHz | 1398MHz |

| Compute | RTX 4070 Super | 60-Core M2 Ultra |
| --- | --- | --- |
| Peak AI Performance | 1.14 POPS (INT4 Tensor Sparse) | 21.47 TFLOPS (FP16) |
| FP8 Tensor (FP16 Accumulate) | 283.85 TFLOPS, 567.71 TFLOPS Sparse | - |
| FP8 Tensor (FP32 Accumulate) | 141.93 TFLOPS, 283.85 TFLOPS Sparse | - |
| FP16 | 35.48 TFLOPS | 21.47 TFLOPS |
| FP16 Tensor (FP16 Accumulate) | 141.93 TFLOPS, 283.85 TFLOPS Sparse | - |
| FP16 Tensor (FP32 Accumulate) | 70.96 TFLOPS, 141.93 TFLOPS Sparse | - |
| FP32 | 35.48 TFLOPS | 21.47 TFLOPS |
| FP64 | 550 GFLOPS | - |
| BF16 | 35.48 TFLOPS | - |
| BF16 Tensor | 70.96 TFLOPS, 141.93 TFLOPS Sparse | - |
| TF32 Tensor | 35.48 TFLOPS, 70.96 TFLOPS Sparse | - |
| INT4 Tensor | 567.71 TOPS, 1.14 POPS Sparse | - |
| INT8 Tensor | 283.85 TOPS, 567.71 TOPS Sparse | - |
| INT32 | 17.74 TOPS | - |
| Ray Tracing | 82 TOPS | - |
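
The non-tensor FP32 figures listed above appear consistent with the common estimate of shading units × 2 FLOPs per clock (one fused multiply-add) × clock speed. The sketch below reproduces them under that assumption; the function name and the `flops_per_clock` parameter are illustrative and do not come from the source page.

```python
def peak_fp32_tflops(shading_units: int, clock_mhz: float, flops_per_clock: int = 2) -> float:
    """Estimate peak FP32 throughput in TFLOPS.

    Assumes each shading unit retires one fused multiply-add
    (counted as 2 FLOPs) per clock cycle.
    """
    flops_per_second = shading_units * flops_per_clock * clock_mhz * 1e6
    return flops_per_second / 1e12


# Inputs taken from the comparison: shader counts and listed clocks.
rtx_4070_super = peak_fp32_tflops(7168, 2475)
m2_ultra_60_core = peak_fp32_tflops(7680, 1398)

print(f"RTX 4070 Super:   {rtx_4070_super:.2f} TFLOPS")
print(f"60-Core M2 Ultra: {m2_ultra_60_core:.2f} TFLOPS")
```

These are theoretical peaks; sustained throughput depends on the workload, memory bandwidth, and drivers.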

| Fillrate | RTX 4070 Super | 60-Core M2 Ultra |
| --- | --- | --- |
| Pixel Fillrate | 198 GPixel/s | 335.52 GPixel/s |
| Texture Fillrate | 554.4 GTexel/s | 671.04 GTexel/s |
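
The fillrates above match the usual theoretical estimate: ROPs × clock for pixels and TMUs × clock for texels. A minimal sketch under that assumption follows; the helper name `fillrate_g_per_s` is hypothetical.

```python
import math


def fillrate_g_per_s(units: int, clock_mhz: float) -> float:
    """Theoretical fillrate in giga-operations per second.

    Assumes each unit (ROP or TMU) processes one pixel or texel per clock.
    """
    return units * clock_mhz / 1000.0


# (ROPs, TMUs, clock in MHz) as listed in the comparison.
gpus = {
    "RTX 4070 Super": (80, 224, 2475),
    "60-Core M2 Ultra": (240, 480, 1398),
}

for name, (rops, tmus, clock_mhz) in gpus.items():
    pixel = fillrate_g_per_s(rops, clock_mhz)
    texel = fillrate_g_per_s(tmus, clock_mhz)
    print(f"{name}: {pixel:.2f} GPixel/s, {texel:.2f} GTexel/s")
```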

| Cache and Memory | RTX 4070 Super | 60-Core M2 Ultra |
| --- | --- | --- |
| L1 | 64KB/SM Tex, 128KB/SM | Unknown |
| L2 | 36MB Shared | Unknown |
| Memory | 12GB GDDR6X | Shared Memory |
| Bus Width | 192Bit | - |
| Clock | 1313MHz | - |
| Transfer Rate | 21GT/s | - |
| Bandwidth | 504GB/s | - |
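
The listed memory bandwidth follows from the bus width and transfer rate: bytes per transfer (bus width ÷ 8) multiplied by transfers per second. The sketch below checks the RTX 4070 Super figure under that assumption; the function name is illustrative.

```python
def memory_bandwidth_gb_per_s(bus_width_bits: int, transfer_rate_gt_per_s: float) -> float:
    """Theoretical memory bandwidth in GB/s.

    Each transfer moves bus_width_bits / 8 bytes across the whole bus.
    """
    bytes_per_transfer = bus_width_bits / 8
    return bytes_per_transfer * transfer_rate_gt_per_s


# RTX 4070 Super: 192-bit bus at 21 GT/s, as listed in the comparison.
bandwidth = memory_bandwidth_gb_per_s(192, 21)
print(f"{bandwidth:.0f} GB/s")
```

The page lists no bus width or transfer rate for the M2 Ultra, so the same check cannot be run for it from this data.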

| Power and Display | RTX 4070 Super | 60-Core M2 Ultra |
| --- | --- | --- |
| TDP | 225W | Shared |
| Display Outputs | 3x DisplayPort 1.4, 1x HDMI 2.1 | No Ports |
| Max Resolution | 7680x4320 | 7680x4320 |
| Max Resolution Refresh Rate | 60Hz | 60Hz |
| Variable Refresh Rate | G-Sync, FreeSync | - |
| Display Stream Compression (DSC) | Supported | Supported |
| Multi Monitor Support | 4 | 8 |
| Content Protection | HDCP 2.3 | - |

| Video Engine | RTX 4070 Super | 60-Core M2 Ultra |
| --- | --- | --- |
| Encoder Model | 2x NVENC 8 | 4x Apple Media Engine 2 |
| Encode Codecs | AVC (H.264), HEVC (H.265), AV1 | AVC (H.264), HEVC (H.265), ProRes, ProRes Raw |
| Decoder Model | NVDEC 5 | 2x Apple Media Engine 2 |
| Decode Codecs | MPEG-1, MPEG-2, MPEG-4, VC-1, VP8, VP9, AVC (H.264), HEVC (H.265), AV1 | AVC (H.264), HEVC (H.265), ProRes, ProRes Raw |
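
| API Support | RTX 4070 Super | 60-Core M2 Ultra |
| --- | --- | --- |
| DirectX | 12 | - |
| Direct3D | 12_3 | - |
| OpenGL | 4.6 | - |
| OpenCL | 3.0 | - |
| Vulkan | 1.3 | - |
| Shader Model | 6.7 | - |
| CUDA | 8.9 | - |
| PureVideo HD | VP12 | - |
| VDPAU | Feature Set L | - |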

| Board | RTX 4070 Super | 60-Core M2 Ultra |
| --- | --- | --- |
| Cooling | 2x Fans | Not a Card |
| Power Connectors | 1x 16-Pin 12VHPWR | - |
| Slots Required | 2.1 | - |
| PCIe Version | 4.0 | - |
| PCIe Lanes | 16 | - |
| Height | 112 mm (4.41 in) | - |
| Width | 267 mm (10.51 in) | - |
| Depth | 42 mm (1.65 in) | - |