60-Core M3 Ultra vs NVIDIA GeForce RTX 4070 Super

60-Core M3 Ultra
7680 Shaders
Shared Memory
1398MHz
NVIDIA GeForce RTX 4070 Super $599
7168 Shaders
12GB GDDR6X
2475MHz
Peak AI Performance
21.47 TFLOPS
FP16
Peak AI Performance
1.14 POPS
INT4 Tensor Sparse
FP32
21.47 TFLOPS
FP32
35.48 TFLOPS
FP16
21.47 TFLOPS
FP16
35.48 TFLOPS
Form Factor
iGPU
Form Factor
PCIe Card
2.1-Slots
TDP
Shared
TDP
225W
Power Connectors
1x 16-Pin 12VHPWR
Manufacturer
Apple
Manufacturer
NVIDIA
Chip Designer
Apple
Chip Designer
NVIDIA
Architecture
Apple Gen 6
Architecture
Ada Lovelace
Family
M Series
Family
GeForce 40
Codename
G16C
M3 Ultra
Codename
NV184
AD104
Variant
AD104-350-A1
Market Segment
Desktop
Market Segment
Desktop
Release Date
3/5/2025
Release Date
1/8/2024
Foundry
TSMC
Fabrication Node
4N
Die Size
295 mm²
Transistor Count
35.8 Billion
Transistor Density
121.56M/mm²
Form
iGPU
Form
PCIe Card
Shading Units
7680 Shaders
Shading Units
7168 Shaders
Texture Mapping Units
480 TMUs
Texture Mapping Units
224 TMUs
Render Output Units
240 ROPs
Render Output Units
80 ROPs
Tensor Cores
224 T-Cores
Ray-Tracing Cores
60 RT-Cores
Ray-Tracing Cores
56 RT-Cores
Streaming Multiprocessors
56 SMs
Execution Units
960 EUs
Graphics Processing Clusters
60 GPCs
Graphics Processing Clusters
5 GPCs
1398MHz
1980MHz Base
2475MHz
Peak AI Performance
21.47 TFLOPS
FP16
Peak AI Performance
1.14 POPS
INT4 Tensor Sparse
FP8
283.85 TFLOPS Tensor (FP16 Accumulate)
567.71 TFLOPS Tensor (FP16 Accumulate) Sparse
141.93 TFLOPS Tensor (FP32 Accumulate)
283.85 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
21.47 TFLOPS
FP16
35.48 TFLOPS
141.93 TFLOPS Tensor (FP16 Accumulate)
283.85 TFLOPS Tensor (FP16 Accumulate) Sparse
70.96 TFLOPS Tensor (FP32 Accumulate)
141.93 TFLOPS Tensor (FP32 Accumulate) Sparse
FP32
21.47 TFLOPS
FP32
35.48 TFLOPS
FP64
550 GFLOPS
BF16
35.48 TFLOPS
70.96 TFLOPS Tensor
141.93 TFLOPS Tensor Sparse
TF32
35.48 TFLOPS Tensor
70.96 TFLOPS Tensor Sparse
INT4
567.71 TOPS Tensor
1.14 POPS Tensor Sparse
INT8
283.85 TOPS Tensor
567.71 TOPS Tensor Sparse
INT32
17.74 TOPS
Ray Tracing
82 TOPS
Pixel Fillrate
335.52 GPixel/s
Pixel Fillrate
198 GPixel/s
Texture Fillrate
671.04 GTexel/s
Texture Fillrate
554.4 GTexel/s
L1
Unknown
L1
64KB/SM Tex
128KB/SM
L2
Unknown
L2
36MB Shared
Shared Memory
12GB
GDDR6X
Bus Width
192Bit
Clock
1313MHz
Transfer Rate
21GT/s
Bandwidth
504GB/s
TDP
Shared
TDP
225W
No Ports
3x DisplayPort 1.4
1x HDMI 2.1
Max Resolution
7680x4320
Max Resolution
7680x4320
Max Resolution Refresh Rate
60Hz
Max Resolution Refresh Rate
60Hz
Variable Refresh Rate
-
Variable Refresh Rate
G-Sync
FreeSync
Display Stream Compression (DSC)
Supported
Display Stream Compression (DSC)
Supported
Multi Monitor Support
8
Multi Monitor Support
4
Content Protection
HDCP 2.3
Encoder Model
4x Apple Media Engine 3
Encoder Model
2x NVENC 8
Codec
AVC (H.264)
HEVC (H.265)
ProRes
ProRes Raw
Codec
AVC (H.264)
HEVC (H.265)
AV1
Decoder Model
2x Apple Media Engine 3
Decoder Model
NVDEC 5
Codec
AVC (H.264)
HEVC (H.265)
AV1
ProRes
ProRes Raw
Codec
MPEG-1
MPEG-2
MPEG-4
VC-1
VP8
VP9
AVC (H.264)
HEVC (H.265)
AV1
DirectX
12
Direct3D
12_3
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
Shader Model
6.7
CUDA
8.9
PureVideo HD
VP12
VDPAU
Feature Set L
Not a Card
2x Fans
Power Connectors
1x 16-Pin 12VHPWR
Slots Required
2.1
PCIe Version
4.0
PCIe Lanes
16
Height
112 mm (4.41 in)
Width
267 mm (10.51 in)
Depth
42 mm (1.65 in)
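
Several of the throughput figures listed above appear to be derived from the unit counts and clocks rather than measured. The Python sketch below recomputes them as a sanity check. The formulas are the conventional peak-rate estimates (two FLOPs per shader per clock, one pixel per ROP per clock, one texel per TMU per clock, bus width times transfer rate) and are an assumption, since the page does not state how its numbers were obtained. The dictionary layout and function names are illustrative only.

```python
# Raw figures copied from the comparison; dictionary keys and function names
# are illustrative and not taken from the page.
SPECS = {
    "60-Core M3 Ultra": {"shaders": 7680, "tmus": 480, "rops": 240, "clock_mhz": 1398},
    "RTX 4070 Super": {"shaders": 7168, "tmus": 224, "rops": 80, "clock_mhz": 2475},
}


def fp32_tflops(shaders: int, clock_mhz: float) -> float:
    """Assumed peak: one fused multiply-add (2 FLOPs) per shader per clock."""
    return shaders * 2 * clock_mhz / 1e6


def fillrate_giga(units: int, clock_mhz: float) -> float:
    """Assumed peak: one pixel per ROP (or one texel per TMU) per clock."""
    return units * clock_mhz / 1e3


def bandwidth_gb_s(bus_width_bits: int, transfer_rate_gt_s: float) -> float:
    """Bytes per transfer (bus width / 8) times transfers per second."""
    return bus_width_bits / 8 * transfer_rate_gt_s


if __name__ == "__main__":
    for name, s in SPECS.items():
        clock = s["clock_mhz"]
        print(name)
        print(f"  FP32:             {fp32_tflops(s['shaders'], clock):.2f} TFLOPS")
        print(f"  Pixel fillrate:   {fillrate_giga(s['rops'], clock):.2f} GPixel/s")
        print(f"  Texture fillrate: {fillrate_giga(s['tmus'], clock):.2f} GTexel/s")
    # Only the RTX 4070 Super lists a bus width (192 bit) and transfer rate (21 GT/s).
    print(f"RTX 4070 Super bandwidth: {bandwidth_gb_s(192, 21):.0f} GB/s")
```

Under these assumptions the results match the listed FP32, fillrate, and bandwidth values for both GPUs. The tensor, sparse, and ray-tracing figures depend on per-architecture details that the table does not give, so they are left out of the check.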