NVIDIA GeForce RTX 4070 Super vs 48-Core M1 Ultra

NVIDIA GeForce RTX 4070 Super $599
48-Core M1 Ultra
7168 Shaders
12GB GDDR6X
2475MHz
6144 Shaders
Shared Memory
1290MHz
Peak AI Performance
1.14 POPS
INT4 Tensor Sparse
Peak AI Performance
15.85 TFLOPS
FP16
FP32
35.48 TFLOPS
FP32
15.85 TFLOPS
FP16
35.48 TFLOPS
FP16
15.85 TFLOPS
Form Factor
PCIe Card
2.1 Slots
Form Factor
iGPU
-
TDP
225W
TDP
Shared
Power Connectors
1x 16-Pin 12VHPWR
GB5 OpenCL
N/A
GB5 OpenCL
77,200
GB5 Metal
N/A
GB5 Metal
92,500
OCT Metal
N/A
OCT Metal
310
Manufacturer
NVIDIA
Manufacturer
Apple
Chip Designer
NVIDIA
Chip Designer
Apple
Architecture
Ada Lovelace
Architecture
Apple Gen 4
Family
GeForce 40
Family
M Series
Codename
NV184
AD104
Variant
AD104-350-A1
Codename
G13D
M1 Ultra
-
-
Market Segment
Desktop
Market Segment
Desktop
Release Date
1/8/2024
Release Date
3/8/2022
Foundry
TSMC
-
-
-
-
Fabrication Node
4N
-
-
-
-
Die Size
295 mm²
-
-
-
-
Transistor Count
35.8 Billion
-
-
-
Transistor Density
121.56M/mm²
-
-
-
-
Form
PCIe Card
Form
iGPU
Shading Units
7168 Shaders
-
Shading Units
6144 Shaders
-
Texture Mapping Units
224 TMUs
Texture Mapping Units
384 TMUs
Render Output Units
80 ROPs
Render Output Units
192 ROPs
Tensor Cores
224 T-Cores
-
-
Ray-Tracing Cores
56 RT-Cores
-
-
Streaming Multiprocessors
56 SMs
-
-
-
-
-
-
-
-
Execution Units
768 EUs
Graphics Processing Clusters
5 GPCs
Graphics Processing Clusters
48 GPCs
-
-
1980MHz Base
2475MHz
-
-
-
1290MHz
Peak AI Performance
1.14 POPS
INT4 Tensor Sparse
Peak AI Performance
15.85 TFLOPS
FP16
-
-
-
-
-
-
FP8
-
283.85 TFLOPS Tensor (FP16 Accumulate)
567.71 TFLOPS Tensor (FP16 Accumulate) Sparse
141.93 TFLOPS Tensor (FP32 Accumulate)
283.85 TFLOPS Tensor (FP32 Accumulate) Sparse
-
-
-
-
-
-
FP16
35.48 TFLOPS
141.93 TFLOPS Tensor (FP16 Accumulate)
283.85 TFLOPS Tensor (FP16 Accumulate) Sparse
70.96 TFLOPS Tensor (FP32 Accumulate)
141.93 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
15.85 TFLOPS
-
-
-
-
FP32
35.48 TFLOPS
-
-
FP32
15.85 TFLOPS
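Both FP32 figures are consistent with the usual theoretical-peak formula: shading units × 2 FLOPs per clock (one fused multiply-add) × boost clock. The sketch below reproduces them under that assumption. The function name `peak_fp32_tflops` is illustrative and does not come from the source, and sustained throughput in real workloads is normally lower than this peak.

```python
def peak_fp32_tflops(shading_units: int, clock_mhz: float) -> float:
    """Theoretical FP32 peak, assuming one FMA (2 FLOPs) per shader per clock."""
    flops_per_second = shading_units * 2 * clock_mhz * 1e6
    return flops_per_second / 1e12


# Inputs are the shader counts and boost clocks listed on this page.
rtx_4070_super = peak_fp32_tflops(7168, 2475)
m1_ultra = peak_fp32_tflops(6144, 1290)

print(f"RTX 4070 Super: {rtx_4070_super:.2f} TFLOPS")
print(f"M1 Ultra:       {m1_ultra:.2f} TFLOPS")
```

Rounded to two decimals, the results match the 35.48 TFLOPS and 15.85 TFLOPS listed above.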
-
-
FP64
550 GFLOPS
-
-
-
-
BF16
35.48 TFLOPS
70.96 TFLOPS Tensor
141.93 TFLOPS Tensor Sparse
-
-
-
-
TF32
35.48 TFLOPS Tensor
70.96 TFLOPS Tensor Sparse
-
-
-
INT4
567.71 TOPS Tensor
1.14 POPS Tensor Sparse
-
-
-
INT8
-
283.85 TOPS Tensor
567.71 TOPS Tensor Sparse
-
-
-
-
INT32
17.74 TOPS
-
-
Ray Tracing
82 TOPS
-
-
Pixel Fillrate
198 GPixel/s
Pixel Fillrate
247.68 GPixel/s
-
-
-
-
Texture Fillrate
554.4 GTexel/s
Texture Fillrate
495.36 GTexel/s
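The fillrate figures follow from unit counts and clock speed, assuming each ROP writes one pixel and each TMU samples one texel per clock. The sketch below recomputes them from the values on this page. The helper name `fillrate_g_per_s` is hypothetical.

```python
def fillrate_g_per_s(units: int, clock_mhz: int) -> float:
    """Peak fillrate in G(pixels or texels)/s, assuming one operation per unit per clock."""
    return units * clock_mhz / 1000


# (ROPs, TMUs, boost clock in MHz) as listed on this page.
gpus = {
    "RTX 4070 Super": (80, 224, 2475),
    "M1 Ultra": (192, 384, 1290),
}

for name, (rops, tmus, clock_mhz) in gpus.items():
    pixel = fillrate_g_per_s(rops, clock_mhz)
    texel = fillrate_g_per_s(tmus, clock_mhz)
    print(f"{name}: {pixel:.2f} GPixel/s, {texel:.2f} GTexel/s")
```

The M1 Ultra leads on pixel fillrate because its ROP count more than offsets its lower clock, while the RTX 4070 Super leads on texture fillrate.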
-
-
-
-
L1
64KB/SM Tex
128KB/SM
-
-
L1
-
-
-
Unknown
L2
36MB Shared
L2
Unknown
-
-
-
-
-
-
12GB
GDDR6X
-
Shared Memory
-
-
Bus Width
192-bit
-
Clock
1313MHz
Transfer Rate
21GT/s
Bandwidth
504GB/s
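The listed bandwidth can be checked from the bus width and transfer rate: bytes per transfer (bus width ÷ 8) multiplied by transfers per second. This is a minimal sketch of that arithmetic for the RTX 4070 Super. The function name `memory_bandwidth_gb_per_s` is an assumption for illustration, and no equivalent figure is listed here for the M1 Ultra's shared memory.

```python
def memory_bandwidth_gb_per_s(bus_width_bits: int, transfer_rate_gt_per_s: float) -> float:
    """Peak memory bandwidth in GB/s: bytes moved per transfer times transfers per second."""
    bytes_per_transfer = bus_width_bits / 8
    return bytes_per_transfer * transfer_rate_gt_per_s


# 192-bit bus at 21 GT/s, as listed on this page.
bandwidth = memory_bandwidth_gb_per_s(192, 21)
print(f"{bandwidth:.0f} GB/s")
```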
TDP
225W
TDP
Shared
3x DisplayPort 1.4
1x HDMI 2.1
No Ports
Max Resolution
7680x4320
Max Resolution
6016x3384
Max Resolution Refresh Rate
60Hz
Max Resolution Refresh Rate
60Hz
Variable Refresh Rate
G-Sync
FreeSync
-
Variable Refresh Rate
-
-
-
Display Stream Compression (DSC)
Supported
Display Stream Compression (DSC)
Supported
Multi Monitor Support
4
Multi Monitor Support
5
Content Protection
HDCP 2.3
-
-
Model
2x NVENC 8
Model
4x Apple Media Engine 2
Codec
-
-
-
-
-
-
-
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Codec
-
-
-
-
-
-
-
-
AVC (H.264)
HEVC (H.265)
-
-
ProRes
ProRes RAW
Model
NVDEC 5
Model
2x Apple Media Engine 2
Codec
MPEG-1
MPEG-2
MPEG-4
-
VC-1
VP8
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
Codec
-
-
-
-
-
-
-
-
AVC (H.264)
HEVC (H.265)
-
-
ProRes
ProRes RAW
DirectX
12
Direct3D
12_3
-
-
-
-
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
-
-
-
-
-
-
Shader Model
6.7
CUDA
8.9
-
-
PureVideo HD
VP12
VDPAU
Feature Set L
2x Fans
Not a Card
-
-
-
Power Connectors
1x 16-Pin 12VHPWR
Slots Required
2.1
PCIe Version
4.0
PCIe Lanes
16
Height
112 mm (4.41 in)
Width
267 mm (10.51 in)
Depth
42 mm (1.65 in)