NVIDIA GA10B 16SM 918MHz vs Qualcomm Adreno 740+

NVIDIA GA10B 16SM 918MHz
Qualcomm Adreno 740+
1024 Shaders
Shared Memory
918MHz
1536 Shaders
Shared Memory
719MHz
Peak AI Performance
120.32 TOPS
INT4 Tensor Sparse
Peak AI Performance
4.42 TFLOPS
FP16
FP32
1.88 TFLOPS
FP32
2.21 TFLOPS
FP16
3.76 TFLOPS
FP16
4.42 TFLOPS
Form Factor
iGPU
Form Factor
iGPU
TDP
Shared
TDP
5W

Peak AI Performance

  • 27.22x faster vs Adreno 740+
  • 96% slower vs GA10B 16SM 918MHz
GA10B 16SM 918MHz - 120.32 TOPS INT4 Tensor Sparse
x27.22
Adreno 740+ - 4.42 TFLOPS FP16
x1

FP32

  • 15% slower vs Adreno 740+
  • 18% faster vs GA10B 16SM 918MHz
GA10B 16SM 918MHz - 1.88 TFLOPS FP32
x1
Adreno 740+ - 2.21 TFLOPS FP32
x1.18

FP16

  • 15% slower vs Adreno 740+
  • 18% faster vs GA10B 16SM 918MHz
GA10B 16SM 918MHz - 3.76 TFLOPS FP16
x1
Adreno 740+ - 4.42 TFLOPS FP16
x1.18
Manufacturer
NVIDIA
Manufacturer
Qualcomm
Chip Designer
NVIDIA
Chip Designer
Qualcomm
Architecture
Ampere
Architecture
Adreno 700
Family
Jetson
Family
Adreno 700
Codename
Hercules
GA10B
-
-
-
Market Segment
Mobile
Market Segment
Mobile
Release Date
12/1/2022
Release Date
11/16/2022
Form
iGPU
Form
iGPU
Shading Units
1024 Shaders
Shading Units
1536 Shaders
Texture Mapping Units
8 TMUs
Texture Mapping Units
96 TMUs
Render Output Units
16 ROPs
Render Output Units
48 ROPs
Tensor Cores
32 T-Cores
-
-
Streaming Multiprocessors
8 SMs
-
-
-
-
Execution Units
12 EUs
Graphics Processing Clusters
1 GPC
-
-
918MHz
719MHz
Peak AI Performance
120.32 TOPS
INT4 Tensor Sparse
Peak AI Performance
4.42 TFLOPS
FP16
FP16
3.76 TFLOPS
15.04 TFLOPS Tensor (FP16 Accumulate)
30.08 TFLOPS Tensor (FP16 Accumulate) Sparse
15.04 TFLOPS Tensor (FP32 Accumulate)
30.08 TFLOPS Tensor (FP32 Accumulate) Sparse
FP16
4.42 TFLOPS
-
-
-
-
FP32
1.88 TFLOPS
FP32
2.21 TFLOPS
-
-
FP64
550 GFLOPS
BF16
-
15.04 TFLOPS Tensor
30.08 TFLOPS Tensor Sparse
BF16
4.42 TFLOPS
-
-
TF32
7.52 TFLOPS Tensor
15.04 TFLOPS Tensor Sparse
-
-
-
INT4
60.16 TOPS Tensor
120.32 TOPS Tensor Sparse
-
-
-
INT8
30.08 TOPS Tensor
60.16 TOPS Tensor Sparse
-
-
-
-
-
INT32
2.21 TOPS
Pixel Fillrate
14.688 GPixel/s
Pixel Fillrate
34.512 GPixel/s
Texture Fillrate
7.344 GTexel/s
Texture Fillrate
69.024 GTexel/s
L1
64KB/SM Tex
192KB/SM
-
L1
-
-
Unknown
L2
2MB Shared
L2
1MB Shared
-
-
L3
3072MB Shared
Shared Memory
Shared Memory
TDP
Shared
TDP
5W
Max Resolution
Unknown
Max Resolution
3840x2160
Max Resolution Refresh Rate
-
Max Resolution Refresh Rate
60Hz
Variable Refresh Rate
G-Sync
FreeSync
Variable Refresh Rate
-
-
Display Stream Compression (DSC)
Not Supported
Display Stream Compression (DSC)
Not Supported
Multi Monitor Support
Unknown
Multi Monitor Support
1
Model
NVENC 7
Model
Hexagon
Codec
-
-
-
-
-
AVC (H.264)
HEVC (H.265)
Codec
MPEG-4
VC-1
VP8
VP9
H.263
AVC (H.264)
HEVC (H.265)
Model
NVDEC 5
Model
Hexagon
Codec
MPEG-1
MPEG-2
MPEG-4
VC-1
VP8
VP9
AVC (H.264)
HEVC (H.265)
AV1
Codec
MPEG-1
MPEG-2
MPEG-4
VC-1
VP8
VP9
AVC (H.264)
HEVC (H.265)
-
Direct X
12
Direct 3D
12_2
Direct X
12
Direct 3D
12_1
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
OpenGL
3.2
OpenCL
2.0
Vulkan
1.3
Shader Model
6.7
CUDA
8.7
PureVideo HD
VP11
VDPAU
Feature Set K
-
-
-
-
-
-
-
-
Not a Card
Not a Card
Change Comparison