HiSilicon Maleoon 910 vs NVIDIA GA10B 16SM 625MHz

HiSilicon Maleoon 910
NVIDIA GA10B 16SM 625MHz
1024 Shaders
Shared Memory
750MHz
1024 Shaders
Shared Memory
625MHz
Peak Performance
3.07 TFLOPS
FP16
Peak Performance
81.92 TOPS
INT4 Tensor Sparse
FP32
1.54 TFLOPS
FP32
1.28 TFLOPS
FP16
3.07 TFLOPS
FP16
2.56 TFLOPS
Form Factor
iGPU
-
Form Factor
iGPU
-
TDP
Shared
TDP
Shared
-
-
-
-
-
-
-
-
-
-
GB6 OpenCL N/A
0%
GB6 OpenCL N/A
0%
GB6 Metal N/A
0%
GB6 Metal N/A
0%
GB6 Vulkan N/A
0%
GB6 Vulkan N/A
0%
GB5 OpenCL N/A
0%
GB5 OpenCL N/A
0%
GB5 CUDA N/A
0%
GB5 CUDA N/A
0%
GB5 Metal N/A
0%
GB5 Metal N/A
0%
GB5 Vulkan N/A
0%
GB5 Vulkan N/A
0%
OCT 2020.1 N/A
0%
OCT 2020.1 N/A
0%
OCT Metal N/A
0%
OCT Metal N/A
0%
Manufacturer
HiSilicon
Manufacturer
NVIDIA
Chip Designer
HiSilicon
Chip Designer
NVIDIA
Architecture
Maleoon 1
Architecture
Ampere
Family
-
Family
Jetson
-
-
-
-
-
Codename
Hercules
GA10B
-
-
Market Segment
Mobile
Market Segment
Mobile
Release Date
6/1/2024
Release Date
1/1/2023
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Form
iGPU
Form
iGPU
Shading Units
1024 Shaders
-
Shading Units
1024 Shaders
-
Texture Mapping Units
128 TMUs
Texture Mapping Units
8 TMUs
Render Output Units
64 ROPs
Render Output Units
16 ROPs
-
-
Tensor Cores
32 T-Cores
-
-
-
-
-
-
Streaming Multiprocessors
8 SMs
-
-
-
-
Execution Units
64 EUs
-
-
Graphics Processing Clusters
4 GPCs
Graphics Processing Clusters
1 GPC
-
-
-
750MHz
-
-
-
625MHz
Peak Performance
3.07 TFLOPS
FP16
Peak Performance
81.92 TOPS
INT4 Tensor Sparse
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
FP16
3.07 TFLOPS
-
-
-
-
FP16
2.56 TFLOPS
10.24 TFLOPS Tensor (FP16 Accumulate)
20.48 TFLOPS Tensor (FP16 Accumulate) Sparse
10.24 TFLOPS Tensor (FP32 Accumulate)
20.48 TFLOPS Tensor (FP32 Accumulate) Sparse
FP32
1.54 TFLOPS
-
-
FP32
1.28 TFLOPS
-
-
-
-
-
-
-
-
-
-
-
-
BF16
-
10.24 TFLOPS Tensor
20.48 TFLOPS Tensor Sparse
-
-
-
TF32
5.12 TFLOPS Tensor
10.24 TFLOPS Tensor Sparse
-
-
-
INT4
40.96 TOPS Tensor
81.92 TOPS Tensor Sparse
-
-
-
-
INT8
-
20.48 TOPS Tensor
40.96 TOPS Tensor Sparse
-
-
-
-
-
-
-
-
Pixel Fillrate
48 GPixel/s
Pixel Fillrate
10 GPixel/s
-
-
-
-
Texture Fillrate
96 GTexel/s
Texture Fillrate
5 GTexel/s
-
-
-
-
L1
-
-
-
Unknown
L1
64KB/SM Tex
192KB/SM
-
-
L2
Unknown
L2
2MB Shared
-
-
-
-
-
-
Shared Memory
-
-
Shared Memory
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
TDP
Shared
TDP
Shared
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
No Ports
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
No Ports
Max Resolution
Unknown
Max Resolution
Unknown
Max Resolution Refresh Rate
-
Max Resolution Refresh Rate
-
Variable Refresh Rate
-
-
-
Variable Refresh Rate
G-Sync
FreeSync
-
Display Stream Compression (DSC)
Not Supported
Display Stream Compression (DSC)
Not Supported
Multi Monitor Support
Unknown
Multi Monitor Support
Unknown
-
-
-
-
No Encoders
-
Model
NVENC 7
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Codec
-
-
-
-
-
-
-
-
AVC (H.264)
HEVC (H.265)
-
-
-
-
No Decoders
Model
NVDEC 5
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Codec
MPEG-1
MPEG-2
MPEG-4
-
VC-1
VP8
VP9
-
AVC (H.264)
HEVC (H.265)
-
AV1
-
-
-
-
-
-
Direct X
12
Direct 3D
12_2
-
-
-
-
-
-
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
-
-
-
-
-
-
-
-
-
-
Shader Model
6.7
CUDA
8.7
-
-
PureVideo HD
VP11
VDPAU
Feature Set K
Not a Card
-
-
-
Not a Card
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Change Comparison