Intel Arc Pro A40 vs NVIDIA Quadro T1000 8GB

Intel Arc Pro A40
NVIDIA Quadro T1000 8GB
1024 Shaders
6GB GDDR6
1700MHz
896 Shaders
8GB GDDR6
1400MHz
Peak AI Performance
111.41 TOPS
INT4 Tensor
Peak AI Performance
5.02 TFLOPS
FP16
FP32
3.48 TFLOPS
FP32
2.51 TFLOPS
FP16
6.96 TFLOPS
FP16
5.02 TFLOPS
Form Factor
PCIe Card
2.0-Slots
Form Factor
PCIe Card
1.0-Slots
TDP
50W
TDP
50W

Peak AI Performance

  • 22.19x faster vs Quadro T1000 8GB
  • 95% slower vs Arc Pro A40
Arc Pro A40 - 111.41 TOPS INT4 Tensor
x22.19
Quadro T1000 8GB - 5.02 TFLOPS FP16
x1

FP32

  • 39% faster vs Quadro T1000 8GB
  • 28% slower vs Arc Pro A40
Arc Pro A40 - 3.48 TFLOPS FP32
x1.39
Quadro T1000 8GB - 2.51 TFLOPS FP32
x1

FP16

  • 39% faster vs Quadro T1000 8GB
  • 28% slower vs Arc Pro A40
Arc Pro A40 - 6.96 TFLOPS FP16
x1.39
Quadro T1000 8GB - 5.02 TFLOPS FP16
x1
  • 20% faster vs Quadro T1000 8GB
  • 17% slower vs Arc Pro A40
Arc Pro A40 - 192GB/s
x1.2
Quadro T1000 8GB - 160GB/s
x1
  • Same TDP vs Quadro T1000 8GB
  • Same TDP vs Arc Pro A40
Arc Pro A40 - 50W
x1
Quadro T1000 8GB - 50W
x1
GB6 OpenCL N/A
0%
GB6 Vulkan N/A
0%
GB5 OpenCL 15,155
5%
GB5 OpenCL N/A
0%
OCT Metal 50
8%
OCT Metal N/A
0%
Manufacturer
Intel
Manufacturer
NVIDIA
Chip Designer
Intel
Chip Designer
NVIDIA
Architecture
Alchemist
Architecture
Turing
Family
Arc Pro A
Family
Quadro T
Codename
Xe HPG
ACM-G11
Variant
DG2-128
Codename
NV167
TU117
-
-
Market Segment
Workstation
Market Segment
Workstation
Release Date
8/8/2022
Release Date
5/6/2021
Foundry
TSMC
Foundry
TSMC
Fabrication Node
N6
Fabrication Node
12FFN
Die Size
157 mm²
Die Size
200 mm²
Transistor Count
7.2 Billion
Transistor Count
4.7 Billion
Transistor Density
45.86M/mm²
Transistor Density
23.50M/mm²
Form
PCIe Card
Form
PCIe Card
Shading Units
1024 Shaders
Shading Units
896 Shaders
Texture Mapping Units
64 TMUs
Texture Mapping Units
56 TMUs
Render Output Units
32 ROPs
Render Output Units
32 ROPs
Tensor Cores
128 T-Cores
-
-
Ray-Tracing Cores
8 RT-Cores
-
-
-
-
Streaming Multiprocessors
14 SMs
Execution Units
128 EUs
-
-
Graphics Processing Clusters
8 GPCs
-
-
1500MHz Base
1700MHz
-
1400MHz
Peak AI Performance
111.41 TOPS
INT4 Tensor
Peak AI Performance
5.02 TFLOPS
FP16
FP16
6.96 TFLOPS
27.85 TFLOPS Tensor (FP32 Accumulate)
FP16
5.02 TFLOPS
-
FP32
3.48 TFLOPS
FP32
2.51 TFLOPS
FP64
870 GFLOPS
FP64
80 GFLOPS
BF16
27.85 TFLOPS Tensor
-
-
INT4
111.41 TOPS Tensor
-
-
INT8
55.71 TOPS Tensor
-
-
Pixel Fillrate
54.4 GPixel/s
Pixel Fillrate
44.8 GPixel/s
Texture Fillrate
108.8 GTexel/s
Texture Fillrate
78.4 GTexel/s
L1
-
-
Unknown
L1
32KB/SM Tex
64KB/SM
-
L2
4MB Shared
L2
1MB Shared
6GB
GDDR6
8GB
GDDR6
Bus Width
96Bit
Bus Width
128Bit
Clock
2000MHz
Transfer Rate
16GT/s
Bandwidth
192GB/s
Clock
1250MHz
Transfer Rate
10GT/s
Bandwidth
160GB/s
TDP
50W
TDP
50W
-
4x mini-DisplayPort 2.0
4x mini-DisplayPort 1.4
-
Max Resolution
7680x4320
Max Resolution
7680x4320
Max Resolution Refresh Rate
60Hz
Max Resolution Refresh Rate
120Hz
Variable Refresh Rate
-
FreeSync
Variable Refresh Rate
G-Sync
FreeSync
Display Stream Compression (DSC)
Not Supported
Display Stream Compression (DSC)
Supported
Multi Monitor Support
4
Multi Monitor Support
4
-
-
Content Protection
HDCP 2.2
Model
Arc
Model
NVENC 7
Codec
AVC (H.264)
HEVC (H.265)
AV1
Codec
AVC (H.264)
HEVC (H.265)
-
Model
Arc
Model
NVDEC 4
Codec
-
MPEG-2
-
JPEG
-
-
VP9
AVC (H.264)
HEVC (H.265)
AV1
Codec
MPEG-1
MPEG-2
MPEG-4
-
VC-1
VP8
VP9
AVC (H.264)
HEVC (H.265)
-
Direct X
12
Direct 3D
12_2
Direct X
12
Direct 3D
12_2
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
OpenGL
4.6
OpenCL
3.0
Vulkan
1.2
Shader Model
6.6
-
-
-
-
-
-
Shader Model
6.6
CUDA
7.5
PureVideo HD
VP10
VDPAU
Feature Set J
1x Fan
1x Fan
Slots Required
2.0
PCIe Version
4.0
PCIe Lanes
8
Slots Required
1.0
PCIe Version
3.0
PCIe Lanes
16
-
-
-
-
-
-
Height
69 mm (2.72 in)
Width
150 mm (5.91 in)
Depth
20 mm (0.79 in)
Change Comparison