
NVIDIA A800 40GB vs AMD Radeon AI PRO R9700 Full Specs

| Summary | NVIDIA A800 40GB | AMD Radeon AI PRO R9700 |
|---|---|---|
| Shaders | 6,912 | 4,096 |
| Boost Clock | 1.41GHz | 2.92GHz |
| Memory | 40GB HBM2e, 1.55TB/s | 32GB GDDR6, 640GB/s |
| FP32 | 19.49 TFLOPS | 47.84 TFLOPS |
| Form Factor | PCIe Card | PCIe Card |
| TDP | 250W | 300W |
| Power Connectors | 1x 8-Pin EPS | 1x 12-Pin |

| Peak OPS (INT4 Tensor Sparse) | NVIDIA A800 40GB | AMD Radeon AI PRO R9700 |
|---|---|---|
| Peak OPS | 2.5 POPS | 1.53 POPS |
| Relative | x1.63 | x1 |
| Spec | NVIDIA A800 40GB | AMD Radeon AI PRO R9700 |
|---|---|---|
| Tensor FP8-16 | - | 191.4 TFLOPS |
| FP8-16 Tensor Sparse | - | 382.7 TFLOPS |
| Tensor FP8-32 | - | 191.4 TFLOPS |
| FP8-32 Tensor Sparse | - | 382.7 TFLOPS |
| Tensor FP16-16 | 311.9 TFLOPS | 95.68 TFLOPS |
| FP16-16 Tensor Sparse | 623.7 TFLOPS | 191.4 TFLOPS |
| Tensor FP16-32 | 311.9 TFLOPS | 95.68 TFLOPS |
| FP16-32 Tensor Sparse | 623.7 TFLOPS | 191.4 TFLOPS |
| BF16 | 38.98 TFLOPS | 95.68 TFLOPS |
| Tensor BF16 | 311.9 TFLOPS | 95.68 TFLOPS |
| BF16 Tensor Sparse | 623.7 TFLOPS | 191.4 TFLOPS |
| Tensor TF32 | 155.9 TFLOPS | - |
| FP32 | 19.49 TFLOPS | 47.84 TFLOPS |
| FP64 | 9.75 TFLOPS | 1.5 TFLOPS |
| Tensor FP64 | 19.49 TFLOPS | - |
| Tensor INT4 | 1.25 POPS | 765.5 TOPS |
| Tensor INT8 | 623.7 TOPS | 382.7 TOPS |
| Pixel Rate | 225.6 GPixel/s | 373.8 GPixel/s |
| Texture Rate | 609.1 GTexel/s | 747.5 GTexel/s |

| Spec | NVIDIA A800 40GB | AMD Radeon AI PRO R9700 |
|---|---|---|
| Shaders | 6,912 | 4,096 |
| TMUs | 432 | 256 |
| ROPs | 160 | 128 |
| Tensor Cores | 432 | 128 |
| RT Cores | - | 64 |
| SMs / CUs | 108 SMs | 64 CUs |

| Spec | NVIDIA A800 40GB | AMD Radeon AI PRO R9700 |
|---|---|---|
| Base Clock | 765MHz | 2.4GHz |
| Boost Clock | 1.41GHz | 2.92GHz |
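
The listed FP32, pixel, and texture rates appear to follow from the unit counts multiplied by the boost clock. The Python sketch below reproduces them under stated assumptions: the `flops_per_shader_clk` values (2 for the A800, 4 for the R9700) are not given on this page and are chosen because they match the listed FP32 figures. The factor of 4 is presumably a dual-issue FMA rate. The dictionary layout and function name are illustrative only.

```python
# Unit counts and boost clocks copied from the comparison above.
# flops_per_shader_clk is an ASSUMPTION, not a listed spec:
# 2 = one FMA per clock, 4 = presumed dual-issue FMA per clock.
GPUS = {
    "A800 40GB": {
        "shaders": 6912, "tmus": 432, "rops": 160,
        "boost_ghz": 1.41, "flops_per_shader_clk": 2,
    },
    "Radeon AI PRO R9700": {
        "shaders": 4096, "tmus": 256, "rops": 128,
        "boost_ghz": 2.92, "flops_per_shader_clk": 4,
    },
}


def derived_rates(spec):
    """Return theoretical peak rates from unit counts and boost clock."""
    clk = spec["boost_ghz"]
    return {
        # shaders * FLOPs per clock * GHz gives GFLOPS; divide by 1000 for TFLOPS
        "fp32_tflops": spec["shaders"] * spec["flops_per_shader_clk"] * clk / 1000,
        # one pixel per ROP per clock
        "pixel_gpixel_s": spec["rops"] * clk,
        # one texel per TMU per clock
        "texture_gtexel_s": spec["tmus"] * clk,
    }


if __name__ == "__main__":
    for name, spec in GPUS.items():
        rates = derived_rates(spec)
        print(name, {k: round(v, 2) for k, v in rates.items()})
```

These are theoretical peaks at boost clock. Sustained throughput depends on workload, power limits, and memory bandwidth.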

| Spec | NVIDIA A800 40GB | AMD Radeon AI PRO R9700 |
|---|---|---|
| L2 Cache | 41MB shared | 8.2MB shared |
| L3 Cache | - | 64MB shared |
| L3 Bandwidth | - | 2.25TB/s |

| Spec | NVIDIA A800 40GB | AMD Radeon AI PRO R9700 |
|---|---|---|
| Memory | 40GB HBM2e | 32GB GDDR6 |
| Memory Bus | 5120-bit | 256-bit |
| Memory Speed | 2.4GT/s | 20GT/s |
| Memory Bandwidth | 1.55TB/s | 640GB/s |
| ECC | No | No |
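
Memory bandwidth can be estimated as bus width in bytes times the per-pin transfer rate. The Python sketch below applies that relation to the listed bus widths and memory speeds; the function name is illustrative. For the R9700 it gives exactly the listed 640GB/s. For the A800 it gives about 1.54TB/s, slightly under the listed 1.55TB/s, presumably because the listed 2.4GT/s is a rounded figure.

```python
def memory_bandwidth_gb_s(bus_width_bits, speed_gt_s):
    """Peak bandwidth in GB/s: (bus width in bytes) * (transfers per second in GT/s)."""
    return bus_width_bits / 8 * speed_gt_s


if __name__ == "__main__":
    # Bus widths and memory speeds copied from the comparison above.
    print("A800 40GB:", memory_bandwidth_gb_s(5120, 2.4), "GB/s")
    print("Radeon AI PRO R9700:", memory_bandwidth_gb_s(256, 20), "GB/s")
```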

| Spec | NVIDIA A800 40GB | AMD Radeon AI PRO R9700 |
|---|---|---|
| TDP | 250W | 300W |

| Spec | NVIDIA A800 40GB | AMD Radeon AI PRO R9700 |
|---|---|---|
| Multi-Monitor | - | 3 |
| HDCP | - | HDCP 2.3 |
| Display Outputs | - | 3x DisplayPort 2.1, 1x HDMI 2.1 |

| Spec | NVIDIA A800 40GB | AMD Radeon AI PRO R9700 |
|---|---|---|
| Encoder Model | - | VCN 4.0 |
| Decoder Model | 5x NVDEC 4 | VCN 4.0 |

| Spec | NVIDIA A800 40GB | AMD Radeon AI PRO R9700 |
|---|---|---|
| Form Factor | PCIe Card | PCIe Card |
| PCIe | 2-Slots | 2-Slots |
| Height | 111 mm (4.37") | 111 mm (4.37") |
| Width | 267 mm (10.51") | 241 mm (9.49") |
| Depth | 40 mm (1.57") | 40 mm (1.57") |
| Cooling | Passive | Blower, 1x Fan |
| Power Connectors | 1x 8-Pin EPS | 1x 12-Pin |

| Spec | NVIDIA A800 40GB | AMD Radeon AI PRO R9700 |
|---|---|---|
| Manufacturer | NVIDIA | AMD |
| Chip Designer | NVIDIA | AMD |
| Branding | A800 40GB | Radeon Pro 2023 |
| Codename | NV170 | - |
| Chip Variant | - | Navi 48 XL |
| Market Segment | Server | Workstation |
| Release Date | Jun 28, 2021 | May 20, 2025 |

| Spec | NVIDIA A800 40GB | AMD Radeon AI PRO R9700 |
|---|---|---|
| Foundry | TSMC | TSMC |
| Fabrication Node | 7N | N4P |
| Die Size | 826mm² | 357mm² |
| Transistor Count | 54.2 Billion | 53.9 Billion |
| Transistor Density | 65.62 MTr/mm² | 151.2 MTr/mm² |
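
Transistor density is transistor count divided by die area. The Python sketch below recomputes it from the listed counts and die sizes; the function name is illustrative. The A800 result matches the listed 65.62 MTr/mm². The R9700 result comes out near 151.0 MTr/mm² rather than the listed 151.2, presumably because the listed die size or transistor count is rounded.

```python
def transistor_density_mtr_mm2(transistors_billion, die_area_mm2):
    """Density in millions of transistors per square millimetre."""
    # 1 billion = 1000 million
    return transistors_billion * 1000 / die_area_mm2


if __name__ == "__main__":
    # Transistor counts and die sizes copied from the comparison above.
    print("A800 40GB:", round(transistor_density_mtr_mm2(54.2, 826), 2))
    print("Radeon AI PRO R9700:", round(transistor_density_mtr_mm2(53.9, 357), 2))
```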
