NVIDIA Quadro P4000 Max-Q vs Radeon Pro Vega 20

NVIDIA Quadro P4000 Max-Q
Radeon Pro Vega 20
1792 Shaders
8GB GDDR5
1228MHz
1280 Shaders
4GB HBM2
1283MHz
Peak AI Performance
17.61 TOPS
INT8
Peak AI Performance
6.57 TFLOPS
FP16
FP32
4.4 TFLOPS
FP32
3.28 TFLOPS
FP16
70 GFLOPS
FP16
6.57 TFLOPS
Form Factor
Soldered
Form Factor
Soldered
TDP
80W
TDP
50W

Peak AI Performance

  • 2.68x faster vs Radeon Pro Vega 20
  • 63% slower vs Quadro P4000 Max-Q
Quadro P4000 Max-Q - 17.61 TOPS INT8
x2.68
Radeon Pro Vega 20 - 6.57 TFLOPS FP16
x1

FP32

  • 34% faster vs Radeon Pro Vega 20
  • 25% slower vs Quadro P4000 Max-Q
Quadro P4000 Max-Q - 4.4 TFLOPS FP32
x1.34
Radeon Pro Vega 20 - 3.28 TFLOPS FP32
x1

FP16

  • 99% slower vs Radeon Pro Vega 20
  • 93.86x faster vs Quadro P4000 Max-Q
Quadro P4000 Max-Q - 70 GFLOPS FP16
x1
Radeon Pro Vega 20 - 6.57 TFLOPS FP16
x93.86
  • 1% faster vs Radeon Pro Vega 20
  • 1% slower vs Quadro P4000 Max-Q
Quadro P4000 Max-Q - 192GB/s
x1.01
Radeon Pro Vega 20 - 189.4GB/s
x1
  • 60% higher vs Radeon Pro Vega 20
  • 38% lower vs Quadro P4000 Max-Q
Quadro P4000 Max-Q - 80W
x1.6
Radeon Pro Vega 20 - 50W
x1
GB6 OpenCL N/A
0%
GB6 Metal N/A
0%
Manufacturer
NVIDIA
Manufacturer
Apple
Chip Designer
NVIDIA
Chip Designer
AMD
Architecture
Pascal
Architecture
GCN 5
Family
Quadro P
Family
Radeon Pro Vega
Codename
NV134
GP104
-
-
Codename
Treasure
Vega 12
Variant
Vega 12 XTA
Market Segment
Laptop
Market Segment
Laptop
Release Date
1/11/2017
Release Date
11/14/2018
Foundry
TSMC
Foundry
TSMC
Fabrication Node
16FF
Fabrication Node
28nm
Die Size
314 mm²
Die Size
125 mm²
Transistor Count
7.2 Billion
Transistor Count
1.6 Billion
Transistor Density
22.93M/mm²
Transistor Density
12.40M/mm²
Form
Soldered
Form
Soldered
Shading Units
1792 Shaders
Shading Units
1280 Shaders
Texture Mapping Units
112 TMUs
Texture Mapping Units
80 TMUs
Render Output Units
64 ROPs
Render Output Units
32 ROPs
Streaming Multiprocessors
14 SMs
-
-
-
-
Compute Units
20 CUs
1114MHz Base
1228MHz
815MHz Base
1283MHz
Peak AI Performance
17.61 TOPS
INT8
Peak AI Performance
6.57 TFLOPS
FP16
FP16
70 GFLOPS
FP16
6.57 TFLOPS
FP32
4.4 TFLOPS
FP32
3.28 TFLOPS
FP64
140 GFLOPS
FP64
210 GFLOPS
INT8
17.61 TOPS
-
-
Pixel Fillrate
78.592 GPixel/s
Pixel Fillrate
41.056 GPixel/s
Texture Fillrate
137.536 GTexel/s
Texture Fillrate
102.64 GTexel/s
L1
48KB/SM
-
L1
-
16KB/CU
L2
2MB Shared
L2
256KB Shared
8GB
GDDR5
4GB
HBM2
Bus Width
256Bit
Bus Width
1024Bit
Clock
1500MHz
Transfer Rate
6GT/s
Bandwidth
192GB/s
Clock
740MHz
Transfer Rate
1.5GT/s
Bandwidth
189.4GB/s
TDP
80W
TDP
50W
Max Resolution
7680x4320
Max Resolution
4096x2160
Max Resolution Refresh Rate
30Hz
Max Resolution Refresh Rate
60Hz
Variable Refresh Rate
G-Sync
FreeSync
Variable Refresh Rate
-
FreeSync
Display Stream Compression (DSC)
Not Supported
Display Stream Compression (DSC)
Not Supported
Multi Monitor Support
3
Multi Monitor Support
3
-
-
Content Protection
HDCP 1.4
Model
2x NVENC 4
No Encoders
-
Codec
AVC (H.264)
HEVC (H.265)
-
-
-
Model
NVDEC 3
No Decoders
Codec
MPEG-1
MPEG-2
MPEG-4
VC-1
VP9
AVC (H.264)
HEVC (H.265)
-
-
-
-
-
-
-
Direct X
12
Direct 3D
12_1
Direct X
12
Direct 3D
12_0
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
OpenGL
4.6
OpenCL
2.1
Vulkan
1.2
Shader Model
6.7
CUDA
6.1
-
-
PureVideo HD
VP8
VDPAU
Feature Set H
Shader Model
6.5
-
-
GFX
8
-
-
-
-
Not a Card
Not a Card
PCIe Version
3.0
PCIe Lanes
16
PCIe Version
3.0
PCIe Lanes
16
Change Comparison