AMD Radeon E9260 vs NVIDIA Tesla P100

AMD Radeon E9260
NVIDIA Tesla P100 $5,699
896 Shaders
4GB GDDR5
1219MHz
3584 Shaders
16GB HBM2
1303MHz
Peak AI Performance
2.18 TFLOPS
FP32
Peak AI Performance
18.68 TFLOPS
FP16
FP32
2.18 TFLOPS
FP32
9.34 TFLOPS
-
-
FP16
18.68 TFLOPS
Form Factor
MXM-A
-
Form Factor
PCIe Card
2.0-Slots
TDP
80W
TDP
250W
-
-
-
-
-
Power Connectors
-
1x 8-Pin
-
-
GB6 OpenCL N/A
0%
GB6 Metal N/A
0%
GB6 Metal N/A
0%
GB6 Vulkan N/A
0%
GB6 Vulkan N/A
0%
GB5 OpenCL N/A
0%
GB5 OpenCL N/A
0%
GB5 CUDA N/A
0%
GB5 CUDA N/A
0%
GB5 Metal N/A
0%
GB5 Metal N/A
0%
GB5 Vulkan N/A
0%
GB5 Vulkan N/A
0%
OCT 2020.1 N/A
0%
OCT 2020.1 N/A
0%
OCT Metal N/A
0%
OCT Metal N/A
0%
Peak AI Performance
2.18 TFLOPS
FP32
Peak AI Performance
18.68 TFLOPS
FP16
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
FP16
18.68 TFLOPS
-
-
-
-
FP32
2.18 TFLOPS
-
-
FP32
9.34 TFLOPS
-
-
FP64
140 GFLOPS
-
FP64
4.67 TFLOPS
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Pixel Fillrate
19.504 GPixel/s
Pixel Fillrate
166.784 GPixel/s
-
-
-
-
Texture Fillrate
58.512 GTexel/s
Texture Fillrate
291.872 GTexel/s
Manufacturer
AMD
Manufacturer
NVIDIA
Chip Designer
AMD
Chip Designer
NVIDIA
Architecture
GCN 4
Architecture
Pascal
Family
Embedded
Family
Tesla P
Codename
Baffin
Polaris 11
Variant
Polaris 11 XL
Codename
NV130
GP100
Variant
GP100-893-A1
Market Segment
Server
Market Segment
Server
Release Date
9/1/2016
Release Date
6/20/2016
Foundry
GlobalFoundries
-
Foundry
TSMC
-
Fabrication Node
14LPP
-
Fabrication Node
16FF
-
Die Size
123 mm²
-
Die Size
610 mm²
-
Transistor Count
3 Billion
-
Transistor Count
15.3 Billion
-
Transistor Density
24.39M/mm²
-
Transistor Density
25.08M/mm²
-
Form
MXM-A
Form
PCIe Card
Shading Units
896 Shaders
-
Shading Units
3584 Shaders
-
Texture Mapping Units
48 TMUs
Texture Mapping Units
224 TMUs
Render Output Units
16 ROPs
Render Output Units
128 ROPs
-
-
-
-
-
-
-
-
-
-
Streaming Multiprocessors
56 SMs
Compute Units
14 CUs
-
-
-
-
-
-
-
-
-
-
-
-
1124MHz Base
1219MHz
-
-
1126MHz Base
1303MHz
-
-
-
-
L1
-
-
16KB/CU
-
L1
-
64KB/SM
-
-
L2
1MB Shared
L2
4MB Shared
-
-
-
-
-
-
4GB
GDDR5
-
16GB
HBM2
-
Bus Width
128Bit
Bus Width
4096Bit
Clock
1750MHz
Transfer Rate
7GT/s
Bandwidth
112GB/s
Clock
715MHz
Transfer Rate
1.4GT/s
Bandwidth
732.2GB/s
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
TDP
80W
TDP
250W
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
No Ports
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
No Ports
Max Resolution
5120x2880
Max Resolution
Unknown
Max Resolution Refresh Rate
60Hz
Max Resolution Refresh Rate
-
Variable Refresh Rate
-
FreeSync
-
Variable Refresh Rate
G-Sync
FreeSync
-
Display Stream Compression (DSC)
Not Supported
Display Stream Compression (DSC)
Not Supported
Multi Monitor Support
3
Multi Monitor Support
Unknown
Content Protection
HDCP 2.2
-
-
Model
VCE 3.4
Model
3x NVENC 4
Codec
-
-
-
-
-
-
-
-
AVC (H.264)
HEVC (H.265)
-
-
-
-
Codec
-
-
-
-
-
-
-
-
AVC (H.264)
HEVC (H.265)
-
-
-
-
Model
UVD 6.3
Model
NVDEC 3
Codec
MPEG-1
MPEG-2
MPEG-4
JPEG
VC-1
-
-
-
AVC (H.264)
HEVC (H.265)
-
-
-
-
Codec
MPEG-1
MPEG-2
MPEG-4
-
VC-1
-
VP9
-
AVC (H.264)
HEVC (H.265)
-
-
-
-
Direct X
12
Direct 3D
12_0
Direct X
12
Direct 3D
12_1
OpenGL
4.6
OpenCL
2.1
Vulkan
1.3
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
Shader Model
6.7
-
-
GFX
8
-
-
-
-
Shader Model
6.0
CUDA
6.0
-
-
PureVideo HD
VP8
VDPAU
Feature Set H
Not a Card
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Power Connectors
-
-
-
1x 8-Pin
-
-
-
-
-
PCIe Version
3.0
PCIe Lanes
8
Slots Required
2.0
PCIe Version
3.0
PCIe Lanes
16
-
-
-
-
Multi GPU Support
Supported
-
-
-
-
-
-
-
-
Height
111 mm (4.37 in)
Width
267 mm (10.51 in)
Depth
40 mm (1.57 in)
Change Comparison