Intel Gaudi vs AMD FirePro S7150 X2

| Specification | Intel Gaudi | AMD FirePro S7150 X2 |
| --- | --- | --- |
| Price | - | $3,999 |
| Shaders | 2048 | 2x 1792 |
| Memory | 32GB HBM2 | 32GB (2x 16GB) GDDR5 |
| Clock | 610MHz | 1050MHz |
| Peak AI Performance | 159.91 TFLOPS (BF16 Tensor) | 7.53 TFLOPS (FP32) |
| FP32 | - | 7.53 TFLOPS |
| Form Factor | OAM Module | PCIe Card, 2.0 slots |
| TDP | 350W | 265W |
| Power Connectors | - | 1x 6-Pin, 1x 8-Pin |

| Specification | Intel Gaudi | AMD FirePro S7150 X2 |
| --- | --- | --- |
| FP32 | - | 7.53 TFLOPS |
| FP64 | - | 470 GFLOPS |
| BF16 | 159.91 TFLOPS (Tensor) | - |
| Pixel Fillrate | - | 33.6 GPixel/s |
| Texture Fillrate | - | 117.6 GTexel/s |

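The FirePro S7150 X2 throughput figures can be roughly reproduced from the unit counts and the 1050MHz clock listed on this page. The sketch below is only a sanity check. It assumes the usual conventions of 2 FP32 operations (one fused multiply-add) per shader per clock, one pixel per ROP per clock, and one texel per TMU per clock. The 1/16 FP64 ratio is also an assumption, chosen because it matches the listed 470 GFLOPS. The function and constant names are illustrative and do not come from the page.

```python
# Rough reproduction of the listed FirePro S7150 X2 throughput figures.
# Assumptions (not stated on the page): 2 FP32 ops per shader per clock (FMA),
# 1 pixel per ROP per clock, 1 texel per TMU per clock, FP64 at 1/16 of FP32.

CLOCK_MHZ = 1050        # listed GPU clock
GPUS = 2                # dual-GPU card ("2x" in the listing)
SHADERS_PER_GPU = 1792
ROPS_PER_GPU = 32
TMUS_PER_GPU = 112


def fp32_tflops(shaders: int, clock_mhz: float, ops_per_clock: int = 2) -> float:
    """Peak FP32 throughput in TFLOPS for a shader count and clock."""
    return shaders * ops_per_clock * clock_mhz * 1e6 / 1e12


def fillrate_giga(units: int, clock_mhz: float) -> float:
    """Peak fill rate in giga-elements per second (one element per unit per clock)."""
    return units * clock_mhz * 1e6 / 1e9


fp32_card = fp32_tflops(GPUS * SHADERS_PER_GPU, CLOCK_MHZ)   # both GPUs combined
fp64_card_gflops = fp32_card * 1000 / 16                     # assumed 1/16 rate
pixel_per_gpu = fillrate_giga(ROPS_PER_GPU, CLOCK_MHZ)       # single GPU
texel_per_gpu = fillrate_giga(TMUS_PER_GPU, CLOCK_MHZ)       # single GPU

print(f"FP32 (both GPUs): {fp32_card:.2f} TFLOPS")
print(f"FP64 (both GPUs): {fp64_card_gflops:.0f} GFLOPS")
print(f"Pixel fillrate (one GPU): {pixel_per_gpu:.1f} GPixel/s")
print(f"Texture fillrate (one GPU): {texel_per_gpu:.1f} GTexel/s")
```

Under these assumptions, the listed FP32 and FP64 figures match both GPUs combined, while the listed fill rates match a single GPU.
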
| Specification | Intel Gaudi | AMD FirePro S7150 X2 |
| --- | --- | --- |
| Manufacturer | Intel | AMD |
| Chip Designer | Intel | AMD |
| Architecture | Gaudi | GCN 3 |
| Family | Gaudi | FirePro S |
| Codename | Gaudi | Tonga |
| Variant | HL-2000 | Tonga Pro GL |
| Market Segment | Server | Server |
| Release Date | 6/17/2019 | 2/1/2016 |
| Foundry | TSMC | TSMC |
| Fabrication Node | 16FF | 28nm |
| Die Size | - | 2x 359 mm² |
| Transistor Count | - | 2x 5 Billion |
| Transistor Density | - | 13.93M/mm² |
| Form | OAM Module | PCIe Card |

| Specification | Intel Gaudi | AMD FirePro S7150 X2 |
| --- | --- | --- |
| Shading Units | 2048 Shaders | 2x 1792 Shaders |
| Texture Mapping Units | - | 2x 112 TMUs |
| Render Output Units | - | 2x 32 ROPs |
| Tensor Cores | 8 T-Cores | - |
| Compute Units | 1 CU | 2x 28 CUs |
| Clock | 610MHz | 1050MHz |

| Cache | Intel Gaudi | AMD FirePro S7150 X2 |
| --- | --- | --- |
| L1 | Unknown | 16KB/CU |
| L2 | Unknown | 768KB Shared |

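| Memory | Intel Gaudi | AMD FirePro S7150 X2 |
| --- | --- | --- |
| Size | 32GB | 32GB (2x 16GB) |
| Type | HBM2 | GDDR5 |
| ECC | Yes | - |
| Bus Width | 4096-bit | 256-bit |
| Clock | 980MHz | 1250MHz |
| Transfer Rate | 2GT/s | 5GT/s |
| Bandwidth | 1003.5GB/s | 160GB/s |
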
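The listed bandwidth figures follow from bus width and transfer rate: bandwidth is the bus width in bytes multiplied by the transfer rate. The sketch below is a minimal check. It assumes 2 transfers per clock for HBM2 and 4 transfers per clock for GDDR5 relative to the listed memory clocks. The function name is illustrative.

```python
def bandwidth_gb_s(bus_width_bits: int, clock_mhz: float, transfers_per_clock: int) -> float:
    """Peak memory bandwidth in GB/s from bus width, memory clock, and data rate."""
    transfer_rate_gt_s = clock_mhz * transfers_per_clock / 1000  # GT/s
    return bus_width_bits / 8 * transfer_rate_gt_s               # bytes per transfer * GT/s


# Intel Gaudi: 4096-bit HBM2 at 980MHz, assumed double data rate (1.96 GT/s).
gaudi_bw = bandwidth_gb_s(4096, 980, 2)

# AMD FirePro S7150 X2: 256-bit GDDR5 at 1250MHz, assumed quad data rate (5 GT/s).
firepro_bw = bandwidth_gb_s(256, 1250, 4)

print(f"Intel Gaudi: {gaudi_bw:.1f} GB/s")
print(f"AMD FirePro S7150 X2: {firepro_bw:.1f} GB/s")
```

The Gaudi's listed 2GT/s transfer rate appears to be rounded: the 1003.5GB/s bandwidth corresponds to 1.96GT/s, which is twice the 980MHz clock.
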
| On-Chip Memory | Intel Gaudi | AMD FirePro S7150 X2 |
| --- | --- | --- |
| eSRAM | 24MB, 3200GB/s | - |

| Specification | Intel Gaudi | AMD FirePro S7150 X2 |
| --- | --- | --- |
| TDP | 350W | 265W |
| Display Ports | No Ports | No Ports |
| Max Resolution | Unknown | 4096x2160 |
| Max Resolution Refresh Rate | - | 60Hz |
| Variable Refresh Rate | - | FreeSync |
| Display Stream Compression (DSC) | Not Supported | Not Supported |
| Multi Monitor Support | Unknown | 3 |
| Content Protection | - | HDCP 1.4 |

| Video | Intel Gaudi | AMD FirePro S7150 X2 |
| --- | --- | --- |
| Encoder Model | No Encoders | VCE 3.0 |
| Encode Codecs | - | AVC (H.264), HEVC (H.265) |
| Decoder Model | No Decoders | UVD 5.0 |
| Decode Codecs | - | MPEG-1, MPEG-2, MPEG-4, VC-1, AVC (H.264) |

| API Support | Intel Gaudi | AMD FirePro S7150 X2 |
| --- | --- | --- |
| DirectX | - | 12 |
| Direct3D | - | 12_0 |
| OpenGL | - | 4.6 |
| OpenCL | 3.0 | 2.1 |
| Vulkan | - | 1.2 |
| Shader Model | - | 6.5 |
| GFX | - | 8 |

| Specification | Intel Gaudi | AMD FirePro S7150 X2 |
| --- | --- | --- |
| Slots Required | Not a Card | 2.0 |
| Power Connectors | - | 1x 6-Pin, 1x 8-Pin |
| PCIe Version | 4.0 | 3.0 |
| PCIe Lanes | 16 | 16 |
| Multi GPU Support | Supported (RoCE) | Supported (CrossFire XDMA) |
| Height | - | 111 mm (4.37 in) |
| Width | - | 241 mm (9.49 in) |
| Depth | - | 37 mm (1.46 in) |