NVIDIA Tesla K40X vs AMD FirePro S9170

NVIDIA Tesla K40X - $7,699, 2880 Shaders, 12GB GDDR5, 875MHz
AMD FirePro S9170 - 2816 Shaders, 32GB GDDR5, 930MHz
Peak AI Performance
Tesla K40X - 5.04 TOPS INT32
FirePro S9170 - 5.24 TFLOPS FP32
FP32
Tesla K40X - 5.04 TFLOPS
FirePro S9170 - 5.24 TFLOPS
Form Factor
Tesla K40X - PCIe Card, 2.0 Slots
FirePro S9170 - PCIe Card, 2.0 Slots
TDP
Tesla K40X - 235W
FirePro S9170 - 275W
Power Connectors
Tesla K40X - 1x 6-Pin, 1x 8-Pin
FirePro S9170 - 1x 6-Pin, 1x 8-Pin

Peak AI Performance

  • Tesla K40X: 4% slower vs FirePro S9170
  • FirePro S9170: 4% faster vs Tesla K40X
Tesla K40X - 5.04 TOPS INT32
x1
FirePro S9170 - 5.24 TFLOPS FP32
x1.04

FP32

  • Tesla K40X: 4% slower vs FirePro S9170
  • FirePro S9170: 4% faster vs Tesla K40X
Tesla K40X - 5.04 TFLOPS FP32
x1
FirePro S9170 - 5.24 TFLOPS FP32
x1.04

Memory Bandwidth

  • Tesla K40X: 18% slower vs FirePro S9170
  • FirePro S9170: 22% faster vs Tesla K40X
Tesla K40X - 288GB/s
x1
FirePro S9170 - 352GB/s
x1.22

TDP

  • Tesla K40X: 15% lower vs FirePro S9170
  • FirePro S9170: 17% higher vs Tesla K40X
Tesla K40X - 235W
x1
FirePro S9170 - 275W
x1.17
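
The percentages and multipliers above can be reproduced from the raw figures. The sketch below is one way to do so; the function name `compare` and the rounding to whole percentages are assumptions, not something the listing states.

```python
def compare(a, b):
    """Compare a lower value `a` against a higher value `b`.

    Returns (multiplier of b relative to a,
             percent by which a is below b,
             percent by which b is above a).
    """
    multiplier = round(b / a, 2)
    a_below_b_pct = round((1 - a / b) * 100)
    b_above_a_pct = round((b / a - 1) * 100)
    return multiplier, a_below_b_pct, b_above_a_pct


# Tesla K40X value first, FirePro S9170 value second
print("FP32 TFLOPS:", compare(5.04, 5.24))
print("Bandwidth GB/s:", compare(288, 352))
print("TDP W:", compare(235, 275))
```

Note that the two percentages differ for the same pair because each one uses a different card as its baseline.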
Manufacturer
Tesla K40X - NVIDIA
FirePro S9170 - AMD
Chip Designer
Tesla K40X - NVIDIA
FirePro S9170 - AMD
Architecture
Tesla K40X - Kepler
FirePro S9170 - GCN 2
Family
Tesla K40X - Tesla K
FirePro S9170 - FirePro S
Codename
Tesla K40X - NVF0, GK180
FirePro S9170 - Hawaii
Variant
FirePro S9170 - Grenada XT GL
Market Segment
Tesla K40X - Server
FirePro S9170 - Server
Release Date
Tesla K40X - 11/22/2013
FirePro S9170 - 7/8/2015
Foundry
Tesla K40X - TSMC
FirePro S9170 - TSMC
Fabrication Node
Tesla K40X - 28nm
FirePro S9170 - 28nm
Die Size
Tesla K40X - 561 mm²
FirePro S9170 - 438 mm²
Transistor Count
Tesla K40X - 7.1 Billion
FirePro S9170 - 6.2 Billion
Transistor Density
Tesla K40X - 12.62M/mm²
FirePro S9170 - 14.16M/mm²
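
Transistor density is the transistor count divided by the die area. The sketch below applies that to the rounded counts listed here; the helper name `transistor_density` is hypothetical. With the rounded 7.1 billion figure the Tesla K40X comes out near 12.66M/mm² rather than the listed 12.62M/mm², which may mean the listing was computed from an unrounded count (an assumption, not something the source confirms).

```python
def transistor_density(transistors_millions, die_area_mm2):
    """Return transistor density in millions of transistors per mm²."""
    return transistors_millions / die_area_mm2


# Rounded transistor counts from the listing, expressed in millions
k40x = transistor_density(7100, 561)
s9170 = transistor_density(6200, 438)

print(f"Tesla K40X: {k40x:.2f}M/mm²")
print(f"FirePro S9170: {s9170:.2f}M/mm²")
```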
Form
Tesla K40X - PCIe Card
FirePro S9170 - PCIe Card
Shading Units
Tesla K40X - 2880 Shaders
FirePro S9170 - 2816 Shaders
Texture Mapping Units
Tesla K40X - 240 TMUs
FirePro S9170 - 176 TMUs
Render Output Units
Tesla K40X - 48 ROPs
FirePro S9170 - 64 ROPs
Streaming Multiprocessors
Tesla K40X - 15 SMs
Compute Units
FirePro S9170 - 44 CUs
Clock Speed
Tesla K40X - 745MHz Base, 875MHz Boost
FirePro S9170 - 930MHz
Peak AI Performance
Tesla K40X - 5.04 TOPS INT32
FirePro S9170 - 5.24 TFLOPS FP32
FP32
Tesla K40X - 5.04 TFLOPS
FirePro S9170 - 5.24 TFLOPS
FP64
Tesla K40X - 1.68 TFLOPS
FirePro S9170 - 2.62 TFLOPS
INT32
Tesla K40X - 5.04 TOPS
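
The FP32 figures match the usual peak-throughput estimate: shader count × 2 operations per clock (one fused multiply-add) × clock speed. The sketch below reproduces them. The function name `peak_tflops` is hypothetical, and the FP64 ratios (1/3 for the Tesla K40X, 1/2 for the FirePro S9170) are inferred from the listed numbers rather than stated by the listing.

```python
def peak_tflops(shaders, clock_mhz, ops_per_clock=2):
    """Theoretical peak throughput in TFLOPS.

    ops_per_clock=2 assumes one fused multiply-add per shader per clock.
    """
    return shaders * ops_per_clock * clock_mhz / 1e6


k40x_fp32 = peak_tflops(2880, 875)
s9170_fp32 = peak_tflops(2816, 930)

# FP64:FP32 ratios inferred from the listed figures (assumption)
k40x_fp64 = k40x_fp32 / 3
s9170_fp64 = s9170_fp32 / 2

print(f"Tesla K40X: {k40x_fp32:.2f} FP32, {k40x_fp64:.2f} FP64 TFLOPS")
print(f"FirePro S9170: {s9170_fp32:.2f} FP32, {s9170_fp64:.2f} FP64 TFLOPS")
```

These are theoretical ceilings; sustained throughput in real workloads is normally lower.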
Pixel Fillrate
Tesla K40X - 42 GPixel/s
FirePro S9170 - 59.52 GPixel/s
Texture Fillrate
Tesla K40X - 210 GTexel/s
FirePro S9170 - 163.68 GTexel/s
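
Both fillrates follow from unit count × clock speed: ROPs for pixels, TMUs for texels. A minimal sketch, with the helper name `fillrate_g` chosen only for illustration:

```python
def fillrate_g(units, clock_mhz):
    """Peak fillrate in giga-operations per second (GPixel/s or GTexel/s)."""
    return units * clock_mhz / 1000


# Pixel fillrate: ROPs x clock
k40x_pixel = fillrate_g(48, 875)
s9170_pixel = fillrate_g(64, 930)

# Texture fillrate: TMUs x clock
k40x_texel = fillrate_g(240, 875)
s9170_texel = fillrate_g(176, 930)

print(f"Tesla K40X: {k40x_pixel:.2f} GPixel/s, {k40x_texel:.2f} GTexel/s")
print(f"FirePro S9170: {s9170_pixel:.2f} GPixel/s, {s9170_texel:.2f} GTexel/s")
```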
L1
Tesla K40X - 48KB/SM Tex, 16KB/SM
FirePro S9170 - 16KB/CU
L2
Tesla K40X - 1.5MB Shared
FirePro S9170 - 1MB Shared
Memory
Tesla K40X - 12GB GDDR5
FirePro S9170 - 32GB GDDR5
Bus Width
Tesla K40X - 384-bit
FirePro S9170 - 512-bit
Memory Clock
Tesla K40X - 1500MHz
FirePro S9170 - 1375MHz
Transfer Rate
Tesla K40X - 6GT/s
FirePro S9170 - 5.5GT/s
Bandwidth
Tesla K40X - 288GB/s
FirePro S9170 - 352GB/s
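
The memory figures are linked by two relations: GDDR5 moves four transfers per memory-clock cycle, and bandwidth is the bus width in bytes × the transfer rate. The sketch below checks the listed values; the function names are hypothetical.

```python
def transfer_rate_gts(memory_clock_mhz, transfers_per_clock=4):
    """Effective transfer rate in GT/s (GDDR5 is quad-pumped)."""
    return memory_clock_mhz * transfers_per_clock / 1000


def bandwidth_gbs(bus_width_bits, transfer_gts):
    """Peak memory bandwidth in GB/s: bytes per transfer x transfers per second."""
    return bus_width_bits / 8 * transfer_gts


k40x_rate = transfer_rate_gts(1500)
s9170_rate = transfer_rate_gts(1375)

print(f"Tesla K40X: {k40x_rate} GT/s, {bandwidth_gbs(384, k40x_rate)} GB/s")
print(f"FirePro S9170: {s9170_rate} GT/s, {bandwidth_gbs(512, s9170_rate)} GB/s")
```

The FirePro S9170 runs slower memory but its wider 512-bit bus more than makes up the difference.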
TDP
Tesla K40X - 235W
FirePro S9170 - 275W
Max Resolution
Tesla K40X - Unknown
FirePro S9170 - 4096x2160
Max Resolution Refresh Rate
FirePro S9170 - 60Hz
Variable Refresh Rate
Tesla K40X - G-Sync, FreeSync
FirePro S9170 - FreeSync
Display Stream Compression (DSC)
Tesla K40X - Not Supported
FirePro S9170 - Not Supported
Multi Monitor Support
Tesla K40X - Unknown
FirePro S9170 - 3
Content Protection
FirePro S9170 - HDCP 1.4
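Video Encoder
Tesla K40X - NVENC 1
FirePro S9170 - VCE 2.0
Encode Codecs
Tesla K40X - AVC (H.264)
FirePro S9170 - AVC (H.264)
Video Decoder
Tesla K40X - NVDEC 1
FirePro S9170 - UVD 4.2
Decode Codecs
Tesla K40X - MPEG-1, MPEG-2, MPEG-4, VC-1, AVC (H.264)
FirePro S9170 - MPEG-1, MPEG-2, MPEG-4, VC-1, AVC (H.264)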
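DirectX
Tesla K40X - 12
FirePro S9170 - 12
Direct3D
Tesla K40X - 11_0
FirePro S9170 - 12_0
OpenGL
Tesla K40X - 4.6
FirePro S9170 - 4.6
OpenCL
Tesla K40X - 3.0
FirePro S9170 - 2.1
Vulkan
Tesla K40X - 1.2
FirePro S9170 - 1.2
Shader Model
Tesla K40X - 6.5
FirePro S9170 - 6.5
CUDA
Tesla K40X - 3.5
PureVideo HD
Tesla K40X - VP5
VDPAU
Tesla K40X - Feature Set D
GFX
FirePro S9170 - 7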
Power Connectors
Tesla K40X - 1x 6-Pin, 1x 8-Pin
FirePro S9170 - 1x 6-Pin, 1x 8-Pin
Slots Required
Tesla K40X - 2.0
FirePro S9170 - 2.0
PCIe Version
Tesla K40X - 3.0
FirePro S9170 - 3.0
PCIe Lanes
Tesla K40X - 16
FirePro S9170 - 16
Multi GPU Support
Tesla K40X - Supported
FirePro S9170 - Supported (Type: 4-way CrossFireX)
Height
Tesla K40X - 111 mm (4.37 in)
FirePro S9170 - 111 mm (4.37 in)
Width
Tesla K40X - 267 mm (10.51 in)
FirePro S9170 - 267 mm (10.51 in)
Depth
Tesla K40X - 39 mm (1.54 in)
FirePro S9170 - 37 mm (1.46 in)