NVIDIA Tesla M4 vs NVIDIA GRID M60-1Q

NVIDIA Tesla M4
NVIDIA GRID M60-1Q
1024 Shaders
4GB GDDR5
1072MHz
2048 Shaders
1GB GDDR5
557MHz
Peak AI Performance
8.78 TOPS
INT8
Peak AI Performance
9.13 TOPS
INT8
FP32
2.19 TFLOPS
FP32
2.28 TFLOPS
Form Factor
PCIe Card
1.0-Slots
Form Factor
PCIe Card
2.0-Slots
TDP
75W
TDP
225W
-
-
Power Connectors
1x 8-Pin

Peak AI Performance

  • 4% slower vs GRID M60-1Q
  • 4% faster vs Tesla M4
Tesla M4 - 8.78 TOPS INT8
x1
GRID M60-1Q - 9.13 TOPS INT8
x1.04

FP32

  • 4% slower vs GRID M60-1Q
  • 4% faster vs Tesla M4
Tesla M4 - 2.19 TFLOPS FP32
x1
GRID M60-1Q - 2.28 TFLOPS FP32
x1.04
  • 45% slower vs GRID M60-1Q
  • 82% faster vs Tesla M4
Tesla M4 - 88GB/s
x1
GRID M60-1Q - 160.4GB/s
x1.82
  • 67% lower vs GRID M60-1Q
  • 200% higher vs Tesla M4
Tesla M4 - 75W
x1
GRID M60-1Q - 225W
x3
Manufacturer
NVIDIA
Manufacturer
NVIDIA
Chip Designer
NVIDIA
Chip Designer
NVIDIA
Architecture
Maxwell 2
Architecture
Maxwell 2
Family
Tesla M
Family
GRID
Codename
NV126
GM206
Codename
NV124
GM204
Market Segment
Server
Market Segment
Server
Release Date
11/10/2015
Release Date
8/30/2015
Foundry
TSMC
Foundry
TSMC
Fabrication Node
28nm
Fabrication Node
28nm
Die Size
227 mm²
Die Size
398 mm²
Transistor Count
8 Billion
Transistor Count
5.2 Billion
Transistor Density
35.24M/mm²
Transistor Density
13.07M/mm²
Form
PCIe Card
Form
PCIe Card
Shading Units
1024 Shaders
Shading Units
2048 Shaders
Texture Mapping Units
64 TMUs
Texture Mapping Units
128 TMUs
Render Output Units
32 ROPs
Render Output Units
64 ROPs
Streaming Multiprocessors
8 SMs
Streaming Multiprocessors
16 SMs
872MHz Base
1072MHz
-
557MHz
Peak AI Performance
8.78 TOPS
INT8
Peak AI Performance
9.13 TOPS
INT8
FP32
2.19 TFLOPS
FP32
2.28 TFLOPS
FP64
70 GFLOPS
FP64
70 GFLOPS
INT8
8.78 TOPS
INT8
9.13 TOPS
INT32
2.19 TOPS
INT32
2.28 TOPS
Pixel Fillrate
34.304 GPixel/s
Pixel Fillrate
35.648 GPixel/s
Texture Fillrate
68.608 GTexel/s
Texture Fillrate
71.296 GTexel/s
L1
48KB/SM
L1
48KB/SM
L2
1MB Shared
L2
2MB Shared
4GB
GDDR5
1GB
GDDR5
Bus Width
128Bit
Bus Width
256Bit
Clock
1375MHz
Transfer Rate
5.5GT/s
Bandwidth
88GB/s
Clock
1253MHz
Transfer Rate
5GT/s
Bandwidth
160.4GB/s
TDP
75W
TDP
225W
Max Resolution
Unknown
Max Resolution
Unknown
Variable Refresh Rate
G-Sync
FreeSync
Variable Refresh Rate
G-Sync
FreeSync
Display Stream Compression (DSC)
Not Supported
Display Stream Compression (DSC)
Not Supported
Multi Monitor Support
Unknown
Multi Monitor Support
Unknown
Model
NVENC 3
Model
2x NVENC 3
Codec
AVC (H.264)
HEVC (H.265)
Codec
AVC (H.264)
HEVC (H.265)
Model
NVDEC 2
Model
NVDEC 1
Codec
MPEG-1
MPEG-2
MPEG-4
VC-1
VP8
VP9
AVC (H.264)
HEVC (H.265)
Codec
MPEG-1
MPEG-2
MPEG-4
VC-1
-
-
AVC (H.264)
-
Direct X
12
Direct 3D
12_1
Direct X
12
Direct 3D
12_1
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
Shader Model
6.7
CUDA
5.2
PureVideo HD
VP7
VDPAU
Feature Set F
Shader Model
6.7
CUDA
5.2
PureVideo HD
VP6
VDPAU
Feature Set E
-
-
Power Connectors
1x 8-Pin
Slots Required
1.0
PCIe Version
3.0
PCIe Lanes
16
Slots Required
2.0
PCIe Version
3.0
PCIe Lanes
16
Multi GPU Support
Supported
-
-
Height
69 mm (2.72 in)
Width
150 mm (5.91 in)
Depth
20 mm (0.79 in)
Height
111 mm (4.37 in)
Width
267 mm (10.51 in)
Depth
40 mm (1.57 in)
Change Comparison