NVIDIA Tesla C2075 vs Tesla K10 theoretical performance

来源:互联网 发布:子青广场舞网络一线牵 编辑:程序博客网 时间:2024/05/05 07:51

Each graphics unit has several vital theoretical parameters that affect real-world game, 3D graphics and compute performance. These are texture fillrate, pixel fillrate, memory bandwidth, along with single- and double-precision performance. Below you will find why they are important, and which card has better characteristics.

Pixel fill rate (gigapixels/s)

60
48
36
24
12
27.6
 
47.7
  
Higher is better
The Tesla K10 card has more Raster Operations Pipelines (ROPs) than the NVIDIA Tesla C2075. On top of that, its graphics clock rate is higher, therefore its pixel fillrate is substantially higher. Better maximum pixel fill rate means that the GPU can draw more pixels on and off screen each second, increasing overall performance, unless the card is limited by something else, such as texture fillrate, memory bandwidth or CPU speed.
  - NVIDIA Tesla C2075       - NVIDIA Tesla K10

Texture fill rate (gigatexels/s)

300
240
180
120
60
32.2
 
191
  
Higher is better
Because the NVIDIA Tesla K10 graphics unit has many more TMUs (Texture Mapping Units) and higher graphics frequency, its texture fillrate is considerably higher. Better texture fill rate means that the GPU can use more complex 3D effects and/or apply more textures to each textured picture element, which improves games visual appearance.

Single Precision performance(GFLOPS)

6000
4800
3600
2400
1200
1030
 
4577
  
Higher is better
Maximum Single Precision performance indicates how fast the graphics card is at executing programs, that process primarily single-precision floating point data. The performance is expressed in GFLOPS or billions of Floating Point Operations Per Second. Generally, the more stream processors or CUDA cores the graphics card has, and the the faster they run at, the higher Single Precision performance will be. The NVIDIA Tesla K10 GPU has an upper hand here. Higher single-precision performance number means the graphics card will perform better in general computing applications. Since CUDA cores or stream processors are also used as vertex and geometry shaders for 3D image generation, higher performance is also beneficial to games.

Double Precision performance(GFLOPS)

600
480
360
240
120
515
 
191
  
Higher is better
Maximum Double Precision performance is similar to the Single Precision performance, except that it applies to double-precision (64-bit) floating point operations. Since games do not use double-precision arithmetics, this characteristic is unimportant to games performance. The Tesla C2075 is faster when processing 64-bit floating-point numbers.
  - NVIDIA Tesla C2075       - NVIDIA Tesla K10

Memory bandwidth (GB/s)

400
320
240
160
80
144
 
320
  
Higher is better
To speed up processing, the GPUs store 3D scene data, textures and intermediate data, used for image generation, in on-board memory. The video memory usually has much higher bandwidth than system RAM, and more bandwidth allows the GPU to run at higher display resolutions, use larger and more detailed textures, and apply more complex 3D effects and filters. The bandwidth depends on a few components, such as memory type, speed, and memory interface width. Specifically, higher memory bandwidth of the NVIDIA Tesla K10 is due to higher memory clock.

NVIDIA Tesla C2075 vs Tesla K10 specs comparison

All rows with different specifications or features are highlighted.

   

General information

Market segmentHPC / ServerManufacturerNVIDIAModelTesla C2075Tesla K10

Architecture / Interface

Die nameGF1002 x GK104ArchitectureFermiKeplerFabrication process40nm28nmBus interfacePCI-E 2.0 x 16PCI-E 3.0 x 16

Cores / shaders

CUDA cores4483072ROPs4864Pixel fill rate27.6 gigapixels/s47.68 gigapixels/sTexture units56256Texture fill rate32.2 gigatexels/s190.72 gigatexels/sSingle Precision performance1030.4 GFLOPS4577.28 GFLOPSDouble Precision performance515.2 GFLOPS190.72 GFLOPS

Clocks / Memory

Base clock 745 MHzGraphics clock575 MHz Processor clock1150 MHz Memory size6144 MB8192 MBMemory typeGDDR5Memory clock750 MHz1250 MHzMemory interface width384256Memory bandwidth144 GB/s320 GB/s

Other features

Maximum power247 Watt250 Watt

Better values / features are marked with green color, and worse values are in red color.

0 0