GPU Boost on NVIDIA’s Tesla K40 GPU
来源:互联网 发布:电影网络版权价格 编辑:程序博客网 时间:2024/04/29 03:20
What is GPU Boost?
GPU Boost is a new user controllable feature to change the processor clock speed on the Tesla K40 GPU. NVIDIA is currently supporting 4 selectable Stream Processor clock speeds and two selectable Memory Clock Speeds on the K40.
GPU Boost is useful as not all applications have the same power profile. The K40 has a maximum 235W power capacity. For example, an application that runs at an average power consumption of 180W at the base frequency will have a 55W power headroom. By increasing the clock frequency, the application theoretically can take advantage of the full 235W capacity.
Enabling GPU Boost
GPU Boost is controlled using NVIDIA’s System Management Interface utility (nvidia-smi
) with the following commands:
nvidia-smi –q –d SUPPORTED_CLOCKS
Show Supported Clock Frequenciesnvidia-smi –ac
Set the Memory and Graphics Clock Frequencynvidia-smi –q –d CLOCK
Shows current modenvidia-smi –rac
Resets all clocksnvidia-smi –acp 0
Allows non-root to change clocksOn the K40, an nvidia-smi
query to find the supported clock frequencies gives the following output:
[srahim@corsair3 ~]$ nvidia-smi -q -d SUPPORTED_CLOCKS
==============NVSMI LOG==============
Timestamp
Driver Version
Attached GPUs
GPU 0000:05:00.0
GPU 0000:42:00.0
To set the clock speed to 666MHz, run
[srahim@corsair3 ~]$ sudo nvidia-smi –ac 3004,666
Applications clocks set to "(MEM 3004, SM 666)" for GPU 0000:05:00.0
All done.
If you try to set an unsupported clock speed, nvidia-smi shows a helpful message.
[srahim@corsair3 ~]$ sudo nvidia-smi -ac 3004,888
Specified clock combination "(MEM 3004, SM 888)" is not supported for GPU 0000:05:00.0. Run 'nvidia-smi -q -d SUPPORTED_CLOCKS' to see list of supported clock combinations
Terminating early due to previous errors.
The current clock speed is checked with the following command:
[srahim@corsair3 ~]$ nvidia-smi -q -d CLOCK
==============NVSMI LOG==============
Timestamp
Driver Version
Attached GPUs
GPU 0000:05:00.0
The GPU Boost settings are not persistent between reboots or driver unloads and should be scripted if persistence is desired. Unless the NVIDIA driver persistence mode is set with nvidia-smi –pm 1
, the driver may unload when the GPU is idle.
Unlike Intel’s Turbo Boost, GPU Boost is not on by default. This puts the impetus on the end user or system administrator to take advantage of this feature. An application can programmatically change the Boost clock with NVML if run with appropriate permissions. Run nvidia-smi –acp 0
to grant non-root users permission to change clocks. Two caveats on the uses of GPU Boost from the document “NVIDIA GPU Boost for Tesla” are:
- An important point to remember is that no matter which clocks the end user selects, if at any time the power monitoring algorithm detects that the application may exceed the 235 W, the GPU comes down to a lower clock level as a precaution. Once the power falls below 235 W the GPU will raise its core clock to the selected clock. This happens automatically and the Tesla K40 does have a few clock levels below the base clock to handle any power digressions.
- If the workload runs on multiple GPUs and is sensitive to all GPUs running at the same clock, the user may need to try out which particular clock works best for all GPUs.
If you are not running on multiple GPUs or multiple nodes, you should crank up the GPU clock speed to the maximum frequency. Keep the GPU clock at the default value of 745MHz or lower only if there are power consumption concerns.
GPU Boost is a relatively simple way to increase performance by up to 30%. It is certainly easier than optimizing a CUDA kernel. Unless you are doing some crazy multi-node MPI application, always boost your clock speed for higher performance!
- GPU Boost on NVIDIA’s Tesla K40 GPU
- NVIDIA Tesla/Quadro和GeForce GPU比较
- CUDA on NVIDIA GPU
- caffe-Cuda7.5-cudnnv4-GPU-NugetPackages-Tesla k40-VS2013-Anaconda2-pycharm2016.2 win10
- NVIDIA之Tesla、GeForce和Quadro系列GPU对比
- 谷歌TPU吊打GPU?NVIDIA不服:Tesla P40怒怼
- Tesla GPU 如何关闭 ECC
- 谷歌说TPU比GPU更牛,Nvidia表示不服,并朝谷歌扔了一块Tesla V100
- NVIDIA-GPU-CUDA
- NVIDIA GPU分类
- Nvidia GPU architecture笔试
- 转载:NVIDIA GPU结构
- NVIDIA GPU 2016
- Nvidia GPU架构演变
- 加速GPU,加速NVIDIA
- nvidia GPU 性能查看
- GPU Powered DeepLearning with NVIDIA DIGITS on EC2
- Installing Nvidia CUDA on Ubuntu 14.04 for Linux GPU Computing
- Socket
- 基础知识(三)-JSP
- LeetCode 88. Merge Sorted Array
- 必须了解的基础的 Linux 网络命令
- SSM + Shiro 整合 (7)- 认证和授权部分缓存的添加
- GPU Boost on NVIDIA’s Tesla K40 GPU
- 2.1.5 观察者模式
- swift学习笔记 - 位移枚举的按位或运算
- 使用 GPU 加速计算
- 在js中遍历JSON数据
- 获取窗口、控件等指针。(未完待续)
- python标准模块学习11_______hmac模块
- 网络编程(12)—— 利用wait和waitpid函数消毁僵尸进程
- 【NOIP2000】乘积最大