CUDA Optimization tips
Source: Internet  Editor: 程序博客网  Date: 2024/06/05 08:26
Excerpted from "CUDA C Best Practices"
1. To maximize developer productivity, profile the application to determine hotspots and bottlenecks.
2. To get the maximum benefit from CUDA, focus first on finding ways to parallelize sequential code.
3. Use the effective bandwidth of your computation as a metric when measuring performance and optimization benefits.
4. Minimize data transfer between the host and the device, even if it means running some kernels on the device that do not show performance gains when compared with running them on the host CPU.
5. When data must be transferred between the host and the device, use page-locked (pinned) host memory to achieve higher bandwidth.
6. Ensure global memory accesses are coalesced whenever possible.
7. Non-unit-stride global memory accesses should be avoided whenever possible.
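
Tips 3, 5, 6 and 7 can be tried out together in one small program. The following is only a minimal sketch, not code from the guide: the kernel name strided_copy, the helper effective_bandwidth_gbs, the array size and the block size of 256 are all assumptions chosen for illustration. It stages data through pinned host memory, times a copy kernel with CUDA events, and reports effective bandwidth as (bytes read + bytes written) / 1e9 / seconds for several strides. A stride of 1 gives coalesced accesses; larger strides would normally be expected to show lower effective bandwidth, but the actual numbers depend on the GPU.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Effective bandwidth in GB/s: (bytes read + bytes written) / 1e9 / seconds.
// Hypothetical helper name, introduced for this sketch only.
static double effective_bandwidth_gbs(double bytes_read, double bytes_written,
                                      double seconds) {
    return (bytes_read + bytes_written) / 1e9 / seconds;
}

// Each thread copies one element located `stride` elements after its
// neighbour's. stride == 1 means adjacent threads touch adjacent addresses
// (coalesced); stride > 1 is a non-unit-stride access pattern.
__global__ void strided_copy(const float* in, float* out, int n, int stride) {
    int i = (blockIdx.x * blockDim.x + threadIdx.x) * stride;
    if (i < n) out[i] = in[i];
}

int main() {
    const int n = 1 << 24;                 // assumed problem size
    const size_t bytes = n * sizeof(float);
    const int block = 256;                 // assumed block size

    // Tip 5: page-locked (pinned) host buffer for the host-to-device copy.
    float* h_pinned = nullptr;
    cudaMallocHost(&h_pinned, bytes);
    for (int i = 0; i < n; ++i) h_pinned[i] = 1.0f;

    float* d_in = nullptr;
    float* d_out = nullptr;
    cudaMalloc(&d_in, bytes);
    cudaMalloc(&d_out, bytes);
    cudaMemcpy(d_in, h_pinned, bytes, cudaMemcpyHostToDevice);

    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);

    // Warm-up launch so the first timed run does not include startup cost.
    strided_copy<<<n / block, block>>>(d_in, d_out, n, 1);
    cudaDeviceSynchronize();

    for (int stride = 1; stride <= 32; stride *= 2) {
        int threads = n / stride;          // elements actually copied
        int grid = (threads + block - 1) / block;

        cudaEventRecord(start);
        strided_copy<<<grid, block>>>(d_in, d_out, n, stride);
        cudaEventRecord(stop);
        cudaEventSynchronize(stop);

        float ms = 0.0f;
        cudaEventElapsedTime(&ms, start, stop);

        // Tip 3: count only the bytes the kernel was asked to move.
        double moved = (double)threads * sizeof(float);
        printf("stride %2d: %.3f ms, %.1f GB/s effective\n", stride, ms,
               effective_bandwidth_gbs(moved, moved, ms / 1e3));
    }

    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    cudaFree(d_in);
    cudaFree(d_out);
    cudaFreeHost(h_pinned);
    return 0;
}
```

Return codes from the CUDA runtime calls are not checked here to keep the sketch short; real code should check them. Assuming the file is saved as bandwidth.cu (a hypothetical name), it can be built with nvcc bandwidth.cu -o bandwidth.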
To be continued...