Several practical issues for CUDA
来源:互联网 发布:模式识别算法应用 编辑:程序博客网 时间:2024/06/03 15:05
GPU rocks, indeed. But its application is kinda like steering a wild horse. Not being familiar with it may make you crazy.
1. Take care of the computing abilities among different versions of CUDA
The differences among different versions of CUDA are huge, because CUDA is growthing rapidly. Before starting your develop, you have to refer to the corresponding GPU manual. – How many SM it has? Global memory lock supported? etc.
2. A good parallel design is essential
Never write your CUDA kernel in a ‘scattering’ way (one read & many writes), which will bring you quite a lot of bank conflict. Always write the kernel in a ‘gathering’ way(many reads & one write).
3. Seperate CPU code with GPU code
Can u believe that nvcc in emulation mode will not separate CPU/GPU code, while non-emulation mode nvcc does ? Take care of it dude. Also, it seems that MACRO is the only way for param sharing between CPU code and GPU code. Also, the only OO component you can use is ‘struct’.
In my current work, GPU helps my rendering rate be accelerated nearly 100 times faster !
- Several practical issues for CUDA
- Clocking Issues for Synthesis
- upload several sheet for sap
- Several Extension Methods for Char
- Several Rules for design pattern
- Outlook 2007 Issues for Developers
- [MS Smartphone]Several solutions for MIDI playback
- several frequently used directives for apache server
- several methods for uploading files to server
- Some practical rules for WBS
- SQL: Practical Guide for Developers
- Java: Practical Guide for Programmers
- SQL: Practical Guide for Developers
- Practical .NET for Financial Markets
- Practical Ruby for System Administration
- Practical Strategies for your Career
- Issues
- Three issues for my Android phone
- LoadRunner压力测试实例
- 用KeyTool生成安全证书
- web测试知识--iis错误列表
- 求电话号码的最大连号数(嵌套循环实例)
- CWE/SANS评出25种最危险的编程错误
- Several practical issues for CUDA
- VMware workstation 安装moblin2.1
- 如何将sql2000数据备份文件(,bak)还原为sql2005文件
- 记忆中的计算机-9 (自己的Celeron)
- 记忆中的计算机-10 (DELL Pentium III)
- 验证字符串是否为有符号整数
- 今年6月MTK有好消息哦。
- 三、Qt Creator登录对话框
- compile anjuta