CUDA Pro Tip: Write Flexible Kernels with Grid-Stride Loops

Source: Internet  Editor: 程序博客网  Time: 2024/06/08 01:30


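I did not bother translating the whole article. In short, it explains how to write kernels with a grid-stride loop.
A kernel without a grid-stride loop looks like this: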

    __global__ void kernel(int n)
    {
        // One element per thread: the grid must contain at least n threads.
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i < n) {
            ....
        }
    }

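With a grid-stride loop:

    __global__ void kernel(int n)
    {
        // Each thread starts at its global index and advances by the
        // total number of threads in the grid until it passes n.
        for (int i = blockIdx.x * blockDim.x + threadIdx.x;
             i < n;
             i += blockDim.x * gridDim.x)
        {
            ...
        }
    }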
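To make the pattern concrete, here is a minimal sketch of a complete program built around a grid-stride loop. The SAXPY operation, the array size, the launch configuration of 32 blocks of 256 threads, and the use of unified memory are all assumptions chosen for illustration and are not taken from the text above. The grid deliberately has far fewer threads than there are elements, so each thread handles several of them.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Grid-stride SAXPY: y[i] = a * x[i] + y[i] for every i in [0, n).
__global__ void saxpy(int n, float a, const float *x, float *y)
{
    for (int i = blockIdx.x * blockDim.x + threadIdx.x;
         i < n;
         i += blockDim.x * gridDim.x)
    {
        y[i] = a * x[i] + y[i];
    }
}

int main()
{
    const int n = 1 << 20;  // illustrative size (assumption)
    float *x = nullptr;
    float *y = nullptr;

    // Unified memory keeps the sketch short; explicit copies work as well.
    if (cudaMallocManaged(&x, n * sizeof(float)) != cudaSuccess ||
        cudaMallocManaged(&y, n * sizeof(float)) != cudaSuccess) {
        fprintf(stderr, "allocation failed\n");
        return 1;
    }
    for (int i = 0; i < n; ++i) {
        x[i] = 1.0f;
        y[i] = 2.0f;
    }

    // 32 * 256 = 8192 threads for 1048576 elements:
    // the loop makes each thread process 128 elements.
    saxpy<<<32, 256>>>(n, 2.0f, x, y);
    cudaError_t err = cudaGetLastError();
    if (err == cudaSuccess) {
        err = cudaDeviceSynchronize();
    }
    if (err != cudaSuccess) {
        fprintf(stderr, "CUDA error: %s\n", cudaGetErrorString(err));
        return 1;
    }

    // Every element should now be 2 * 1 + 2 = 4.
    int mismatches = 0;
    for (int i = 0; i < n; ++i) {
        if (y[i] != 4.0f) {
            ++mismatches;
        }
    }
    printf("mismatches: %d\n", mismatches);

    cudaFree(x);
    cudaFree(y);
    return mismatches == 0 ? 0 : 1;
}
```

With the one-element-per-thread form, the same launch would leave every element past index 8191 untouched, because no thread would ever reach it.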

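The article lists three advantages of this approach:
1. Scalability and thread reuse: the kernel is correct for any n, even when the grid has fewer threads than elements, and each thread is reused for several elements.
2. Debugging: launching a single block with a single thread makes the kernel run serially, which makes it easier to verify.
3. Portability and readability: the loop reads much like the original sequential loop.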
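The first two advantages can be sketched by launching one unchanged kernel with two very different configurations. In the sketch below, the SAXPY kernel, the array size, the input values, and the factor of 32 blocks per multiprocessor are illustrative assumptions. `cudaGetDevice` and `cudaDeviceGetAttribute` with `cudaDevAttrMultiProcessorCount` are CUDA runtime calls used to size the grid to the device.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Same grid-stride kernel for both launches below.
__global__ void saxpy(int n, float a, const float *x, float *y)
{
    for (int i = blockIdx.x * blockDim.x + threadIdx.x;
         i < n;
         i += blockDim.x * gridDim.x)
    {
        y[i] = a * x[i] + y[i];
    }
}

int main()
{
    const int n = 1 << 16;  // illustrative size (assumption)
    float *x = nullptr;
    float *ySerial = nullptr;
    float *yParallel = nullptr;

    if (cudaMallocManaged(&x, n * sizeof(float)) != cudaSuccess ||
        cudaMallocManaged(&ySerial, n * sizeof(float)) != cudaSuccess ||
        cudaMallocManaged(&yParallel, n * sizeof(float)) != cudaSuccess) {
        fprintf(stderr, "allocation failed\n");
        return 1;
    }
    for (int i = 0; i < n; ++i) {
        x[i] = static_cast<float>(i);
        ySerial[i] = 1.0f;
        yParallel[i] = 1.0f;
    }

    // Debugging: one block with one thread walks the array in order,
    // exactly like a sequential loop.
    saxpy<<<1, 1>>>(n, 2.0f, x, ySerial);

    // Scalability: size the grid from the number of multiprocessors
    // instead of from n. The factor 32 is an assumption, not a tuned value.
    int device = 0;
    int numSMs = 0;
    cudaGetDevice(&device);
    cudaDeviceGetAttribute(&numSMs, cudaDevAttrMultiProcessorCount, device);
    saxpy<<<32 * numSMs, 256>>>(n, 2.0f, x, yParallel);

    cudaError_t err = cudaGetLastError();
    if (err == cudaSuccess) {
        err = cudaDeviceSynchronize();
    }
    if (err != cudaSuccess) {
        fprintf(stderr, "CUDA error: %s\n", cudaGetErrorString(err));
        return 1;
    }

    // Both launches should produce identical results.
    int mismatches = 0;
    for (int i = 0; i < n; ++i) {
        if (ySerial[i] != yParallel[i]) {
            ++mismatches;
        }
    }
    printf("mismatches between serial and parallel runs: %d\n", mismatches);

    cudaFree(x);
    cudaFree(ySerial);
    cudaFree(yParallel);
    return mismatches == 0 ? 0 : 1;
}
```

The serial launch is slow but removes any dependence on thread scheduling, so it is a convenient reference when checking the parallel result.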

In my opinion, the first two are the main reasons.
