CUDA deviceQuery Parameters Explained
Source: Internet | Editor: 程序博客网 | Date: 2024/06/05 14:11
Running deviceQuery from the CUDA samples gives the following output:
C:\ProgramData\NVIDIA Corporation\CUDA Samples\v8.0\1_Utilities\deviceQuery\../../bin/win64/Debug/deviceQuery.exe Starting...

 CUDA Device Query (Runtime API) version (CUDART static linking)

Detected 1 CUDA Capable device(s)

Device 0: "GeForce GTX 965M"
  CUDA Driver Version / Runtime Version          8.0 / 8.0
  CUDA Capability Major/Minor version number:    5.2
  Total amount of global memory:                 2048 MBytes (2147483648 bytes)
  ( 8) Multiprocessors, (128) CUDA Cores/MP:     1024 CUDA Cores
  GPU Max Clock rate:                            1150 MHz (1.15 GHz)
  Memory Clock rate:                             2505 Mhz
  Memory Bus Width:                              128-bit
  L2 Cache Size:                                 1048576 bytes
  Maximum Texture Dimension Size (x,y,z)         1D=(65536), 2D=(65536, 65536), 3D=(4096, 4096, 4096)
  Maximum Layered 1D Texture Size, (num) layers  1D=(16384), 2048 layers
  Maximum Layered 2D Texture Size, (num) layers  2D=(16384, 16384), 2048 layers
  Total amount of constant memory:               65536 bytes
  Total amount of shared memory per block:       49152 bytes
  Total number of registers available per block: 65536
  Warp size:                                     32
  Maximum number of threads per multiprocessor:  2048
  Maximum number of threads per block:           1024
  Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
  Max dimension size of a grid size    (x,y,z): (2147483647, 65535, 65535)
  Maximum memory pitch:                          2147483647 bytes
  Texture alignment:                             512 bytes
  Concurrent copy and kernel execution:          Yes with 2 copy engine(s)
  Run time limit on kernels:                     Yes
  Integrated GPU sharing Host Memory:            No
  Support host page-locked memory mapping:       Yes
  Alignment requirement for Surfaces:            Yes
  Device has ECC support:                        Disabled
  CUDA Device Driver Mode (TCC or WDDM):         WDDM (Windows Display Driver Model)
  Device supports Unified Addressing (UVA):      Yes
  Device PCI Domain ID / Bus ID / location ID:   0 / 1 / 0
  Compute Mode:
     < Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >

deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 8.0, CUDA Runtime Version = 8.0, NumDevs = 1, Device0 = GeForce GTX 965M
Result = PASS
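"Maximum number of threads per block" (line 26 of the original listing): a single thread block can contain at most 1024 threads.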