GPGPU-Sim 源码阅读笔记 之 cache option默认策略
来源:互联网 发布:打车软件不合法 编辑:程序博客网 时间:2024/06/05 19:21
code 片段
//cuda-sim.cc line 855//in void ptx_instruction::pre_decode()... switch( m_cache_option ) { case CA_OPTION: cache_op = CACHE_ALL; break; case CG_OPTION: cache_op = CACHE_GLOBAL; break; case CS_OPTION: cache_op = CACHE_STREAMING; break; case LU_OPTION: cache_op = CACHE_LAST_USE; break; case CV_OPTION: cache_op = CACHE_VOLATILE; break; case WB_OPTION: cache_op = CACHE_WRITE_BACK; break; case WT_OPTION: cache_op = CACHE_WRITE_THROUGH; break; default: if( m_opcode == LD_OP || m_opcode == LDU_OP ) cache_op = CACHE_ALL; else if( m_opcode == ST_OP ) cache_op = CACHE_WRITE_BACK; else if( m_opcode == ATOM_OP ) cache_op = CACHE_GLOBAL; break; }
分析
1. 很容易看出,其默认策略 是 ld.ca 和 st.wb
2. 这也是在 英伟达 PTX ISA 手册提到的:
cache operator
PTX ISA version 2.0 introduced optional cache operators on load and store instructions.
The cache operators require a target architecture of sm_20 or higher.
Cache operators on load or store instructions are treated as performance hints only.
The use of a cache operator on an ld or st instruction does not change the memory
consistency behavior of the program.
For sm_20 and higher, the cache operators have the following definitions and behavior.
0 0
- GPGPU-Sim 源码阅读笔记 之 cache option默认策略
- GPGPU-Sim 之 block调度
- GPGPU-Sim学习-如何禁用GPGPU-Sim内的L1 cache
- GPGPU-SIM 之 单步编译 BENCHMARK
- 阅读源码策略之我见
- 编译安装gpgpu-sim
- GPGPU-SIM Code Study
- gpgpu-sim/remulator 安装
- GIT 源码阅读之 update-cache
- GPGPU-Sim学习(代码解读)-如何实现Bypass L1D cache
- Spark源码阅读笔记:Spark的数据系统之cache篇
- Bucket Cache源码阅读
- GPGPU-Sim学习(一)-GPGPU-Sim介绍
- GPGPU-Sim(番外)-GPGPU-Sim的模拟机制
- Integrated gem5 + GPGPU-Sim Simulator
- GPGPU-Sim学习(二)搭建GPGPU-Sim环境(ubuntuServer 10.04 安装GPGPU-Sim)
- 《Spring源码深度解析》阅读笔记5-默认标签的解析之bean标签的解析及注册
- GPGPU-SIM的安装 OSX10.9
- [01]项目实战-PC 端固定布局(1)
- Java NIO系列教程(二) Channel
- Sublime Text2安装SublimeCodeIntel
- 练习
- Java虚拟机学习(7):对象内存分配与回收
- GPGPU-Sim 源码阅读笔记 之 cache option默认策略
- android 学习笔记11-多媒体
- 设计模式之18 - 备忘录模式Memento
- Uva1572 Self-Assembly 【建立图论模型拓扑排序】【例题6-19】
- JFinal数据库配置
- xlog接入方案
- Stanford : cs106a 编程方法学课程 Karel程序 编译环境配置
- 57.贝赛尔曲线初步(二) - 高仿QQ未读消息气泡拖拽黏连效果
- 设计模式之19 - 观察者模式Observer