spark-0.8.0源码剖析storage
来源:互联网 发布:阿里健康 大数据应用 编辑:程序博客网 时间:2024/04/29 11:56
BlockManagerMasterActor和BlockManagerSlaveActor进行管理和通信
1DiskStore
此处为配置spark.local.dir中的机制,每个块为一个文件并且根据块号哈希进哪个文件夹中
private def getFile(blockId: String): File = {
logDebug("Getting file for block " + blockId)
// Figure out which local directory it hashes to, and which subdirectory in that
val hash = Utils.nonNegativeHash(blockId)
val dirId = hash % localDirs.length
val subDirId = (hash / localDirs.length) % subDirsPerLocalDir
2storagelevel可以了解数据存储 的 类型和搭配
class StorageLevel private(
private var useDisk_ : Boolean,
private var useMemory_ : Boolean,
private var deserialized_ : Boolean,
private var replication_ : Int = 1)
extends Externalizable {
// TODO: Also add fields for caching priority, dataset ID, and flushing.
private def this(flags: Int, replication: Int) {
this((flags & 4) != 0, (flags & 2) != 0, (flags & 1) != 0, replication)
}
def this() = this(false, true, false) // For deserialization
def useDisk = useDisk_
def useMemory = useMemory_
def deserialized = deserialized_ //是否序列化
def replication = replication_ //副本数
4
- spark-0.8.0源码剖析storage
- spark-0.8.0源码剖析--主流程
- spark-0.8.0源码剖析-分区Partitioner
- spark源码分析-storage
- spark-storage模块源码分析
- spark源码剖析-总论
- Apache Spark源码剖析
- Spark 源码剖析
- Spark RPC源码剖析
- Spark RDD源码剖析
- Spark源码解析——Storage模块
- Spark源码分析之-Storage模块
- Spark源码走读5——Storage
- Spark源码分析之-Storage模块
- spark源码剖析之----Shuffle
- spark源码剖析之----Partitioner
- Spark读取配置源码剖析
- spark调度流程源码剖析
- 基类指针指向派生类对象
- PHP实现RSA算法对数据加密解密
- c# winform 关于DataGridView的一些操作
- Swing —— Box容器组件
- python 得到HTML指定ID的内容
- spark-0.8.0源码剖析storage
- [cocos2d-x 学习] CCActionInterval(持续动作) 子类 CCMoveTo与CCMoveBy 之区别
- 编程趣味题3
- 嵌入式学习之路(二)——Unix命令的学习(1)
- YSJ_FSM状态机软件框架分析
- Windows8.1 VMware与Hyper-V共存
- learning to rank资料汇总
- cocos2d-x 网络编程二(cURL简单模拟登录的实现)
- 主动寻求改变