hadoop mapreduce包含阶段介绍
来源:互联网 发布:4fang软件论坛 编辑:程序博客网 时间:2024/04/29 02:36
1 hadoop map reduce阶段介绍
1)mapper:maps input key/value pairs to a set of intermediate key/value pairs
2 reducer:reduces a set of intermediate values while share a key to a smaller set of values
1)shuffle(洗牌):input to the reducer is the sorted output of the mappers.In this phase the framework fetches the relevant(相关的) partition of the output of all the mappers,via http.
2)sort:the framework groups reducer inputs by keys(since different mappers may have output the same key)in this stage.
the shuffle and sort phases occur simultaneous;while map-ouputs are being fetched they are merged.
3)secondary sort:If equivalence rules for grouping the intermediate keys are required to be different from those for grouping keys before reduction, then one may specify a Comparator via JobConf.setOutputValueGroupingComparator(Class). Since JobConf.setOutputKeyComparatorClass(Class) can be used to control how intermediate keys are grouped, these can be used in conjunction to simulate secondary sort on values.
4)reduce:in this phase the reduce method is called for each<key,(list of values)> pair in the grouped inputs.
1)mapper:maps input key/value pairs to a set of intermediate key/value pairs
2 reducer:reduces a set of intermediate values while share a key to a smaller set of values
1)shuffle(洗牌):input to the reducer is the sorted output of the mappers.In this phase the framework fetches the relevant(相关的) partition of the output of all the mappers,via http.
2)sort:the framework groups reducer inputs by keys(since different mappers may have output the same key)in this stage.
the shuffle and sort phases occur simultaneous;while map-ouputs are being fetched they are merged.
3)secondary sort:If equivalence rules for grouping the intermediate keys are required to be different from those for grouping keys before reduction, then one may specify a Comparator via JobConf.setOutputValueGroupingComparator(Class). Since JobConf.setOutputKeyComparatorClass(Class) can be used to control how intermediate keys are grouped, these can be used in conjunction to simulate secondary sort on values.
4)reduce:in this phase the reduce method is called for each<key,(list of values)> pair in the grouped inputs.
0 0
- hadoop mapreduce包含阶段介绍
- 关于hadoop mapreduce的job cleanup阶段
- Hadoop学习(9)----MapReduce阶段的Split阶段
- hadoop概念-MapReduce各个执行阶段及Shuffle过程详解
- hadoop-mapreduce-example中实例介绍
- 【Hadoop】MapReduce笔记(三):MapReduce的Shuffle和Sort阶段详解
- hadoop MapReduce程序 不包含Reduce的设置
- Hadoop运行包含第三方依赖库的MapReduce作业
- 精通HADOOP(二) - 初识Hadoop - MapReduce模型介绍
- hadoop 子项目hadoop-mapreduce-examples的简单介绍
- MapReduce的Shuffle阶段
- 第二代MapReduce阶段解析
- MapReduce的shuffle阶段
- Hadoop 中的 MapReduce链接作业之预处理和后处理阶段的链接
- hadoop执行mapreduce任务,能够map,不能reduce,Shuffle阶段报错
- MapReduce&hadoop
- Hadoop MapReduce
- Hadoop MapReduce
- 哈佛结构和冯·诺依曼结构
- 提高Java代码质量的Eclipse插件之Checkstyle的使用详解
- 判断单链表里面有没有环
- java.lang.ClassNotFoundException: org.quartz.DisallowConcurrentExecution
- CAN的中断管理
- hadoop mapreduce包含阶段介绍
- 思考1:项目制在创业公司实行的思索
- 泛函编程(13)-无穷数据流-Infinite Stream
- 广西火龙果苗批发
- php文件操作错误代码说明
- ios 禁用 侧边手势滑动返回功能
- 密码学简单介绍
- javascript的Date时差
- 【UVa12167】 Proving Equivalences 【强连通分量】