Spark WordCount
来源:互联网 发布:淘宝优惠券网站怎么建 编辑:程序博客网 时间:2024/05/16 11:08
1.例子
import org.apache.spark.{SparkConf, SparkContext}object WordCount { def main(args: Array[String]) { //非常重要,是通向Spark集群的入口 val conf = new SparkConf().setAppName("WC") val sc = new SparkContext(conf) sc.textFile(args(0)).flatMap(_.split(" ")).map((_, 1)) .reduceByKey(_+_).sortBy(_._2, false).saveAsTextFile(args(1)) sc.stop() }}
2.分析有几个RDD
import org.apache.spark.{SparkConf, SparkContext}object WordCount { def main(args: Array[String]) { //非常重要,是通向Spark集群的入口 val conf = new SparkConf().setAppName("WC") .setJars(Array("C:\\HelloSpark\\target\\hello-spark-1.0.jar")) .setMaster("spark://node-1.itcast.cn:7077") val sc = new SparkContext(conf) //textFile会产生两个RDD:HadoopRDD -> MapPartitinsRDD sc.textFile(args(0)).cache() // 产生一个RDD :MapPartitinsRDD .flatMap(_.split(" ")) //产生一个RDD MapPartitionsRDD .map((_, 1)) //产生一个RDD ShuffledRDD .reduceByKey(_+_) //产生一个RDD: mapPartitions .saveAsTextFile(args(1)) sc.stop() }}
阅读全文
0 0
- spark-wordcount
- Spark-wordcount
- wordcount spark...
- wordCount spark
- spark wordcount
- Spark WordCount
- spark wordcount
- Spark WordCount
- spark wordcount
- spark wordcount
- Spark流处理(WordCount)
- spark入门之wordcount
- spark如何wordcount中文
- Spark入门-WordCount
- Spark之WordCount
- 007-spark的wordCount
- Spark学习1-wordcount
- spark streaming wordcount
- CodeForces 732D Exams
- 给自己的
- 黑莓QNX和Wind River争夺汽车软件市场-QNX市场预估
- hdu 5719Substring(后缀数组)
- 穷举
- Spark WordCount
- Android studio查看SQlite数据库
- UVA
- Java 使用commons集驱动包+Servlet类实现简单的上传文件到本地!推荐
- Qt一步步搭建TcpServer4——Client的封装与网络库的使用
- 套路题
- CodeForces
- OpenCV3.1与VS2013配置教程记录(64位win7旗舰版)
- vijos 宿命的PSS