sparkshell里的wordcount

来源:互联网 发布:中文域名怎么样 编辑:程序博客网 时间:2024/04/30 22:27
val rdd =sc.textFile("hdfs://localhost.localdomain:9000/input/test")rdd.countval wordcount = rdd.flatMap(_.split(' ')).map((_,1)).reduceByKey(_+_) wordcount.collect #keypaixu wordcount.sortByKey(false) wordsort.collect #cishiupaixu rdcount.map(x=>(x._2,x._1)).sortByKey(true).map(x=>(x._2,x._1))collec
0 0
原创粉丝点击