Splitting data into partitions in Scala

Source: Internet  Editor: 程序博客网  Date: 2024/05/16 08:06
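import scala.collection.mutable.ArrayBuffer
// Needed for toDF; assumes a SparkSession named `spark`, as in spark-shell.
import spark.implicits._

// Collect the elements of a single partition into a buffer.
def bufferOfPartition(input: Iterator[Int]): ArrayBuffer[Int] = {
  val a = ArrayBuffer[Int]()
  input.foreach { elem => a += elem }
  a
}

val list = 1 to 100
// Distribute the range across 10 partitions.
val rdd = sc.parallelize(list, 10)
// Emit one buffer per partition.
val result = rdd.mapPartitions(partition => Iterator(bufferOfPartition(partition)))
// Build the DataFrames on the driver after collect(): a SparkSession is not
// usable inside executor tasks, so calling toDF inside mapPartitions fails.
result.collect().map(_.toSeq.toDF).foreach(println)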
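
The `sc.parallelize(list, 10)` call above splits the range 1 to 100 into 10 partitions. To see what such a split looks like without starting Spark, here is a minimal plain-Scala sketch. `SliceSketch` and `slice` are hypothetical names introduced for illustration, not Spark APIs. The even-boundary indexing is an assumption that approximates how Spark slices a local collection; on a real RDD, `rdd.glom().collect()` shows the actual contents of each partition.

```scala
// Hypothetical helper (not a Spark API): split a sequence into numSlices
// contiguous chunks using evenly spaced index boundaries.
object SliceSketch {
  def slice[T](data: IndexedSeq[T], numSlices: Int): Seq[IndexedSeq[T]] = {
    require(numSlices >= 1, "numSlices must be positive")
    val n = data.length.toLong
    (0 until numSlices).map { i =>
      // Chunk i covers the indices [i * n / numSlices, (i + 1) * n / numSlices).
      val start = (i * n / numSlices).toInt
      val end = ((i + 1) * n / numSlices).toInt
      data.slice(start, end)
    }
  }

  def main(args: Array[String]): Unit = {
    // Same input as the snippet above: 1 to 100 in 10 slices.
    val parts = slice(1 to 100, 10)
    parts.zipWithIndex.foreach { case (p, i) =>
      println(s"partition $i: ${p.head} to ${p.last} (${p.size} elements)")
    }
  }
}
```

With 100 elements and 10 slices, every chunk holds 10 consecutive numbers, so the first line printed is `partition 0: 1 to 10 (10 elements)`. When the length is not divisible by the slice count, the chunk sizes differ by at most one.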