Spark Function Walkthrough: Article Series

Source: Internet | Published by: 网络封包抓取工具 | Editor: 程序博客网 | Time: 2024/05/29 07:04

aggregate
aggregateByKey
cache
cartesian
checkpoint
coalesce
cogroup
groupWith

collect, toArray
collectAsMap
combineByKey
compute
context, sparkContext
count
countApprox
countByKey
countByKeyApprox
countByValue
countByValueApprox
countApproxDistinct
countApproxDistinctByKey
dependencies
distinct
first
filter
filterWith
flatMap
flatMapValues
flatMapWith
fold
foldByKey
foreach
foreachPartition
foreachWith
generator, setGenerator
getCheckpointFile
preferredLocations
getStorageLevel
glom
groupBy
groupByKey
histogram
id
intersection
isCheckpointed
iterator
join
keyBy
keys
leftOuterJoin
lookup
map
mapPartitions
mapPartitionsWithContext
mapPartitionsWithIndex
mapPartitionsWithSplit
mapValues
mapWith
max
mean, meanApprox
min
name, setName
partitionBy
partitioner
partitions
persist, cache
pipe
randomSplit
reduce
reduceByKey, reduceByKeyLocally, reduceByKeyToDriver
rightOuterJoin
sample
saveAsHadoopFile, saveAsHadoopDataset, saveAsNewAPIHadoopFile
saveAsObjectFile
saveAsSequenceFile
saveAsTextFile
stats
sortBy
sortByKey
stdev, sampleStdev
subtract
subtractByKey
sum, sumApprox
take
takeOrdered
takeSample
toDebugString
toJavaRDD
top
toString
union, ++
unpersist
values
variance, sampleVariance
zip
zipPartitions
zipWithIndex
zipWithUniqueId

0 0