Flink学习笔记 --- DataStream Transformations
来源:互联网 发布:linux tomcat 宕机 编辑:程序博客网 时间:2024/06/05 21:05
Stream Transformations1.Map:DataSteam -> DataSteamdataSteam.map {x => x * 2}2.FlatMapDataStream -> DataStreamdataStream.flatMap {str => str.split(" ")}3.FilterDataStream -> DataStreamdataStream.filter { _ != 0 }4.KeyByDataStream -> KeyedStreamdataStream.keyBy("someKey")dataStream.keyBy("0")5.ReduceKeyedStream -> DataStreamkeyedStream.reduce { _ + _ }6.FoldkeyedStream -> DataSyteamval result: DataStream[String] = keyedStream.fold("start")((str, i)) => { str + "-" + i })7.AggregationKeyedStream -> DataStreamkeyedStream.sum(0)KeyedStream.sum("key")keyedStream.min(0)keyedStream.min("key")keyedStream.max(0)keyedStream.max("key")keyedStream.minBy(0)keyedStream.minBy("key")keyedStream.maxBy(0)keyedStream.maxBy("key")8.WindowkeyedSteam -> windowedStreamdataStream.keyBy(0).window(TumblingEventTimeWindow.of(Time.second(5)))9.WindowALlDataStream -> AllWindowedStreamdataSteam.windowAll(TumblingEventTimeWindows.of(Time.seconds(5)))10.WindowApplyWindowStream -> DataSteamAllWindowStream -> DataStreamwindowedStream.apply { WindowFunction }allWindowSteam.apply { AllWindowFunction }11.Window ReduceWindowedStream -> DataStreamwindowedStream.reduce { _ + _ }12.Window FoldWindowedStream -> DataStreamval result: DataStream[String] = windowedStream.fold("start", (str, 1) => { str + "_" + i })13.Aggregation on windowsWindowedStream -> DataStreamwindowedStream.sum(0)windowedStream.sum("key")windowedStream.min(0)windowedStream.min("key")windowedStream.max(0)windowedStream.max("key")windowedStream.minBy(0)windowedStream.minBy("key")windowedStream.maxBy(0)windowedStream.maxBy("key")14.UnionDataStream* -> DataStreamdataStream.union(otherStream1, otherStream2, ...)15.Window JoinDataStream, DataStream -> DataStreamdataStream.join(otherStream).where(<key selector>).equalTo(<key selector>).window(TumblingEventWindows.of(Time.second(3))).apply {...}16.Window CoGroupDataStrea, DataStream -> DataStreamdataStream.coGroup(otherStream).where(0).equalTo(1).window(TumblingEventWindows.of(Time.seconds(3))).apply {}17.connectDataStream, DataStream -> ConnectedStreamsomeStream: DataStream[Int] = //otherStream: DataStream[String] = //val connectedStreams = someStream.connect(otherStream)18.CoMap, CoFlatMapConnectedStream -> DataStreamconnectedStream.map( (_:Int) => true, (_:String) => false)connectedStreams.flatMap( (_:Int) => true, (_:String) => false)19.SplitDataStream -> SplitStreamval split = someDataStream.split((num: Int) => (num % 2) match {case 0 => List("even") case 1 => List("odd") })20.SelectSplitStream -> DataStreamval even = split select "even"val odd = split select "odd"val all = split select("even", "odd")21.IterateDataStream -> iterateStream -> DataStreaminitialStream.iterate { iteration => { val iterationBody = iteration.map { } (iterationBody.filter(_ > 0), iterationBody.filter( _ <= 0)) }}22.Extract TimestampsDataStream -> DataStreamstream.assignTimestamps { timestampExtractor }
阅读全文
0 0
- Flink学习笔记 --- DataStream Transformations
- Flink学习笔记 --- 理解DataStream WordCount
- Flink学习:DataStream和InputFormat
- Flink DataStream API Programming Guide学习&译文(未完待续)
- 《Flink学习笔记一》
- Flink学习笔记 --- Flink中Windows机制
- Flink学习笔记:1、Flink快速入门
- Flink学习笔记:2、Flink介绍
- flink 的datastream的作业提交问题
- Flink学习笔记 --- Flink本地(Loacl模式)安装
- Flink学习笔记:3、Flink分布式模式(Standalone)
- Flink学习笔记 --- Basic Concepts整理笔记
- Flink Basic API Concepts 学习笔记&译文
- Flink学习笔记 --- DataSet Source and Sink
- Flink学习笔记 --- Intellij自动导入
- Flink学习笔记 --- 理解ConnectedStream 与 Union
- Flink学习笔记 --- 理解DataSet WordCount
- Flink学习笔记 --- 研究 FlinkForward2017 源码
- SqlSession中的OpenSession到底做了什么?
- 迪米特法则(Law of Demeter)与领域模型行为
- ApplicationContext的三种实现方式以及在web.xml配置的两种方式
- C# Anchor和Dock属性
- Activity 进阶
- Flink学习笔记 --- DataStream Transformations
- Linux下查看内存使用情况方法总结
- Systrace工具的使用
- Facebook SDE onsite面经
- c++尝试写一个代理类
- Android系统典型bootloader分析
- 将Eclipse代码导入到AndroidStudio的两种方式
- 机器学习与深度学习(五) 回归分析(regression analysis)
- 浅复制