Flink学习笔记 --- 理解DataSet WordCount
来源:互联网 发布:mac改壁纸 编辑:程序博客网 时间:2024/06/01 09:15
POM.xml文件:
<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/maven-v4_0_0.xsd"> <modelVersion>4.0.0</modelVersion> <groupId>zetyun</groupId> <artifactId>FlinkWordCounts</artifactId> <version>1.0-SNAPSHOT</version> <inceptionYear>2008</inceptionYear> <properties> <scala.version>2.11.0</scala.version> </properties> <dependencies> <dependency> <groupId>org.scala-lang</groupId> <artifactId>scala-library</artifactId> <version>${scala.version}</version> </dependency> <!-- https://mvnrepository.com/artifact/org.apache.flink/flink-core --> <dependency> <groupId>org.apache.flink</groupId> <artifactId>flink-core</artifactId> <version>1.3.0</version> </dependency> <!-- https://mvnrepository.com/artifact/org.apache.flink/flink-clients_2.11 --> <dependency> <groupId>org.apache.flink</groupId> <artifactId>flink-clients_2.11</artifactId> <version>1.3.0</version> </dependency> <!-- https://mvnrepository.com/artifact/org.apache.flink/flink-scala_2.11 --> <dependency> <groupId>org.apache.flink</groupId> <artifactId>flink-scala_2.11</artifactId> <version>1.3.0</version> </dependency> <!-- https://mvnrepository.com/artifact/org.apache.flink/flink-streaming-scala_2.11 --> <dependency> <groupId>org.apache.flink</groupId> <artifactId>flink-streaming-scala_2.11</artifactId> <version>1.3.0</version> </dependency> <!-- https://mvnrepository.com/artifact/org.apache.flink/flink-streaming-core --> <dependency> <groupId>org.apache.flink</groupId> <artifactId>flink-streaming-core</artifactId> <version>0.9.1-hadoop1</version> </dependency> </dependencies></project>
DataSet WordCount
package zetyunimport org.apache.flink.api.scala.ExecutionEnvironment/** * Created by ryan on 17-7-19. */object DataSetWordCount { def main(args: Array[String]): Unit ={ // set up the execution environment val env = ExecutionEnvironment.getExecutionEnvironment // get input data val text = env.fromElements("To be, or not to be,--that is the question:--", "Whether 'tis nobler in the mind to suffer", "The slings and arrows of outrageous fortune", "Or to take arms against a sea of troubles,") val counts = text.flatMap { _.toLowerCase.split("\\W+")} .map { (_, 1) } // put (char, 1) format .groupBy(0) // group by key .sum(1) // sum the every key's number // emit result and print result counts.print() }}
阅读全文
0 0
- Flink学习笔记 --- 理解DataSet WordCount
- Flink学习笔记 --- 理解DataStream WordCount
- Flink学习笔记 --- scala实现Flink的DataSet Source进行WordCount
- Flink学习笔记 --- DataSet Source and Sink
- Flink学习笔记 --- 理解ConnectedStream 与 Union
- 《Flink学习笔记一》
- Flink学习笔记 --- Flink中Windows机制
- Flink学习笔记:1、Flink快速入门
- Flink学习笔记:2、Flink介绍
- Flink实现WordCount
- Flink WordCount实例讲解
- Flink学习笔记 --- DataStream Transformations
- mapreduce WordCount 学习笔记
- Flink DataSet API Programming Guide学习&译文(未完待续)
- Flink学习笔记 --- Flink本地(Loacl模式)安装
- Flink学习笔记:3、Flink分布式模式(Standalone)
- Flink学习笔记 --- Basic Concepts整理笔记
- Flink Basic API Concepts 学习笔记&译文
- 关于ligerUI的碰到的一些问题
- HTTP协议格式详解
- VB6播放MP3小程序
- svg 画图
- Linux文件与目录管理
- Flink学习笔记 --- 理解DataSet WordCount
- uva 10827 球面最大子矩形
- WM_CONTEXTMENU percolate upward
- zookeeper在win10安装闪退的问题
- Wink简介
- 基准和项目章程
- transform定位
- java itext 清空PDF标签
- [LeetCode]84Largest Rectangle in Histogram && 85Maximal Rectangle