Apache Beam

Apache Beam provides an advanced unified programming model, allowing you to implement batch and streaming data processing jobs that can run on any execution engine.

Apache Beam is:

  • UNIFIED - Use a single programming model for both batch and streaming use cases.
  • PORTABLE - Execute pipelines on multiple execution environments, including Apache Apex, Apache Flink, Apache Spark, and Google Cloud Dataflow.
  • EXTENSIBLE - Write and share new SDKs, IO connectors, and transformation libraries.
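
As a rough illustration of the UNIFIED point, the sketch below writes the transformation logic once and applies it to both a bounded (batch) input and an unbounded (streaming) input. It uses plain Python generators and does not use Beam's own API; the function name parse_and_square and the sample data are hypothetical, chosen only for this example.

```python
import itertools
from typing import Iterable, Iterator


def parse_and_square(records: Iterable[str]) -> Iterator[int]:
    """Transformation logic written once: parse each record, then square it.

    The function is a lazy generator, so it never needs the whole input
    at once. This is a conceptual stand-in, not a Beam transform.
    """
    for record in records:
        value = int(record)
        yield value * value


# Bounded ("batch") input: a finite, in-memory collection.
batch_input = ["1", "2", "3"]
batch_result = list(parse_and_square(batch_input))

# Unbounded ("streaming") input: an endless source consumed incrementally.
stream_input = (str(n) for n in itertools.count(1))
stream_result = list(itertools.islice(parse_and_square(stream_input), 3))

print(batch_result)
print(stream_result)
```

The same function handles both inputs without modification; only the source differs. In Beam itself, the SDK's pipeline and transform abstractions fill this role, and the chosen runner decides how the work is executed.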

Get Started

To use Beam for your data processing tasks, start by reading the Beam Overview and performing the steps in the Quickstart. Then dive into the Documentation section for in-depth concepts and reference materials for the Beam model, SDKs, and runners.

Contribute

Beam is an Apache Software Foundation project, available under the Apache v2 license. Beam is an open source community and contributions are greatly appreciated! If you’d like to contribute, please see the Contribute section.

