Py-apache-beam

Jul 20, 2023

Apache Beam SDK for Python

Apache Beam is a unified programming model for Batch and Streaming data processing.

Beam provides a general approach to expressing embarrassingly parallel data processing pipelines and supports three categories of users, each of which have relatively disparate backgrounds and needs.

  • End Users Writing pipelines with an existing SDK, running it on an existing runner. These users want to focus on writing their application logic and have everything else just work.
  • SDK Writers Developing a Beam SDK targeted at a specific user community Java, Python, Scala, Go, R, graphical, etc. These users are language geeks and would prefer to be shielded from all the details of various runners and their implementations.
  • Runner Writers Have an execution environment for distributed processing and would like to support programs written against the Beam Model. Would prefer to be shielded from details of multiple SDKs.


Checkout these related ports:
  • Zoneinfo - Updated timezone definitions
  • Zine - Simple and opinionated tool to build your own magazine
  • Ytree - DOS-XTREE(tm) look-a-like file manager
  • Yaunc - Yet another uptimes.net client
  • Xtypo - X-based keyboard trainer
  • Xtar - View and manipulate contents of a tar file
  • Xtail - Watches the growth of files or directories
  • Xplr - Hackable, minimal, fast TUI file explorer
  • Xosd - X11 on-screen-display program and library
  • Xless - X11-based viewer for text files
  • Xiphos - Bible interface utilizing the sword framework
  • Xgboost - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT, GBM)
  • Xgas - The animated simulation of an ideal gas
  • Xfce4-wm-themes - Additional themes for xfwm4
  • Xfce4-weather-plugin - Weather plugin for the Xfce panel