Project Name | Stars | Downloads / Repos Using This / Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language | Description |
---|---|---|---|---|---|---|---|---|---|
Airflow | 31,852 | 271 | 2 hours ago | 157 | July 10, 2023 | 874 | apache-2.0 | Python | Apache Airflow - A platform to programmatically author, schedule, and monitor workflows |
Argo Workflows | 13,481 | 24, 45 | 2 hours ago | 439 | July 20, 2023 | 905 | apache-2.0 | Go | Workflow Engine for Kubernetes |
Orchest | 3,876 | | 4 months ago | 19 | December 13, 2022 | 125 | apache-2.0 | TypeScript | Build data pipelines, the easy way 🛠️ |
Ploomber | 3,164 | 6 | a month ago | 113 | August 03, 2023 | 97 | apache-2.0 | Python | The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️ |
Awesome Etl | 2,874 | | 5 months ago | | | 11 | | | A curated list of awesome ETL frameworks, libraries, and software. |
Mara Pipelines | 2,019 | 1 | 7 days ago | 7 | May 29, 2022 | 24 | mit | Python | A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow |
Elyra | 1,672 | 14 | 8 hours ago | 111 | March 29, 2023 | 267 | apache-2.0 | Python | Elyra extends JupyterLab with an AI centric approach. |
Couler | 834 | | 2 months ago | | | 33 | apache-2.0 | Python | Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow. |
Goodreads_etl_pipeline | 593 | | 4 years ago | | | | mit | Python | An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform. |
Data Pipelines With Apache Airflow | 503 | | 4 months ago | | | 35 | other | Python | Code for Data Pipelines with Apache Airflow |

This repo contains a tutorial on Airflow using census data and the Chicago taxi dataset.
For a detailed overview of the requirements, setup, and contents, see the docs: https://opendata-airflow-tutorial.readthedocs.io/en/latest/
Note: this is still very much a work in progress. I plan to add more pipelines, more use cases, and a how-to on deploying to Azure Kubernetes Service.