Airflow Spark

Docker with Airflow and Spark standalone cluster
Alternatives To Airflow Spark
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Pipeline4,158
a year ago85July 18, 20171apache-2.0Jsonnet
PipelineAI
Dataspherestudio2,860392 months ago7August 07, 2023360apache-2.0Java
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Around Dataengineering926
a year ago2Python
A Data Engineering & Machine Learning Knowledge Hub
Goodreads_etl_pipeline593
4 years agomitPython
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Data Engineering Interview Questions554
6 months ago
More than 2000+ Data engineer interview questions.
Agile_data_code_2435
a year ago7mitJupyter Notebook
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Data Engineering Projects322
a year ago5Jupyter Notebook
Personal Data Engineering Projects
Compass284
2 months ago75apache-2.0Java
Compass is a task diagnosis platform for bigdata
Beginner_de_project276
a year ago1mitHCL
Beginner data engineering project - batch edition
Airflow Pipeline168
4 months ago3apache-2.0Python
An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Alternatives To Airflow Spark
Select To Compare


Alternative Project Comparisons
Popular Spark Projects
Popular Airflow Projects
Popular Data Processing Categories

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Python
Jupyter Notebook
Docker
Postgresql
Spark
Hadoop
Dag
Airflow