Goodreads_etl_pipeline Alternatives

Name: san089/goodreads_etl_pipeline
Brand: san089/goodreads_etl_pipeline
SKU: project/san089/goodreads_etl_pipeline
Rating: 4.64 (593 reviews)

An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.

Categories > Data Processing > Pipeline

Suggest Alternative

Stars

593

Alternatives

License

mit

Open Issues

Most Recent Commit

over 6 years ago

Programming Language

Python

Dependent Repos

Dependent Packages

Total Releases

Categories

Programming Languages > Python

Data Processing > Pipeline

Data Processing > Spark

Cloud Computing > S3

Data Processing > Etl

Computer Science > Dag

Data Processing > Apache Spark

Data Processing > Data Engineering

Control Flow > Airflow

Data Storage > Redshift

Repo

Alternatives To san089/goodreads_etl_pipeline

Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
apache/airflow	33,219	0	320	over 2 years ago	169	November 27, 2023	890	apache-2.0	Python
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
argoproj/argo-workflows	13,966	24	51	over 2 years ago	449	November 27, 2023	993	apache-2.0	Go
Workflow Engine for Kubernetes
PrefectHQ/prefect	13,886	1	152	over 2 years ago	249	December 08, 2023	632	apache-2.0	Python
Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines
airbytehq/airbyte	12,918	0	11	over 2 years ago	311	December 08, 2023	5,111	other	Python
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
dagster-io/dagster	9,467	2	133	over 2 years ago	585	December 07, 2023	2,343	apache-2.0	Python
An orchestration platform for the development, production, and observation of data assets.
great-expectations/great_expectations	9,179	0	53	over 2 years ago	256	December 08, 2023	182	apache-2.0	Python
Always know what to expect from your data.
mage-ai/mage-ai	6,324	0	0	over 2 years ago	314	December 06, 2023	189	apache-2.0	Python
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
kestra-io/kestra	5,257	0	4	over 2 years ago	58	November 28, 2023	464	apache-2.0	Java
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
Avaiga/taipy	4,311	0	0	over 2 years ago	26	October 27, 2023	181	apache-2.0	Python
Turns Data and AI algorithms into production-ready web applications in no time.
ploomber/ploomber	3,318	0	7	over 2 years ago	115	November 29, 2023	99	apache-2.0	Python
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

Alternatives To san089/goodreads_etl_pipeline

Select To Compare

apache/airflow ⭐ 33,219

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

dependent packages 320 total releases 169 most recent commit over 2 years ago downloads badge

argoproj/argo-workflows ⭐ 13,966

Workflow Engine for Kubernetes

dependent packages 51 total releases 449 most recent commit over 2 years ago

PrefectHQ/prefect ⭐ 13,886

Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines

dependent packages 152 total releases 249 most recent commit over 2 years ago downloads badge

airbytehq/airbyte ⭐ 12,918

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

dependent packages 11 total releases 311 most recent commit over 2 years ago downloads badge

dagster-io/dagster ⭐ 9,467

An orchestration platform for the development, production, and observation of data assets.

dependent packages 133 total releases 585 most recent commit over 2 years ago downloads badge

great-expectations/great_expectations ⭐ 9,179

Always know what to expect from your data.

dependent packages 53 total releases 256 most recent commit over 2 years ago downloads badge

mage-ai/mage-ai ⭐ 6,324

🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.

dependent packages 0 total releases 314 most recent commit over 2 years ago downloads badge

kestra-io/kestra ⭐ 5,257

Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.

dependent packages 4 total releases 58 most recent commit over 2 years ago

Avaiga/taipy ⭐ 4,311

Turns Data and AI algorithms into production-ready web applications in no time.

dependent packages 0 total releases 26 most recent commit over 2 years ago downloads badge

ploomber/ploomber ⭐ 3,318

The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

dependent packages 7 total releases 115 most recent commit over 2 years ago downloads badge

Suggest An Alternative To goodreads_etl_pipeline

Alternative Project Comparisons

san089/goodreads_etl_pipeline vs Airflow

san089/goodreads_etl_pipeline vs Argo Workflows

san089/goodreads_etl_pipeline vs Prefect

san089/goodreads_etl_pipeline vs Airbyte

san089/goodreads_etl_pipeline vs Dagster

san089/goodreads_etl_pipeline vs Great_expectations

san089/goodreads_etl_pipeline vs Mage Ai

san089/goodreads_etl_pipeline vs Kestra

san089/goodreads_etl_pipeline vs Taipy

san089/goodreads_etl_pipeline vs Ploomber

Popular Pipeline Projects

nushell/nushell⭐ 28,304

A new type of shell

vectordotdev/vector⭐ 21,215

A high-performance observability data pipeline.

jina-ai/jina⭐ 19,573

☁️ Build multimodal AI applications with cloud-native stack

spotify/luigi⭐ 17,046

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

argoproj/argo-cd⭐ 15,229

Declarative Continuous Deployment for Kubernetes

Popular Data Engineering Projects

apache/superset⭐ 56,358

Apache Superset is a Data Visualization and Data Exploration Platform

GokuMohandas/Made-With-ML⭐ 34,775

Learn how to design, develop, deploy and iterate on production-grade ML applications.

eugeneyan/applied-ml⭐ 24,828

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

DataTalksClub/data-engineering-zoomcamp⭐ 19,461

Free Data Engineering course!

andkret/Cookbook⭐ 12,557

The Data Engineering Cookbook

Popular Data Processing Categories

Jupyter Notebook

Dataset

Sql

Validation

Pipeline

Translation

Data Science

Classification

Transaction

Scraper