Luigi Alternatives

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
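
To make "dependency resolution" concrete, here is a minimal sketch that uses only the Python standard library. It is not Luigi's API (Luigi expresses the same idea through task classes with `requires()`, `output()`, and `run()` methods); the job names and the `run_pipeline` helper are hypothetical and exist only for illustration. The sketch orders batch jobs so that each one runs after its dependencies, and skips jobs whose results are assumed to exist already.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical batch jobs: each key maps to the set of jobs it depends on.
PIPELINE = {
    "extract_logs": set(),
    "clean_logs": {"extract_logs"},
    "aggregate_daily": {"clean_logs"},
    "load_users": set(),
    "build_report": {"aggregate_daily", "load_users"},
}


def run_pipeline(deps, done=None):
    """Return the jobs executed, each one after all of its dependencies.

    Jobs listed in `done` are treated as already complete and are skipped.
    """
    done = set() if done is None else set(done)
    executed = []
    # static_order() yields dependencies before dependents and raises
    # graphlib.CycleError if the graph contains a cycle.
    for job in TopologicalSorter(deps).static_order():
        if job in done:
            continue  # result assumed to exist, nothing to redo
        executed.append(job)  # a real job would do its work here
        done.add(job)
    return executed


order = run_pipeline(PIPELINE)
print(order)
```

Calling `run_pipeline(PIPELINE, done={"extract_logs", "clean_logs"})` would execute only the three remaining jobs, which mirrors how a pipeline tool can resume a partially completed run.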

Stars: 17,046
License: apache-2.0
Open Issues: 124
Most Recent Commit: over 2 years ago
Programming Language: Python
Dependent Repos: 338
Dependent Packages: 76
Total Releases: 80
Latest Release: October 05, 2023
Categories: Programming Languages > Python; Data Processing > Pipeline; Data Processing > Hadoop

Alternatives To spotify/luigi

Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language	Description
spotify/luigi	17,046	338	76	over 2 years ago	80	October 05, 2023	124	apache-2.0	Python	Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
ColZer/DigAndBuried	645	0	0	almost 10 years ago	0		4		GCC Machine Description	Digging holes and filling them in
apache/tez	446	0	0	over 2 years ago	0		67	apache-2.0	Java	Apache Tez
ShifuML/shifu	235	1	2	over 3 years ago	9	April 03, 2019	237	apache-2.0	Java	An end-to-end machine learning and data mining framework on Hadoop
intel/graphbuilder	90	0	0	almost 12 years ago	0		1	apache-2.0	Java	The GraphBuilder library provides functions to construct large scale graphs. It is implemented on Apache Hadoop.
smart-data-lake/smart-data-lake	87	0	8	over 2 years ago	26	October 25, 2023	64	gpl-3.0	Scala	Smart Automation Tool for building modern Data Lakes and Data Pipelines
bloomreach/briefly	85	0	0	almost 8 years ago	0		2	apache-2.0	Python	Briefly - A Python Meta-programming Library for Job Flow Control
spencertipping/ni	81	0	0	about 3 years ago	0		6	mit	Perl	Say "ni" to data of any size
GoogleCloudPlatform/Data-Pipeline	79	0	0	over 12 years ago	0		2	apache-2.0	Python	Data pipeline is a tool to run Data loading pipelines. It is an open sourced app engine app that users can extend to suit their own needs. Out of the box it will load files from a source, transform them and then output them (output might be writing to a file or loading them into a data analysis tool). It is designed to be modular and support various sources, transformation technologies and output types. The transformations can be chained together to form complex pipelines.
occidere/TIL	51	0	0	over 3 years ago	0		173	gpl-3.0	DIGITAL Command Language	Today I Learned

Popular Hadoop Projects

apache/spark ⭐ 37,661

Apache Spark - A unified analytics engine for large-scale data processing

donnemartin/data-science-ipython-notebooks ⭐ 25,668

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

dmlc/xgboost ⭐ 25,253

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

Tencent/APIJSON ⭐ 16,277

🏆 A zero-code, full-featured, highly secure ORM library 🚀 Backend APIs and docs need no code; the frontend (client) customizes the data and structure of the returned JSON. 🏆 A JSON Transmission Protocol and an ORM Library 🚀 provides APIs and Docs without writing any code.

heibaiying/BigData-Notes ⭐ 14,872

A beginner's guide to big data ⭐

Popular Pipeline Projects

apache/airflow ⭐ 33,219

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

nushell/nushell ⭐ 28,304

A new type of shell

vectordotdev/vector ⭐ 21,215

A high-performance observability data pipeline.

jina-ai/jina ⭐ 19,573

☁️ Build multimodal AI applications with cloud-native stack

argoproj/argo-cd ⭐ 15,229

Declarative Continuous Deployment for Kubernetes

Popular Data Processing Categories

Jupyter Notebook

Dataset

SQL

Validation

Pipeline

Translation

Data Science

Classification

Transaction

Scraper