Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Luigi | 17,046 | 338 | 76 | a year ago | 80 | October 05, 2023 | 124 | apache-2.0 | Python | |
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in. | ||||||||||
Digandburied | 645 | | | | 9 years ago | | | 4 | | GCC Machine Description |
Digging holes and filling them in | ||||||||||
Tez | 446 | | | | a year ago | | | 67 | apache-2.0 | Java |
Apache Tez | ||||||||||
Graphbuilder | 90 | | | | 11 years ago | | | 1 | apache-2.0 | Java |
The GraphBuilder library provides functions to construct large scale graphs. It is implemented on Apache Hadoop. | ||||||||||
Smart Data Lake | 87 | 8 | a year ago | 26 | October 25, 2023 | 64 | gpl-3.0 | Scala | ||
Smart Automation Tool for building modern Data Lakes and Data Pipelines | ||||||||||
Briefly | 85 | | | | 7 years ago | | | 2 | apache-2.0 | Python |
Briefly - A Python Meta-programming Library for Job Flow Control | ||||||||||
Ni | 81 | | | | 2 years ago | | | 6 | mit | Perl |
Say "ni" to data of any size | ||||||||||
Data Pipeline | 79 | | | | 11 years ago | | | 2 | apache-2.0 | Python |
Data Pipeline is a tool for running data-loading pipelines. It is an open-source App Engine app that users can extend to suit their own needs. Out of the box, it loads files from a source, transforms them, and outputs them (for example, by writing to a file or loading them into a data analysis tool). It is designed to be modular and to support various sources, transformation technologies, and output types. Transformations can be chained together to form complex pipelines. | ||||||||||
Til | 51 | | | | 3 years ago | | | 173 | gpl-3.0 | DIGITAL Command Language |
Today I Learned | ||||||||||
Teraslice | 50 | 6 | 31 | a year ago | 129 | September 27, 2023 | 143 | apache-2.0 | TypeScript | |
Scalable data processing pipelines in JavaScript |