Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Doris | 11,243 | 23 days ago | 8 | September 27, 2023 | 2,332 | apache-2.0 | Java | |||
Apache Doris is an easy-to-use, high performance and unified analytics database. | ||||||||||
Addax | 1,034 | 67 | 4 months ago | 10 | July 29, 2023 | 8 | apache-2.0 | Java | ||
Addax is a versatile open-source ETL tool that can seamlessly transfer data between various RDBMS and NoSQL databases, making it an ideal solution for data migration. | ||||||||||
Aws Glue Libs | 568 | 9 months ago | 96 | other | Python | |||||
AWS Glue Libraries are additions and enhancements to Spark for ETL operations. | ||||||||||
Big_data_architect_skills | 353 | 5 years ago | 1 | |||||||
一个大数据架构师应该掌握的技能 | ||||||||||
Cascading | 321 | 5 years ago | n,ull | other | Java | |||||
Cascading is a feature rich API for defining and executing complex and fault tolerant data processing workflows on various cluster computing platforms. Please see https://github.com/cwensel/cascading for access to all WIP branches. | ||||||||||
Crunch | 196 | 9 years ago | December 22, 2023 | 1 | Go | |||||
A fast to develop, fast to run, Go based toolkit for ETL and feature extraction on Hadoop. | ||||||||||
Eel Sdk | 140 | 1 | 17 | 3 years ago | 103 | February 11, 2019 | 25 | apache-2.0 | Scala | |
Big Data Toolkit for the JVM | ||||||||||
Sequenceiq Samples | 119 | 9 years ago | apache-2.0 | Java | ||||||
SequenceIQ Hadoop examples | ||||||||||
Chombo | 102 | 3 years ago | 5 | Java | ||||||
Big Data ETL and Utilities for Hadoop Map Reduce, Spark and Storm | ||||||||||
Flowman | 85 | 24 | 5 months ago | 65 | October 16, 2023 | 55 | apache-2.0 | Scala | ||
Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pipelines. |