Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Nessie | 762 | 32 | 7 months ago | 40 | November 21, 2023 | 110 | apache-2.0 | Java | ||
Nessie: Transactional Catalog for Data Lakes with Git-like semantics | ||||||||||
Iceberg | 409 | 3 years ago | 27 | apache-2.0 | Java | |||||
Iceberg is a table format for large, slow-moving tabular data | ||||||||||
Connectors | 383 | a year ago | 5 | December 06, 2022 | apache-2.0 | Java | ||||
This library allows Scala and Java-based projects (including Apache Flink, Apache Hive, Apache Beam, and PrestoDB) to read from and write to Delta Lake. | ||||||||||
Parquet Index | 113 | 3 years ago | 16 | apache-2.0 | Scala | |||||
Spark SQL index for Parquet tables | ||||||||||
Tpch Spark | 91 | 8 months ago | 1 | mit | C | |||||
TPC-H queries in Apache Spark SQL using native DataFrames API | ||||||||||
Flowman | 85 | 24 | 9 months ago | 65 | October 16, 2023 | 55 | apache-2.0 | Scala | ||
Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pipelines. | ||||||||||
Spark Llap | 82 | 4 years ago | 31 | apache-2.0 | Java | |||||
Spark Acid | 79 | 3 years ago | 19 | apache-2.0 | Scala | |||||
ACID Data Source for Apache Spark based on Hive ACID | ||||||||||
Cc Index Table | 78 | a year ago | 8 | apache-2.0 | Java | |||||
Index Common Crawl archives in tabular format | ||||||||||
Luigi Warehouse | 73 | 7 years ago | other | Python | ||||||
A luigi powered analytics / warehouse stack |