Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Lakesoul | 2,248 | 1 | 3 months ago | 7 | October 12, 2023 | 15 | apache-2.0 | Java | ||
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications. | ||||||||||
Ballista | 2,244 | 13 | 3 years ago | 4 | May 10, 2020 | apache-2.0 | ||||
Distributed compute platform implemented in Rust, and powered by Apache Arrow. | ||||||||||
Datafusion | 626 | 5 years ago | apache-2.0 | Rust | ||||||
DataFusion has now been donated to the Apache Arrow project | ||||||||||
Ballista | 411 | 4 years ago | 32 | apache-2.0 | Rust | |||||
Experimental Distributed Compute Platform based on Kubnernetes and Apache Arrow | ||||||||||
Gazelle_plugin | 250 | a year ago | 5 | July 15, 2022 | 215 | apache-2.0 | Scala | |||
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations. | ||||||||||
Rust Dataframe | 250 | 3 years ago | 12 | apache-2.0 | Rust | |||||
A Rust DataFrame implementation, built on Apache Arrow | ||||||||||
Spark Clickhouse Connector | 156 | 4 months ago | 8 | August 09, 2022 | 33 | apache-2.0 | Scala | |||
Spark ClickHouse Connector build on DataSourceV2 API | ||||||||||
Flight Spark Source | 94 | 9 months ago | 5 | apache-2.0 | Java | |||||
Blog | 44 | a year ago | ||||||||
blog entries | ||||||||||
Learn Data Munging | 37 | 3 months ago | mit | Jupyter Notebook | ||||||
Notes on Data Engineering with Pandas, PySpark, Dask, Ray, Arrow DataFusion, Polars etc. |