Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Spark | 36,783 | 2,394 | 903 | 11 hours ago | 46 | May 09, 2021 | 251 | apache-2.0 | Scala | |
Apache Spark - A unified analytics engine for large-scale data processing | ||||||||||
Sparkinternals | 4,665 | 2 years ago | 27 | |||||||
Notes talking about the design and implementation of Apache Spark | ||||||||||
Synapseml | 4,523 | 3 | 2 days ago | 9 | November 22, 2022 | 318 | mit | Scala | ||
Simple and Distributed Machine Learning | ||||||||||
Hudi | 4,507 | 10 | 11 hours ago | 18 | May 24, 2023 | 792 | apache-2.0 | Java | ||
Upserts, Deletes And Incremental Processing on Big Data. | ||||||||||
Bigdl | 4,382 | 10 | 12 hours ago | 16 | April 19, 2021 | 826 | apache-2.0 | Jupyter Notebook | ||
Accelerating LLM with low-bit (INT3 / INT4 / NF4 / INT5 / INT8) optimizations using bigdl-llm | ||||||||||
Spark Nlp | 3,434 | 25 | 18 hours ago | 128 | August 02, 2023 | 47 | apache-2.0 | Scala | ||
State of the Art Natural Language Processing | ||||||||||
Coolplayspark | 3,397 | a year ago | 35 | Scala | ||||||
Coolplay Spark: Spark source code analysis, Spark libraries, and more | ||||||||||
Koalas | 3,291 | 1 | 13 | 2 days ago | 47 | October 19, 2021 | 112 | apache-2.0 | Python | |
Koalas: pandas API on Apache Spark | ||||||||||
Spark Notebook | 3,147 | 4 months ago | 207 | apache-2.0 | JavaScript | |||||
Interactive and Reactive Data Science using Scala and Spark. | ||||||||||
Deequ | 2,920 | 6 | 20 days ago | 35 | August 08, 2023 | 135 | apache-2.0 | Scala | ||
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets. |