Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Spark | 37,661 | 2,394 | 939 | 9 months ago | 46 | May 09, 2021 | 186 | apache-2.0 | Scala | |
Apache Spark - A unified analytics engine for large-scale data processing | ||||||||||
Synapseml | 5,053 | 6 | a month ago | 12 | November 27, 2023 | 335 | mit | Scala | ||
Simple and Distributed Machine Learning | ||||||||||
Bigdl | 4,728 | 10 | 9 months ago | 16 | April 19, 2021 | 958 | apache-2.0 | Jupyter Notebook | ||
Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using bigdl-llm | ||||||||||
Sparkinternals | 4,665 | 3 years ago | 27 | |||||||
Notes talking about the design and implementation of Apache Spark | ||||||||||
Spark Nlp | 3,578 | 30 | 9 months ago | 134 | December 08, 2023 | 43 | apache-2.0 | Scala | ||
State of the Art Natural Language Processing | ||||||||||
Coolplayspark | 3,447 | 2 years ago | 35 | Scala | ||||||
酷玩 Spark: Spark 源代码解析、Spark 类库等 | ||||||||||
Koalas | 3,291 | 1 | 16 | a year ago | 47 | October 19, 2021 | 112 | apache-2.0 | Python | |
Koalas: pandas API on Apache Spark | ||||||||||
Deequ | 3,044 | 6 | 10 months ago | 37 | November 09, 2023 | 141 | apache-2.0 | Scala | ||
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets. | ||||||||||
Analytics Zoo | 2,592 | 3 | a year ago | 1 | July 29, 2022 | 533 | apache-2.0 | Jupyter Notebook | ||
Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray | ||||||||||
Spark On K8s Operator | 2,526 | 28 | 9 months ago | 19 | April 04, 2023 | 533 | apache-2.0 | Go | ||
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes. |