Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Spark | 37,661 | 2,394 | 939 | 8 months ago | 46 | May 09, 2021 | 186 | apache-2.0 | Scala | |
Apache Spark - A unified analytics engine for large-scale data processing | ||||||||||
Cookbook | 12,557 | 9 months ago | 111 | apache-2.0 | ||||||
The Data Engineering Cookbook | ||||||||||
God Of Bigdata | 8,483 | a year ago | 3 | |||||||
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive... | ||||||||||
Bigdl | 4,728 | 10 | 8 months ago | 16 | April 19, 2021 | 958 | apache-2.0 | Jupyter Notebook | ||
Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using bigdl-llm | ||||||||||
Sparkinternals | 4,665 | 3 years ago | 27 | |||||||
Notes talking about the design and implementation of Apache Spark | ||||||||||
Tensorflowonspark | 3,851 | 5 | a year ago | 32 | April 21, 2022 | 13 | apache-2.0 | Python | ||
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters. | ||||||||||
Spark Nlp | 3,578 | 30 | 8 months ago | 134 | December 08, 2023 | 43 | apache-2.0 | Scala | ||
State of the Art Natural Language Processing | ||||||||||
Roaringbitmap | 3,308 | 435 | 124 | 8 months ago | 187 | September 22, 2023 | 89 | apache-2.0 | Java | |
A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Tablesaw, and many others | ||||||||||
Koalas | 3,291 | 1 | 16 | a year ago | 47 | October 19, 2021 | 112 | apache-2.0 | Python | |
Koalas: pandas API on Apache Spark | ||||||||||
Spark On K8s Operator | 2,526 | 28 | 8 months ago | 19 | April 04, 2023 | 533 | apache-2.0 | Go | ||
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes. |