Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Synapseml | 4,989 | 6 | a month ago | 12 | November 27, 2023 | 335 | mit | Scala | ||
Simple and Distributed Machine Learning | ||||||||||
Spark Nlp | 3,578 | 30 | 5 months ago | 134 | December 08, 2023 | 43 | apache-2.0 | Scala | ||
State of the Art Natural Language Processing | ||||||||||
Ibis | 3,404 | 24 | 29 | 5 months ago | 68 | December 10, 2023 | 157 | apache-2.0 | Python | |
The flexibility of Python with the scale and performance of modern SQL. | ||||||||||
Linkis | 3,250 | 38 | a month ago | 3 | July 29, 2023 | 215 | apache-2.0 | Java | ||
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines. | ||||||||||
Petastorm | 1,693 | 8 | 7 months ago | 86 | February 03, 2023 | 174 | apache-2.0 | Python | ||
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code. | ||||||||||
Spark Py Notebooks | 1,515 | a year ago | 9 | other | Jupyter Notebook | |||||
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks | ||||||||||
Mleap | 1,479 | 15 | 12 | 8 months ago | 26 | May 07, 2021 | 109 | apache-2.0 | Scala | |
MLeap: Deploy ML Pipelines to Production | ||||||||||
Awesome Spark | 1,461 | a year ago | 20 | cc0-1.0 | Shell | |||||
A curated list of awesome Apache Spark packages and resources. | ||||||||||
Optimus | 1,447 | 2 months ago | 32 | June 19, 2022 | 29 | apache-2.0 | Python | |||
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark | ||||||||||
Sparkmagic | 1,272 | 25 | 6 | 5 months ago | 54 | September 13, 2023 | 156 | other | Python | |
Jupyter magics and kernels for working with remote Spark clusters |