Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Synapseml | 4,989 | 6 | a month ago | 12 | November 27, 2023 | 335 | mit | Scala | ||
Simple and Distributed Machine Learning | ||||||||||
Spark Py Notebooks | 1,515 | | | | a year ago | | | 9 | other | Jupyter Notebook |
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks | ||||||||||
Optimus | 1,447 | | | | 2 months ago | 32 | June 19, 2022 | 29 | apache-2.0 | Python |
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark | ||||||||||
Sparkling Water | 957 | 6 | 6 months ago | 195 | October 26, 2023 | 44 | apache-2.0 | Scala | ||
Sparkling Water provides H2O functionality inside Spark cluster | ||||||||||
Sparklearning | 451 | | | | 2 years ago | | | | | |
A comprehensive Spark guide collated from multiple sources that can be referred to learn more about Spark or as an interview refresher. | ||||||||||
Gimel | 230 | | | | 2 years ago | | | 9 | apache-2.0 | Scala |
Big Data Processing Framework - Unified Data API or SQL on Any Storage | ||||||||||
Geopyspark | 151 | | | | 4 years ago | | | 43 | other | Python |
GeoTrellis for PySpark | ||||||||||
Data Algorithms With Spark | 151 | | | | a year ago | | | | | Python |
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian | ||||||||||
Pyspark Cheatsheet | 140 | | | | 2 years ago | | | | cc0-1.0 | Python |
PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster | ||||||||||
Big Data Mapreduce Course | 135 | | | | 7 months ago | | | | | HTML |
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University |