Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Synapseml | 4,960 | 6 | | | 10 days ago | 12 | November 27, 2023 | 335 | mit | Scala |
Simple and Distributed Machine Learning | ||||||||||
Spark Nlp | 3,578 | 30 | | | 3 months ago | 134 | December 08, 2023 | 43 | apache-2.0 | Scala |
State of the Art Natural Language Processing | ||||||||||
Awesome Spark | 1,461 | | | | a year ago | | | 20 | cc0-1.0 | Shell |
A curated list of awesome Apache Spark packages and resources. | ||||||||||
Sparkit Learn | 1,054 | 5 | | | 3 years ago | 13 | June 24, 2015 | 35 | apache-2.0 | Python |
PySpark + Scikit-learn = Sparkit-learn | ||||||||||
Quinn | 572 | 3 | 3 | | 21 days ago | 13 | February 17, 2023 | 32 | | Python |
pyspark methods to enhance developer productivity 📣 👯 🎉 | ||||||||||
Spark Gotchas | 276 | | | | 7 years ago | | | 5 | other | |
Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks | ||||||||||
Spark Jupyter Aws | 255 | | | | 6 years ago | | | 2 | | Jupyter Notebook |
A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support | ||||||||||
Pysparkling | 253 | 7 | 1 | | a year ago | 69 | November 13, 2022 | 9 | other | Python |
A pure Python implementation of Apache Spark's RDD and DStream interfaces. | ||||||||||
Sql Data Analysis And Visualization Projects | 200 | | | | 2 years ago | | | | mit | Jupyter Notebook |
SQL data analysis & visualization projects using MySQL, PostgreSQL, SQLite, Tableau, Apache Spark and pySpark. | ||||||||||
Azure Cosmosdb Spark | 194 | 1 | | | a year ago | 45 | August 11, 2021 | 102 | mit | Scala |
Apache Spark Connector for Azure Cosmos DB |