Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Pyspark Examples | 778 | a year ago | 8 | Python | ||||||
Pyspark RDD, DataFrame and Dataset Examples in Python language | ||||||||||
Chispa | 443 | 7 | 6 months ago | 19 | October 01, 2023 | 33 | mit | Python | ||
PySpark test helper methods with beautiful error messages | ||||||||||
Datacompy | 339 | 10 | 3 months ago | 20 | November 15, 2023 | 16 | apache-2.0 | Python | ||
Pandas and Spark DataFrame comparison for humans and more! | ||||||||||
Pyspark Style Guide | 264 | 3 years ago | mit | Python | ||||||
This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered. | ||||||||||
Data Algorithms With Spark | 151 | 10 months ago | Python | |||||||
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian | ||||||||||
Pyspark Cheatsheet | 140 | 2 years ago | cc0-1.0 | Python | ||||||
PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster | ||||||||||
Handyspark | 129 | 5 years ago | 7 | May 19, 2019 | 8 | mit | Jupyter Notebook | |||
HandySpark - bringing pandas-like capabilities to Spark dataframes | ||||||||||
Aut | 128 | 10 months ago | 27 | November 17, 2022 | 3 | apache-2.0 | Scala | |||
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives. | ||||||||||
Spark With Python | 98 | 4 years ago | mit | Jupyter Notebook | ||||||
Fundamentals of Spark with Python (using PySpark), code examples | ||||||||||
Big_data | 55 | 4 months ago | mit | Jupyter Notebook | ||||||
Tutorials on Big Data essentials: Hadoop, MapReduce, Spark. |