Pyspark Style Guide

This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.
Alternatives To Pyspark Style Guide
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Synapseml4,9896a month ago12November 27, 2023335mitScala
Simple and Distributed Machine Learning
Spark Nlp3,578305 months ago134December 08, 202343apache-2.0Scala
State of the Art Natural Language Processing
Ibis3,40424295 months ago68December 10, 2023157apache-2.0Python
The flexibility of Python with the scale and performance of modern SQL.
Linkis3,25038a month ago3July 29, 2023215apache-2.0Java
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
Petastorm1,69387 months ago86February 03, 2023174apache-2.0Python
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
Spark Py Notebooks1,515
a year ago9otherJupyter Notebook
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Mleap1,47915127 months ago26May 07, 2021109apache-2.0Scala
MLeap: Deploy ML Pipelines to Production
Awesome Spark1,461
a year ago20cc0-1.0Shell
A curated list of awesome Apache Spark packages and resources.
Optimus1,447
2 months ago32June 19, 202229apache-2.0Python
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Sparkmagic1,2722565 months ago54September 13, 2023156otherPython
Jupyter magics and kernels for working with remote Spark clusters
Alternatives To Pyspark Style Guide
Select To Compare


Alternative Project Comparisons
Popular Pyspark Projects
Popular Spark Projects
Popular Data Processing Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Python
Spark
Dataframe
Pyspark