Pyspark Setup Guide

A guide for setting up Spark + PySpark under Ubuntu linux
Alternatives To Pyspark Setup Guide
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Synapseml4,943614 days ago12November 27, 2023335mitScala
Simple and Distributed Machine Learning
Spark Nlp3,578302 months ago134December 08, 202343apache-2.0Scala
State of the Art Natural Language Processing
Ibis3,40424292 months ago68December 10, 2023157apache-2.0Python
The flexibility of Python with the scale and performance of modern SQL.
Linkis3,224382 days ago3July 29, 2023215apache-2.0Java
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
Petastorm1,69384 months ago86February 03, 2023174apache-2.0Python
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
Spark Py Notebooks1,515
a year ago9otherJupyter Notebook
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Mleap1,47915124 months ago26May 07, 2021109apache-2.0Scala
MLeap: Deploy ML Pipelines to Production
Awesome Spark1,461
a year ago20cc0-1.0Shell
A curated list of awesome Apache Spark packages and resources.
Optimus1,432
10 days ago32June 19, 202229apache-2.0Python
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Sparkmagic1,2722562 months ago54September 13, 2023156otherPython
Jupyter magics and kernels for working with remote Spark clusters
Alternatives To Pyspark Setup Guide
Select To Compare


Alternative Project Comparisons
Popular Pyspark Projects
Popular Spark Projects
Popular Data Processing Categories

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Jupyter Notebook
Scala
Apache
Spark
Ipython
Apache Spark
Pyspark