Spark Jupyter Aws Alternatives

Name: PiercingDan/spark-Jupyter-AWS
Brand: PiercingDan/spark-Jupyter-AWS
SKU: project/PiercingDan/spark-Jupyter-AWS
Rating: 4.51 (255 reviews)

A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support

Categories > Data Processing > Amazon Web Services

Suggest Alternative

Stars

255

Alternatives

License

No license specified

Open Issues

Most Recent Commit

over 8 years ago

Programming Language

Jupyter Notebook

Dependent Repos

Dependent Packages

Total Releases

Categories

Data Processing > Jupyter Notebook

Cloud Computing > Amazon Web Services

Companies > Amazon

Data Processing > Spark

Data Processing > Hadoop

Cloud Computing > Aws Ec2

Cloud Computing > Aws S3

Data Processing > Apache Spark

Data Processing > Pyspark

Repo

Alternatives To PiercingDan/spark-Jupyter-AWS

Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
logicalclocks/hopsworks	1,041	0	0	over 2 years ago	1	September 11, 2019	12	agpl-3.0	Java
Hopsworks - Data-Intensive AI platform with a Feature Store
HariSekhon/DevOps-Python-tools	709	0	0	over 2 years ago	0		37	mit	Python
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
aws/sagemaker-spark	285	2	0	almost 3 years ago	36	August 26, 2022	34	apache-2.0	Scala
A Spark library for Amazon SageMaker.
commoncrawl/cc-pyspark	280	0	0	over 3 years ago	0		4	mit	Python
Process Common Crawl data with Python and Spark
PiercingDan/spark-Jupyter-AWS	255	0	0	over 8 years ago	0		2		Jupyter Notebook
A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support
RubensZimbres/Repo-2019	135	0	0	almost 5 years ago	0		1		Jupyter Notebook
BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google Assistant + Raspberry Pi, Infrared, Google Cloud Platform Natural Language, Anomaly detection, Tensorflow, Mathematics
adornes/spark_python_ml_examples	81	0	0	almost 7 years ago	0		0		Python
Spark 2.0 Python Machine Learning examples
arverma/TowardsDataEngineering	52	0	0	over 3 years ago	0		7		Python
This repo contains commands that data engineers use in day to day work.
idealo/terraform-emr-pyspark	46	0	0	over 2 years ago	0		2	apache-2.0	HCL
Quickstart PySpark with Anaconda on AWS/EMR using Terraform
datitran/emr-bootstrap-pyspark	43	0	0	over 9 years ago	0		0	mit	Python
Quickstart PySpark with Anaconda on AWS/EMR

Alternatives To PiercingDan/spark-Jupyter-AWS

Select To Compare

logicalclocks/hopsworks ⭐ 1,041

Hopsworks - Data-Intensive AI platform with a Feature Store

dependent packages 0 total releases 1 most recent commit over 2 years ago downloads badge

HariSekhon/DevOps-Python-tools ⭐ 709

80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.

dependent packages 0 total releases 0 most recent commit over 2 years ago

aws/sagemaker-spark ⭐ 285

A Spark library for Amazon SageMaker.

dependent packages 0 total releases 36 most recent commit almost 3 years ago downloads badge

commoncrawl/cc-pyspark ⭐ 280

Process Common Crawl data with Python and Spark

dependent packages 0 total releases 0 most recent commit over 3 years ago

PiercingDan/spark-Jupyter-AWS ⭐ 255

A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support

dependent packages 0 total releases 0 most recent commit over 8 years ago

RubensZimbres/Repo-2019 ⭐ 135

BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google Assistant + Raspberry Pi, Infrared, Google Cloud Platform Natural Language, Anomaly detection, Tensorflow, Mathematics

dependent packages 0 total releases 0 most recent commit almost 5 years ago

adornes/spark_python_ml_examples ⭐ 81

Spark 2.0 Python Machine Learning examples

dependent packages 0 total releases 0 most recent commit almost 7 years ago

arverma/TowardsDataEngineering ⭐ 52

This repo contains commands that data engineers use in day to day work.

dependent packages 0 total releases 0 most recent commit over 3 years ago

idealo/terraform-emr-pyspark ⭐ 46

Quickstart PySpark with Anaconda on AWS/EMR using Terraform

dependent packages 0 total releases 0 most recent commit over 2 years ago

datitran/emr-bootstrap-pyspark ⭐ 43

Quickstart PySpark with Anaconda on AWS/EMR

dependent packages 0 total releases 0 most recent commit over 9 years ago

Suggest An Alternative To spark-Jupyter-AWS

Alternative Project Comparisons

PiercingDan/spark-Jupyter-AWS vs Hopsworks

PiercingDan/spark-Jupyter-AWS vs Devops Python Tools

PiercingDan/spark-Jupyter-AWS vs Sagemaker Spark

PiercingDan/spark-Jupyter-AWS vs Cc Pyspark

PiercingDan/spark-Jupyter-AWS vs Spark Jupyter Aws

PiercingDan/spark-Jupyter-AWS vs Repo 2019

PiercingDan/spark-Jupyter-AWS vs Spark_python_ml_examples

PiercingDan/spark-Jupyter-AWS vs Towardsdataengineering

PiercingDan/spark-Jupyter-AWS vs Terraform Emr Pyspark

PiercingDan/spark-Jupyter-AWS vs Emr Bootstrap Pyspark

Popular Pyspark Projects

kailashahirwar/cheatsheets-ai⭐ 13,281

Essential Cheat Sheets for deep learning and machine learning researchers https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5

microsoft/SynapseML⭐ 5,228

Simple and Distributed Machine Learning

JohnSnowLabs/spark-nlp⭐ 3,578

State of the Art Natural Language Processing

apache/linkis⭐ 3,407

Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.

ibis-project/ibis⭐ 3,404

The flexibility of Python with the scale and performance of modern SQL.

Popular Amazon Web Services Projects

bregman-arie/devops-exercises⭐ 60,067

Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions

localstack/localstack⭐ 51,025

💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline

ByteByteGoHq/system-design-101⭐ 50,529

Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.

serverless/serverless⭐ 45,767

⚡ Serverless Framework – Build web, mobile and IoT applications with serverless architectures using AWS Lambda, Azure Functions, Google CloudFunctions & more! –

danny-avila/LibreChat⭐ 38,686

Enhanced ChatGPT Clone: Features Agents, MCP, Skills, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active

Popular Data Processing Categories

Jupyter Notebook

Dataset

Sql

Validation

Pipeline

Translation

Data Science

Classification

Transaction

Scraper