PySpark on AWS EMR

This project provides a ready-to-use AWS EMR template that combines Spot Fleet and On-Demand Instances, so you can bring up a cluster quickly and focus on writing PySpark code.
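To make the idea of mixing On-Demand and Spot capacity concrete, here is a minimal sketch of an instance-fleet configuration in the shape that boto3's EMR `run_job_flow` call accepts under `Instances["InstanceFleets"]`. It is not the project's actual template: the helper name `build_instance_fleets`, the `m5.xlarge` instance type, the capacities, and the 10-minute Spot timeout are all assumptions chosen for illustration.

```python
from typing import Any, Dict, List


def build_instance_fleets(
    core_on_demand_units: int = 1,
    task_spot_units: int = 2,
    instance_type: str = "m5.xlarge",  # assumed instance type, not from the project
) -> List[Dict[str, Any]]:
    """Build a hypothetical InstanceFleets list.

    Master and core fleets use On-Demand capacity; the task fleet uses Spot
    capacity and falls back to On-Demand if Spot cannot be provisioned in time.
    """
    return [
        {
            "Name": "master",
            "InstanceFleetType": "MASTER",
            "TargetOnDemandCapacity": 1,
            "InstanceTypeConfigs": [{"InstanceType": instance_type}],
        },
        {
            "Name": "core",
            "InstanceFleetType": "CORE",
            "TargetOnDemandCapacity": core_on_demand_units,
            "InstanceTypeConfigs": [
                {"InstanceType": instance_type, "WeightedCapacity": 1}
            ],
        },
        {
            "Name": "task",
            "InstanceFleetType": "TASK",
            "TargetSpotCapacity": task_spot_units,
            "InstanceTypeConfigs": [
                {"InstanceType": instance_type, "WeightedCapacity": 1}
            ],
            "LaunchSpecifications": {
                "SpotSpecification": {
                    # Assumed values: wait 10 minutes for Spot, then use On-Demand.
                    "TimeoutDurationMinutes": 10,
                    "TimeoutAction": "SWITCH_TO_ON_DEMAND",
                }
            },
        },
    ]


fleets = build_instance_fleets()
for fleet in fleets:
    # Print each fleet's name with its On-Demand and Spot targets.
    print(
        fleet["Name"],
        fleet.get("TargetOnDemandCapacity", 0),
        fleet.get("TargetSpotCapacity", 0),
    )
```

A list like this would typically be passed as `Instances={"InstanceFleets": fleets, ...}` to `boto3.client("emr").run_job_flow(...)`, together with the release label, IAM roles, and the PySpark steps to run, none of which are shown here.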
Alternatives to PySpark on AWS EMR
| Project Name | Stars | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language | Description |
|---|---|---|---|---|---|---|---|---|
| Hopsworks | 1,041 | 3 months ago | 1 | September 11, 2019 | 12 | agpl-3.0 | Java | Hopsworks - Data-Intensive AI platform with a Feature Store |
| Devops Python Tools | 709 | 4 months ago | | | 37 | mit | Python | 80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc. |
| Sagemaker Spark | 285 | 27 months ago | 36 | August 26, 2022 | 34 | apache-2.0 | Scala | A Spark library for Amazon SageMaker. |
| Cc Pyspark | 280 | a year ago | | | 4 | mit | Python | Process Common Crawl data with Python and Spark |
| Spark Jupyter Aws | 255 | 7 years ago | | | 2 | | Jupyter Notebook | A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support |
| Repo 2019 | 135 | 3 years ago | | | 1 | | Jupyter Notebook | BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google Assistant + Raspberry Pi, Infrared, Google Cloud Platform Natural Language, Anomaly detection, Tensorflow, Mathematics |
| Spark_python_ml_examples | 81 | 5 years ago | | | | | Python | Spark 2.0 Python Machine Learning examples |
| Towardsdataengineering | 52 | a year ago | | | 7 | | Python | This repo contains commands that data engineers use in day to day work. |
| Terraform Emr Pyspark | 46 | 5 months ago | | | 2 | apache-2.0 | HCL | Quickstart PySpark with Anaconda on AWS/EMR using Terraform |
| Emr Bootstrap Pyspark | 43 | 7 years ago | | | | mit | Python | Quickstart PySpark with Anaconda on AWS/EMR |