Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Hopsworks | 1,041 | | | | 2 months ago | 1 | September 11, 2019 | 12 | agpl-3.0 | Java |
Hopsworks - Data-Intensive AI platform with a Feature Store | ||||||||||
Devops Python Tools | 709 | | | | 3 months ago | | | 37 | mit | Python |
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc. | ||||||||||
Sagemaker Spark | 285 | 2 | 6 months ago | 36 | August 26, 2022 | 34 | apache-2.0 | Scala | ||
A Spark library for Amazon SageMaker. | ||||||||||
Cc Pyspark | 280 | | | | a year ago | | | 4 | mit | Python |
Process Common Crawl data with Python and Spark | ||||||||||
Spark Jupyter Aws | 255 | | | | 6 years ago | | | 2 | | Jupyter Notebook |
A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support | ||||||||||
Repo 2019 | 135 | | | | 3 years ago | | | 1 | | Jupyter Notebook |
BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google Assistant + Raspberry Pi, Infrared, Google Cloud Platform Natural Language, Anomaly detection, Tensorflow, Mathematics | ||||||||||
Spark_python_ml_examples | 81 | | | | 4 years ago | | | | | Python |
Spark 2.0 Python Machine Learning examples | ||||||||||
Towardsdataengineering | 52 | | | | a year ago | | | 7 | | Python |
This repo contains commands that data engineers use in day-to-day work. | ||||||||||
Terraform Emr Pyspark | 46 | | | | 3 months ago | | | 2 | apache-2.0 | HCL |
Quickstart PySpark with Anaconda on AWS/EMR using Terraform | ||||||||||
Emr Bootstrap Pyspark | 43 | | | | 7 years ago | | | | mit | Python |
Quickstart PySpark with Anaconda on AWS/EMR |