Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Devops Python Tools | 709 | 3 months ago | 37 | mit | Python | |||||
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc. | ||||||||||
Gather Deployment | 347 | 8 months ago | mit | Jupyter Notebook | ||||||
Gathers Python deployment, infrastructure and practices. | ||||||||||
Spark Standalone Cluster On Docker | 311 | a year ago | 16 | mit | Jupyter Notebook | |||||
Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. :zap: | ||||||||||
Hunter | 170 | 3 years ago | mit | Jupyter Notebook | ||||||
A threat hunting / data analysis environment based on Python, Pandas, PySpark and Jupyter Notebook. | ||||||||||
Mastering Big Data Analytics With Pyspark | 118 | a year ago | 6 | mit | Jupyter Notebook | |||||
Mastering Big Data Analytics with PySpark, Published by Packt | ||||||||||
Movalytics Data Warehouse | 114 | 4 years ago | Python | |||||||
Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow | ||||||||||
Pysparkgeoanalysis | 60 | 7 years ago | 3 | Jupyter Notebook | ||||||
:globe_with_meridians: Interactive Workshop on GeoAnalysis using PySpark | ||||||||||
Big_data | 55 | 4 months ago | mit | Jupyter Notebook | ||||||
Tutorials on Big Data essentials: Hadoop, MapReduce, Spark. | ||||||||||
Towardsdataengineering | 52 | a year ago | 7 | Python | ||||||
This repo contains commands that data engineers use in day to day work. | ||||||||||
Smv | 41 | 4 years ago | 10 | September 19, 2019 | 73 | apache-2.0 | Python | |||
Spark Modularized View |