Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Goodreads_etl_pipeline | 593 | 4 years ago | mit | Python | ||||||
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform. | ||||||||||
Pysparkling | 253 | 7 | 1 | a year ago | 69 | November 13, 2022 | 9 | other | Python | |
A pure Python implementation of Apache Spark's RDD and DStream interfaces. | ||||||||||
Rumble | 194 | a year ago | 4 | December 03, 2019 | 134 | other | Java | |||
⛈️ RumbleDB 1.21.0 "Hawthorn blossom" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more | ||||||||||
Geotrellis Chatta Demo | 44 | 6 years ago | 11 | JavaScript | ||||||
Demo of GeoTrellis - weighted overlay and zonal summary for University of Tennessee at Chattanooga. | ||||||||||
Etlflow | 43 | 11 | 9 months ago | 37 | July 19, 2023 | apache-2.0 | Scala | |||
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for running complex Auditable workflows which can interact with Google Cloud Platform, AWS, Kubernetes, Databases, SFTP servers, On-Prem Systems and more. | ||||||||||
Udacity Data Engineering | 42 | 4 years ago | 1 | Jupyter Notebook | ||||||
Udacity Data Engineering Nano Degree (DEND) | ||||||||||
Etl Light | 38 | 7 years ago | mit | Scala | ||||||
A light Kafka to HDFS/S3 ETL library based on Apache Spark | ||||||||||
Jobanalytics_and_search | 22 | 2 years ago | 8 | mit | Python | |||||
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters. | ||||||||||
Spark Movies Etl | 21 | 8 months ago | 2 | Python | ||||||
Spark data pipeline that ingests and transforms movie ratings data. | ||||||||||
Cloud Integration | 21 | a year ago | 4 | apache-2.0 | Scala | |||||
Spark cloud integration: tests, cloud committers and more |