Pravda

A Clojure-friendly event log processing library using S3 and Spark
Alternatives To Pravda
Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language
Goodreads_etl_pipeline | 593 | 4 years ago | mit | Python
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Pysparkling | 253 | 7 | 1 | 2 years ago | 69 | November 13, 2022 | 9 | other | Python
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
Rumble | 194 | a year ago | 4 | December 03, 2019 | 134 | other | Java
⛈️ RumbleDB 1.21.0 "Hawthorn blossom" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Geotrellis Chatta Demo | 44 | 6 years ago | 11 | JavaScript
Demo of GeoTrellis - weighted overlay and zonal summary for University of Tennessee at Chattanooga.
Etlflow | 43 | 1 | 1 | 10 months ago | 37 | July 19, 2023 | apache-2.0 | Scala
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for running complex Auditable workflows which can interact with Google Cloud Platform, AWS, Kubernetes, Databases, SFTP servers, On-Prem Systems and more.
Udacity Data Engineering | 42 | 4 years ago | 1 | Jupyter Notebook
Udacity Data Engineering Nano Degree (DEND)
Etl Light | 38 | 7 years ago | mit | Scala
A light Kafka to HDFS/S3 ETL library based on Apache Spark
Jobanalytics_and_search | 22 | 2 years ago | 8 | mit | Python
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Spark Movies Etl | 21 | 9 months ago | 2 | Python
Spark data pipeline that ingests and transforms movie ratings data.
Cloud Integration | 21 | a year ago | 4 | apache-2.0 | Scala
Spark cloud integration: tests, cloud committers and more
