Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for spark s3
s3
x
spark
x
21 search results found
Goodreads_etl_pipeline
⭐
593
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Pysparkling
⭐
253
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
Rumble
⭐
194
⛈️ RumbleDB 1.21.0 "Hawthorn blossom" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Geotrellis Chatta Demo
⭐
44
Demo of GeoTrellis - weighted overlay and zonal summary for University of Tennessee at Chattanooga.
Etlflow
⭐
43
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for running complex Auditable workflows which can interact with Google Cloud Platform, AWS, Kubernetes, Databases, SFTP servers, On-Prem Systems and more.
Udacity Data Engineering
⭐
42
Udacity Data Engineering Nano Degree (DEND)
Etl Light
⭐
38
A light Kafka to HDFS/S3 ETL library based on Apache Spark
Jobanalytics_and_search
⭐
22
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Spark Movies Etl
⭐
21
Spark data pipeline that ingests and transforms movie ratings data.
Cloud Integration
⭐
21
Spark cloud integration: tests, cloud committers and more
Edge Fuse
⭐
18
Edge-X S3 API, mount S3 bucket for full POSIX read/write access
S3 Inventory Usage Examples
⭐
17
Examples demonstrating how to use Amazon S3 Inventory to analyze your S3 storage using Spark and EMR.
Lambda Architecture Demo
⭐
14
Developing a Lambda Architecture pipeline using Apache Kafka, Spark Structured Streaming, Redshift, S3, Python
Spark Adaptive File Connector
⭐
13
Adaptive File Source Connector for Spark, optimised for reading from object stores
Nyc_taxi_pipeline
⭐
12
Design/Implement stream/batch architecture on NYC taxi data | #DE
Spark S3
⭐
11
Spark Plugin for Amazon S3
Spark History Server
⭐
10
Helm Chart for deploying Spark history server in Amazon EKS for S3 Spark Event Logs
Insight Zone Defense
⭐
8
One-click automation of big data pipeline with monitoring
Pravda
⭐
6
A clojure-friendly event log processing library using S3 and Spark
Udacity Data Engineering Projects
⭐
5
My solutions for the Udacity Data Engineering Nanodegree
Assortmentofjunitrules
⭐
5
An assortment of (for me anyway) useful JUnit rules.
Related Searches
Scala Spark (3,279)
Amazon Web Services S3 (2,551)
Python Spark (2,053)
Java Spark (1,587)
Javascript S3 (1,284)
Apache Spark (1,207)
Spark Hadoop (1,188)
Python S3 (1,171)
Jupyter Notebook Spark (1,151)
Amazon S3 (1,059)
1-21 of 21 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.