Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for apache etl
apache
x
etl
x
24 search results found
Airflow
⭐
34,468
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Kafka Connect File Pulse
⭐
289
🔗 A multipurpose Kafka Connect connector that makes it easy to parse, transform and stream any file, in any format, into Apache Kafka
Airflow_for_beginners
⭐
166
Spark Etl
⭐
62
Apache Spark based ETL Engine
Datasphere Integration
⭐
38
an data-centric integration platform
Spark Ref Architecture
⭐
38
Reference Architectures for Apache Spark
Sope
⭐
37
Apache Spark ETL Utilities
Avro
⭐
27
Apache AVRO for go
Iati Datastore
⭐
23
An open-source datastore for IATI data with RESTful web API providing XML, JSON, CSV plus ETL tools
Airflow Snowflake
⭐
22
Code to be contributed to the Apache Airflow (incubating) project for ETL workflow management for integrating with the Snowflake Data Warehouse.
Sparklanes
⭐
16
A lightweight data processing framework for Apache Spark
Etllib
⭐
16
This is the ETL lib package. It provides an API to munge and prepare JSON, TSV and other data using Apache Tika and JSON parsing/loading for ETL via Apache OODT (or other libs) into Apache Solr.
Bigdata Tech Index
⭐
16
Big Data Technology Index
Mqtt To Kafka Bridge
⭐
15
Move your messages from MQTT to Apache Kafka in real-time 🚀
Hadoop Data Ingestion Tool
⭐
15
OLAP and ETL of Big Data
Etl Airflow S3
⭐
14
ETL of newspaper article keywords using Apache Airflow, Newspaper3k, Quilt T4 and AWS S3
Pyspark Atlas
⭐
11
PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection
Awesome Dataops
⭐
10
Awesome list of dataops products, open source and resources
Flowetl
⭐
8
This is a framework designed for the creation of testable components which can be interconnected via arbitrary inputs and outputs and those components can be executed in the correct order (inputs satisfied before running) automatically. This is useful since it aids developers in thinking in the paradigm where they plan components ahead of time, allowing for simpler reuse and refactoring later. At yahoo! this was created for a ETL like framework & pipeline, but its applications are not limited t
Apache Spark Etl Pipeline Example
⭐
8
Demonstration of using Apache Spark to build robust ETL pipelines while taking advantage of open source, general purpose cluster computing.
Teleporter
⭐
6
Automatically synchronizes any database in RDBMS to OrientDB database. Open Source Project - Apache 2 license.
Dcat Ap Viewer
⭐
6
Viewer of DCAT-AP 2.0.1 compatible dataset metadata
Awesome Airflow
⭐
6
A curated list of awesome Airflow platform, links, tips and resources
Datafastlane
⭐
5
Data in the Fast Lane is a powerful and extensible ETL that leverages Apache Spark.
Related Searches
Java Apache (4,331)
Php Apache (2,627)
Javascript Apache (1,555)
Python Apache (1,541)
Shell Apache (1,492)
Docker Apache (1,277)
Apache Spark (1,207)
Mysql Apache (961)
Apache Kafka (836)
Python Etl (814)
1-24 of 24 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.