Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for shell etl
etl
x
shell
x
26 search results found
Etl With Airflow
⭐
1,053
ETL best practices with airflow, with examples
Open Semantic Search
⭐
741
Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)
Freebase Triples
⭐
123
A methodology to process triples data from the Freebase data dumps.
Open Data Etl Utility Kit
⭐
87
Use Pentaho's open source data integration tool (Kettle) to create Extract-Transform-Load (ETL) processes to update a Socrata open data portal. Documentation is available at http://open-data-etl-utility-kit.readthedocs.io/en
Flowman
⭐
85
Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pipelines.
Logstash Test Runner
⭐
82
Logstash configuration testing framework
Openrefine Batch
⭐
70
Shell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a python client that communicates with the OpenRefine API.
Redis Connect Dist
⭐
38
Real-Time Event Streaming & Change Data Capture
Aws Glue Docker
⭐
22
🐋 Docker image for AWS Glue Spark/Python
Cobol On K8s
⭐
18
Running an ETL pipeline with COBOL on Kubernetes
Websitetordf
⭐
17
A simple ETL pipeline for HTML+RDFa websites
Ds4fnp
⭐
15
Data Stacks For Fun & Nonprofits!
Bigetl
⭐
13
This project is a unified ETL platform that support various data processing technologies, including Spark, Hive, Hadoop, Python, Linux Shell script, etc.
Oesophagus
⭐
12
Enterprise Grade Single-Step Streaming Data Infrastructure Setup. (Under Development)
Infoq Kafka Ksql
⭐
12
Code samples to go with InfoQ article
Airflowjob
⭐
11
Airflow POC demo : 1) env set up 2) airflow DAG 3) Spark/ML pipeline | #DE
Marketing Data Connectors
⭐
11
Command line batch job that run java runtime environment to extract and load marketing data using Facebook Marketing API, Google Analytics API, Mailchimp API, Google Webmasters API, Google Sheets API, Mysql, Postgresql, Clickhouse, etc
Orcli
⭐
10
OpenRefine command-line interface written in Bash (💎+🤖). Supports batch processing (import, transform, export).
Food Plan Organizer
⭐
7
Stay healthy by checking nutrition facts & manage your favourite paleo diet recipes.
Mysql Ksql Etl Demo
⭐
7
A playground and demo for streaming etl on data imported from mysql. Based on confluent's ksql clickstream demo
Greenplum Streamsets
⭐
7
Greenplum with Streamsets
Gem
⭐
6
General ETL Machine, a customizable ETL framework built in Pentaho Data Integration (Kettle)
Forex Histdata Etl
⭐
6
Snowplow Gcp Template
⭐
6
Simple template for semiautomated Snowplow deployment and configuration.
Markets Etl
⭐
5
expressive clojure for finance
Aws Redshift Matillion Workshop
⭐
5
Scripts, Instructions and Materials for AWS Redshift and Matillion ETL workshop
Related Searches
Shell Docker (20,660)
Shell Script (15,351)
Shell Bash (10,338)
Shell Command Line (6,542)
Shell Dotfiles (5,338)
Shell Git (4,715)
Shell Ansible (4,427)
Shell Server (3,563)
Shell Ssh (3,562)
Shell Docker Image (3,406)
1-26 of 26 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.