Awesome Open Source
Search results for ingestion
20 search results found
Gobblin
⭐ 2,196
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
Automatic_log_collector_and_analyzer
⭐ 345
Replace Splunk in your small company with this one weird trick!
Azure Event Hubs Spark
⭐ 225
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Data Prepper
⭐ 210
Data Prepper is a component of the OpenSearch project that accepts, filters, transforms, enriches, and routes data at scale.
Bulker
⭐ 92
Service for bulk-loading data to databases with automatic schema management (Redshift, Snowflake, BigQuery, ClickHouse, Postgres, MySQL)
7 Docs
⭐ 87
Use local files or a public GitHub repository as a source and ask ChatGPT questions about it
Rocket Bi
⭐ 79
A free, open-source, web-based self-service BI tool tailor-made for ClickHouse, Google BigQuery, MySQL, PostgreSQL, and Vertica
Net.jgp.labs.spark
⭐ 63
Apache Spark examples exclusively in Java
Ibis
⭐ 44
IBIS is a workflow creation engine that abstracts the Hadoop internals of ingesting RDBMS data.
Hyperdrive
⭐ 41
Extensible streaming ingestion pipeline on top of Apache Spark
Borrow Bot
⭐ 24
💰 A bot for maximizing the borrow subreddit
Pstl
⭐ 10
Parallel Streaming Transformation Loader
Ingestum
⭐ 8
Read-only mirror of https://gitlab.com/sorcero/community/ingestum
Ingest.new
⭐ 8
A simple demo application for uploading, ingesting, and embedding videos and converting them to MP4s. From api.video (https://api.video)
Crowd
⭐ 7
👥 [WIP] An experimental Highly Available Reverse Proxy for Massive Asynchronous Message Consumption
Tagbase Server
⭐ 7
tagbase-server is a data management web service for working with eTUFF and nc-eTAG files.
Zmon Data Service
⭐ 6
Receiving end of new worker to push data across DC boundaries
Deluge
⭐ 6
Fast and sustainable Elasticsearch ingestion, migration, and cloning
Net.jgp.books.spark.ch09
⭐ 5
Spark in Action, 2e - chapter 9 - Advanced ingestion: finding data sources and building your own
Foundry Es
⭐ 5
Biocaddie Data Processing Pipeline. A data ingestion pipeline that collects and transforms original metadata information to a unified metadata model, called DatA Tag Suite (DATS).
Copyright 2018-2024 Awesome Open Source. All rights reserved.