Awesome Open Source

Search results for ingestion

20 search results found

Gobblin ⭐ 2,196

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.

Automatic_log_collector_and_analyzer ⭐ 345

Replace Splunk in your small company with this one weird trick!

Azure Event Hubs Spark ⭐ 225

Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs

Data Prepper ⭐ 210

Data Prepper is a component of the OpenSearch project that accepts, filters, transforms, enriches, and routes data at scale.

Service for bulk-loading data to databases with automatic schema management (Redshift, Snowflake, BigQuery, ClickHouse, Postgres, MySQL)

Use local files or a public GitHub repository as a source and ask ChatGPT questions about it

Rocket Bi ⭐ 79

A free, open-source, web-based self-service BI tool tailor-made for ClickHouse, Google BigQuery, MySQL, PostgreSQL, and Vertica

Net.jgp.labs.spark ⭐ 63

Apache Spark examples exclusively in Java

IBIS is a workflow creation engine that abstracts the Hadoop internals of ingesting RDBMS data.

Hyperdrive ⭐ 41

Extensible streaming ingestion pipeline on top of Apache Spark

Borrow Bot ⭐ 24

💰 A bot for maximizing the borrow subreddit

Parallel Streaming Transformation Loader

Read-only mirror of https://gitlab.com/sorcero/community/ingestum

Ingest.new ⭐ 8

A simple demo application for uploading, ingesting, and embedding videos and converting them to MP4s. From api.video (https://api.video)

👥 [WIP] An experimental Highly Available Reverse Proxy for Massive Asynchronous Message Consumption

Tagbase Server ⭐ 7

tagbase-server is a data management web service for working with eTUFF and nc-eTAG files.

Zmon Data Service ⭐ 6

Receiving end of the new worker that pushes data across data center (DC) boundaries

Fast and sustainable Elasticsearch ingestion, migration, and cloning

Net.jgp.books.spark.ch09 ⭐ 5

Spark in Action, 2e - chapter 9 - Advanced ingestion: finding data sources and building your own

Foundry Es ⭐ 5

Biocaddie Data Processing Pipeline. A data ingestion pipeline that collects and transforms original metadata information to a unified metadata model, called DatA Tag Suite (DATS).
