Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for java etl
etl
x
java
x
117 search results found
Airbyte
⭐
12,918
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Pentaho Kettle
⭐
7,194
Pentaho Data Integration ( ETL ) a.k.a Kettle
Kestra
⭐
5,257
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
Addax
⭐
1,034
Addax is a versatile open-source ETL tool that can seamlessly transfer data between various RDBMS and NoSQL databases, making it an ideal solution for data migration.
Tis
⭐
833
Support agile DataOps Based on Flink, DataX and Flink-CDC, Chunjun with Web-UI
Zingg
⭐
828
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Ananas Desktop
⭐
563
A hackable data integration & analysis tool to enable non technical users to edit data processing jobs and visualise data on demand.
Datacleaner
⭐
557
The premier open source Data Quality solution
Exchangis
⭐
401
Exchangis is a lightweight,highly extensible data exchange platform that supports data transmission between structured and unstructured heterogeneous data sources
Zdh_web
⭐
379
大数据采集,抽取平台,zdh_web是zdh系列服务的可视化管理平台,包含数据采集,调度,权限,审批
Smooks
⭐
377
Extensible data integration Java framework for building XML and non-XML fragment-based applications
Replicadb
⭐
304
ReplicaDB is open source tool for database replication, designed for efficiently transferring bulk data between relational and non-relational databases
Kafka Connect File Pulse
⭐
289
🔗 A multipurpose Kafka Connect connector that makes it easy to parse, transform and stream any file, in any format, into Apache Kafka
Orbital
⭐
255
Orbital automates integration between data sources (APIs, Databases, Queues and Functions). BFF's, API Composition and ETL pipelines that adapt as your specs change.
Extract
⭐
229
A cross-platform command line tool for parallelised content extraction and analysis.
Kettle Web
⭐
197
基于spring boot通过java代码调用kette
Metl
⭐
195
Metl is a simple, web-based integration platform that allows for several different styles of data integration including messaging, file based Extract/Transform/Load (ETL), and remote procedure invocation via Web Services. Read more at www.jumpmind.com/products/metl/overview
Bender
⭐
186
Bender - Serverless ETL Framework
Whiterabbit
⭐
158
WhiteRabbit is a small application that can be used to analyse the structure and contents of a database as preparation for designing an ETL. It comes with RabbitInAHat, an application for interactive design of an ETL to the OMOP Common Data Model with the help of the the scan report generated by White Rabbit.
Etl_unicorn
⭐
156
数据可视化, 数据挖掘, 数据处理 ETL
Hydrograph
⭐
138
A visual ETL development and debugging tool for big data
Hale
⭐
136
(Spatial) data harmonisation with hale studio (formerly HUMBOLDT Alignment Editor)
Etl
⭐
135
LinkedPipes ETL is an RDF based, lightweight ETL tool
Marklogic Data Hub
⭐
129
The MarkLogic Data Hub: documentation ==>
Neo4j Jdbc
⭐
122
JDBC driver for Neo4j
Sequenceiq Samples
⭐
119
SequenceIQ Hadoop examples
Fhir Data Pipes
⭐
107
A collection of tools for extracting FHIR resources and analytics services on top of that data.
Deta_cache
⭐
106
缓存cache服务器
Chombo
⭐
102
Big Data ETL and Utilities for Hadoop Map Reduce, Spark and Storm
Lsc
⭐
99
LSC engine
Batch Scheduler
⭐
96
Kettleinaction100
⭐
85
Kettle实战100篇博文
Scriptella Etl
⭐
75
Scriptella is an open source ETL (Extract-Transform-Load) and script execution tool written in Java
Dflib
⭐
71
In-memory Java DataFrame library
Mongosyphon
⭐
68
A tool for transferring data from a Relational Database to MongoDB
Bigquery Etl Dataflow Sample
⭐
62
Skaetl
⭐
55
Open Source ETL designed for and dedicated to Log processing and transformation
Dswarm
⭐
51
an open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wi
Mydataharbor
⭐
50
🇨🇳 MyDataHarbor是一个致力于解决任意数据源到任意数据源的分布式、高扩展性、高性能、事务级的数
Smartetl
⭐
45
A light weight ETL engine and smart transformation framework
Ruby For Pentaho Kettle
⭐
42
Ruby scripting for pentaho-kettle
Terminatooor
⭐
40
Brute force your OpenERP data integration with OOOR inside the Kettle ETL (aka Pentaho Data Integration - PDI)
Datasphere Integration
⭐
38
an data-centric integration platform
Flinkproj
⭐
36
Flink 案例开发数据清洗、数据报表
Link Move
⭐
32
A model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.
Orientdb Etl
⭐
32
OrientDB ETL tools
Loganalyzehelper
⭐
26
论坛日志分析系统清洗程序(包含IP规则库,UDF开发,MapReduce程序,日志数据)
Cvparser
⭐
24
CVparser is software for parsing or extracting data out of CV/resumes.
Vivo Harvester
⭐
24
An ETL tool for transferring data from traditional systems into Semantic Systems.
Cubed
⭐
24
Data Mart As A Service
Mongodb Rdbms Sync
⭐
23
Bi-directional data sync between Relational Databases (Oracle, MySql) & MongoDB.
Dqcs
⭐
22
数据质量控制系统
Zephyr
⭐
21
Zephyr is a big data, platform agnostic ETL API, with Hadoop MapReduce, Storm, and other big data bindings.
Jare
⭐
19
Java Business Rule Engine (Jare) - Sources and Jar file
Clickhouse Highlevel Sinker
⭐
18
clickhouse-highlevel-sinker
Dstream
⭐
18
Pyplyn
⭐
18
ETL tool that allows you to visualize the health of historical time-series data in real-time
Hadoop Etl Udfs
⭐
17
The Hadoop ETL UDFs are the main way to load data from Hadoop into EXASOL
Sql To Redis
⭐
17
Convert SQL tables to Redis as JSON
Maxwell Sink
⭐
16
consume maxwell generated message from kafka,export it to another mysql.
Datax Src
⭐
16
DataX 是异构数据广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、SqlServer、Postgre、HDFS、Hive、ADS、HBase 等各种异构数据源之间高效的数据同步功能。
Bi_project
⭐
14
一个简单的Hive项目,使用了Sqoop、Hadoop、Hive、MySQL,对电商数据进行分析
Dogetl
⭐
14
A lib to transform data from jdbc,csv,json to ecah other.
Kafka Connect Datagen
⭐
14
A Kafka Connect source connector that generates data for tests
Hop Gis Plugins
⭐
13
🗺 GIS plugins for Apache Hop Orchestration Platform
Experience Platform Etl Reference
⭐
13
Examples for ETL Integrations with Adobe Experience Platform
Skycloud Datax
⭐
13
基于阿里Datax改版web datax ,支持管理平台与restful风格API
Mongomovie
⭐
12
MongoDB movie data model, ETL loader, and queries.
Spark
⭐
12
Camus Compressor
⭐
12
Camus Compressor merges files created by Camus and saves them in a compressed format.
Marketing Data Connectors
⭐
11
Command line batch job that run java runtime environment to extract and load marketing data using Facebook Marketing API, Google Analytics API, Mailchimp API, Google Webmasters API, Google Sheets API, Mysql, Postgresql, Clickhouse, etc
Etlfit
⭐
11
FitNesses fixture to automate ETL testing.
Cdc
⭐
11
Community Distributed Cache
Tajo Cdh
⭐
10
Tajo is a distributed data warehouse system on Hadoop that provides low-latency and scalable ad-hoc queries and ETL on large-data sets stored on HDFS and other data sources. This repository is for another Tajo distribution based on CDH.
Datacooker Etl
⭐
10
Data transformation framework for ETL processing with SQL-like syntax and GIS extensions, based on Apache Spark
Dcc Release
⭐
10
Second generation of the ICGC DCC release ETL built on Spark
Univocity Api
⭐
9
uniVocity public API. This project contains interfaces and configuration classes used by our data integration solution. Learn more:
Jeql
⭐
9
A SQL-esque scripting language for spatial processing and ETL
Etl By Example
⭐
9
Gotz
⭐
8
Gotz - Heavy duty ETL to automate data extraction from tons of HTML pages
Hedera Etl
⭐
8
ETL scripts for Hedera Hashgraph
Flinksupport
⭐
8
Flink应用程序开发支持框架
Bigdatawarehouse
⭐
8
Flowetl
⭐
8
This is a framework designed for the creation of testable components which can be interconnected via arbitrary inputs and outputs and those components can be executed in the correct order (inputs satisfied before running) automatically. This is useful since it aids developers in thinking in the paradigm where they plan components ahead of time, allowing for simpler reuse and refactoring later. At yahoo! this was created for a ETL like framework & pipeline, but its applications are not limited t
Kafka Sqlreader Datax
⭐
8
A delta ETL flow trigger by BINLOG in kafka messages.
Coyote
⭐
8
Coyote Data Exchange Toolkit - Mediate or Integrate Anything
Etl Admin
⭐
7
ETL调度管理平台
Chompsky
⭐
7
An NLP pipeline for Wikia data
Lobid Resources
⭐
7
Transformation, web frontend, and API 2.0 for the hbz catalog as LOD
Contube
⭐
7
ConTube: A scalable data connector framework that facilitates efficient data transfer between diverse systems.
Alfresco Etl Connector
⭐
7
The ETL Connector extension for Alfresco allows mass import of documents in an Alfresco repository by using compatible ETL Tools (for now Talend). It also provides an ETL client library that makes it easy to integrate in any ETL tool.
Vau
⭐
7
Data Vault data model and ETL generator for Oracle Databases
Openmrs Module Mysqletl
⭐
7
This is the OpenMRS Module that will perform ETL from MySQL database to datawarehouse.
Spark Kafka Simple Consumer Receiver
⭐
7
Northstar
⭐
6
北极星数据管理中台
Coding Dojo
⭐
6
Coding Dojo
Preprocessor Sample
⭐
6
新版预处理的使用样例
Mailspider
⭐
6
Opensource ETL for {email,http,ftp} files retrieval, filtering (against a set of rules), tagging and uploading via http.
Etl Tool
⭐
6
ETL tool for a telecomm company's data warehouse
Scs Etl Demo
⭐
6
Spring Cloud Stream - Event Driven ETL reference examples
Related Searches
Java Spring (21,350)
Java Spring Boot (11,982)
Java Video Game (8,093)
Java Gradle (8,072)
Java Docker (6,180)
Java Database (6,015)
Java Mysql (5,954)
Java Sdk (5,864)
Javascript Java (5,468)
Java Rest (4,956)
1-100 of 117 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.