Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for pipeline big data
big-data
x
pipeline
x
12 search results found
Beam
⭐
7,355
Apache Beam is a unified programming model for Batch and Streaming data processing.
Pachyderm
⭐
6,035
Data-Centric Pipelines and Data Versioning
Dataflowjavasdk
⭐
853
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Tez
⭐
446
Apache Tez
Smooks
⭐
377
Extensible data integration Java framework for building XML and non-XML fragment-based applications
Dataengineering Roadmap
⭐
297
Un repositorio más con conceptos básicos, desafíos técnicos y recursos sobre ingeniería de datos en español 🧙✨
Big Data Rosetta Code
⭐
283
Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code
Shifu
⭐
235
An end-to-end machine learning and data mining framework on Hadoop
Setl
⭐
177
A simple Spark-powered ETL framework that just works 🍺
Mobydq
⭐
175
🐳 Tool to automate data quality checks on data pipelines
Incubator Liminal
⭐
131
Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow.
Crunch
⭐
100
Mirror of Apache Crunch (Incubating)
Falcon
⭐
95
Mirror of Apache Falcon
Ni
⭐
81
Say "ni" to data of any size
Automaticanalysis
⭐
70
Automatic Analysis (aa)
Banias
⭐
37
Opinionated serverless event analytics pipeline
Nifi
⭐
32
Deploy a secured, clustered, auto-scaling NiFi service in AWS.
Lidbox
⭐
26
End-to-end spoken language identification out of the box. Rewrite in progress for first release (version 1).
Hazelcast Jet Contrib
⭐
19
Extension modules for Hazelcast Jet
Real Time Stock Analyzer
⭐
16
Bigdata Pipeline
Snowplow Pipeline
⭐
11
End-to-end Snowplow Analytics Pipeline for real time events
Lambdaconf 2017 Bigdata
⭐
9
Materials for "Big Data Pipelines with Scala" Workshop at LambdaConf 2017
Airflow Data Pipeline
⭐
7
Udacity Data Engineer Nanodegree - Airflow data pipeline
Opiec Pipeline
⭐
7
Diagnosisextraction_ml
⭐
6
Pipeline for building Machine Learning Classifiers for the diagnosis of EHR text-data. We used this pipeline for our study, published here: https://doi.org/10.2196/23930.
Related Searches
Python Pipeline (4,255)
Javascript Pipeline (1,369)
Pipeline Jenkins (1,150)
Shell Pipeline (1,143)
Docker Pipeline (1,018)
Jupyter Notebook Pipeline (976)
Java Pipeline (868)
1-12 of 12 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.