Awesome Open Source
Search results for pipeline apache spark
18 search results found
Transmogrifai
⭐
2,099
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Goodreads_etl_pipeline
⭐
593
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Sparkflow
⭐
301
An easy-to-use library for running TensorFlow on Apache Spark
Sparktorch
⭐
297
Train and run PyTorch models on Apache Spark.
Cuelake
⭐
266
Use SQL to build ELT pipelines on a data lakehouse.
Whylogs Java
⭐
179
Profile and monitor your ML data pipeline end-to-end
Envelope
⭐
133
Build configuration-driven ETL pipelines on Apache Spark
Qstreaming
⭐
89
A simplified, lightweight ETL pipeline framework for building stream/batch processing applications on top of Apache Spark
Learn By Examples
⭐
72
Real-world Spark pipeline examples
Lighthouse
⭐
54
Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines and apply best practices.
Aardpfark
⭐
47
A library for exporting Spark ML models and pipelines to PFA
Sparkplug
⭐
42
A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.
Hyperdrive
⭐
41
Extensible streaming ingestion pipeline on top of Apache Spark
Deep Learning Pyspark
⭐
40
Deep Learning with Apache Spark and Deep Cognition
Sparkxgb
⭐
40
R interface for XGBoost on Spark
Spark Flow
⭐
32
Library for organizing batch processing pipelines in Apache Spark
Odsc_india_2018
⭐
26
My presentation at ODSC India 2018 about Deep Learning with Apache Spark
Sparklanes
⭐
16
A lightweight data processing framework for Apache Spark
Graphsense Transformation
⭐
15
GraphSense Transformation Pipeline
Data Factory R Server Apache Spark Pipeline
⭐
15
This tutorial shows how to build a scalable, machine-learning-based data processing pipeline using Microsoft R Server with Apache Spark and Azure Data Factory (ADF)
Deepvariant On Spark
⭐
11
DeepVariant-on-Spark is a germline short variant calling pipeline that runs Google DeepVariant on Apache Spark at scale.
Stackexchange Spark Scala Analyser
⭐
10
Still in Beta
Spark Pipeline
⭐
9
Example End-to-End Data Pipeline with Apache Spark from Data Analysis to Data Product
Spark Pipeline
⭐
9
Machine learning pipeline for Apache Spark
Apache Spark Etl Pipeline Example
⭐
8
Demonstration of using Apache Spark to build robust ETL pipelines while taking advantage of open source, general purpose cluster computing.
Morenlp
⭐
6
Capabilities of StanfordNLP and OpenNLP on Spark
Physonline
⭐
5
PhysOnline: An Open Source Machine Learning Pipeline for Real-Time Analysis of Waveform Data
Machine Learning Pipeline Lr Pyspark
⭐
5
Power Plant ML Pipeline Application - Apache Spark
Copyright 2018-2024 Awesome Open Source. All rights reserved.