Awesome Open Source
Search results for pipeline pyspark
15 search results found
Mleap
⭐
1,479
MLeap: Deploy ML Pipelines to Production
Butterfree
⭐
269
A tool for building feature stores.
Morphl Community Edition
⭐
233
MorphL Community Edition uses big data and machine learning to predict user behaviors in digital products and services with the end goal of increasing KPIs (click-through rates, conversion rates, etc.) through personalization
Learn By Examples
⭐
72
Real-world Spark pipelines examples
Jgit Spark Connector
⭐
67
jgit-spark-connector is a library for running scalable data retrieval pipelines that process any number of Git repositories for source code analysis.
Spark Nba Analytics
⭐
41
Analyzing NBA data using Spark 2.1
Basin
⭐
29
Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser
Sparkdltrigger
⭐
28
Repo for the article "Machine Learning Pipelines with Modern Big Data Tools for High Energy Physics"
Odsc_india_2018
⭐
26
My presentation at ODSC India 2018 about Deep Learning with Apache Spark
Spark Movies Etl
⭐
21
Spark data pipeline that ingests and transforms movie ratings data.
Pyspark_dl_pipeline
⭐
17
Sparklanes
⭐
16
A lightweight data processing framework for Apache Spark
Lineage
⭐
14
Generate beautiful documentation for your data pipelines in markdown format
Pipeasy Spark
⭐
14
An easy way to define a preprocessing data pipeline (similar to sklearn-pandas, but for Spark ML)
Nyc_taxi_pipeline
⭐
12
Design/Implement stream/batch architecture on NYC taxi data | #DE
Dagster Graph Project
⭐
12
Repo demonstrating a Dagster pipeline to generate Neo4j Graph
Databricks Connect Pyspark
⭐
9
A guide to building good data pipelines with Databricks Connect using best practices
Airflow
⭐
8
This set of code and instructions is intended to instantiate an environment built from a set of Docker images (Airflow webserver, Airflow scheduler, PostgreSQL, PySpark) and to run a data pipeline that consumes data from a weather API, processes it with PySpark, and stores it in PostgreSQL
Pyspark Boilerplate Mehdio
⭐
7
PySpark boilerplate for running a production-ready data pipeline
Aws Etl
⭐
7
An ETL application on AWS that uses general open sales and customer data, available here: https://github.com/camposvinicius/data/blob/main/A (a zipped file containing several CSV files, to which transformations are applied).
Morphl Model User Search Intent
⭐
7
Google Cloud Storage connector, pre-processor and model for predicting user search intent based on keywords
Machine Learning Pipeline Lr Pyspark
⭐
5
Power Plant ML Pipeline Application - Apache Spark
Related Searches
Python Pipeline (4,284)
Javascript Pipeline (1,369)
Pipeline Jenkins (1,150)
Shell Pipeline (1,143)
Docker Pipeline (1,034)
Jupyter Notebook Pipeline (976)
Java Pipeline (868)
Spark Pyspark (773)
Python Pyspark (689)
Copyright 2018-2024 Awesome Open Source. All rights reserved.