Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python pipeline
pipeline
x
python
x
1,518 search results found
Jina
⭐
19,573
☁️ Build multimodal AI applications with cloud-native stack
Luigi
⭐
17,046
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Prefect
⭐
14,603
Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines
Airbyte
⭐
12,918
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Dagster
⭐
9,467
An orchestration platform for the development, production, and observation of data assets.
Kedro
⭐
9,353
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
Great_expectations
⭐
9,179
Always know what to expect from your data.
Beam
⭐
7,355
Apache Beam is a unified programming model for Batch and Streaming data processing.
Stanza
⭐
6,931
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
Mage Ai
⭐
6,324
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
Awesome Pipeline
⭐
5,752
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
Papermill
⭐
5,513
📚 Parameterize, execute, and analyze notebooks
Gaia
⭐
5,099
Build powerful pipelines in any programming language.
Taipy
⭐
4,311
Turns Data and AI algorithms into production-ready web applications in no time.
Orchest
⭐
3,876
Build data pipelines, the easy way 🛠️
Datascienceresources
⭐
3,826
Open Source Data Science Resources.
Jenkins Zero To Hero
⭐
3,748
Install Jenkins, configure Docker as slave, set up cicd, deploy applications to k8s using Argo CD in GitOps way.
Pipelines
⭐
3,368
Machine Learning Pipelines for Kubeflow
Ploomber
⭐
3,318
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
Opensfm
⭐
3,122
Open source Structure-from-Motion pipeline
Marimo
⭐
3,037
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
Towhee
⭐
2,903
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Hmmlearn
⭐
2,892
Hidden Markov Models in Python, with scikit-learn like API
Professional Services
⭐
2,635
Common solutions and tools developed by Google Cloud's Professional Services team. This repository and its contents are not an officially supported Google product.
Blenderproc
⭐
2,402
A procedural Blender pipeline for photorealistic training image generation
Zero_nlp
⭐
2,248
中文nlp解决方案(大模型、数据、模型、训练、推理)
Pyfunctional
⭐
2,232
Python library for creating data pipelines with chain functional programming
Mara Pipelines
⭐
2,053
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Tfx
⭐
2,051
TFX is an end-to-end platform for deploying production ML pipelines
Elyra
⭐
1,721
Elyra extends JupyterLab with an AI centric approach.
Automl Gs
⭐
1,642
Provide an input CSV and a target field to predict, generate a model + code to run it.
Pyslam
⭐
1,605
pySLAM contains a monocular Visual Odometry (VO) pipeline in Python. It supports many modern local features based on Deep Learning.
Vdp
⭐
1,556
💧 Instill VDP (Versatile Data Pipeline) is an open-source tool to seamlessly integrate AI to process unstructured data in the modern data stack
Roadtools
⭐
1,540
A collection of Azure AD tools for offensive and defensive security purposes
Mleap
⭐
1,479
MLeap: Deploy ML Pipelines to Production
Meltano
⭐
1,460
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
Pytorch Toolbelt
⭐
1,458
PyTorch extensions for fast R&D prototyping and Kaggle farming
Pypeln
⭐
1,412
Concurrent data pipelines in Python >>>
Mlbox
⭐
1,403
MLBox is a powerful Automated Machine Learning python library.
Transcoder
⭐
1,360
Public release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf
Saas Boilerplate
⭐
1,358
SaaS Boilerplate - Open Source and free SaaS stack that lets you build SaaS products faster in React, Django and AWS. Focus on essential business logic instead of coding repeatable features!
Deepnlp
⭐
1,311
Deep Learning NLP Pipeline implemented on Tensorflow
Deepstream_python_apps
⭐
1,247
DeepStream SDK Python bindings and sample applications
Galaxy
⭐
1,211
Data intensive science for everyone.
Mlrun
⭐
1,177
Machine Learning automation and tracking
Tods
⭐
1,078
TODS: An Automated Time-series Outlier Detection System
Bk Sops
⭐
1,012
蓝鲸智云标准运维(SOPS)
Deepaudioclassification
⭐
983
Finding the genre of a song with Deep Learning
Renderpipeline
⭐
929
Physically Based Shading and Deferred Rendering for the Panda3D game engine
Sematic
⭐
913
An open-source ML pipeline development platform
Toil
⭐
860
A scalable, efficient, cross-platform (Linux/macOS) and easy-to-use workflow engine in pure Python.
Couler
⭐
847
Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow.
Serving
⭐
834
A flexible, high-performance carrier for machine learning models(『飞桨』服务化部署框架)
Klio
⭐
822
Smarter data pipelines for audio.
Lightautoml
⭐
769
LAMA - automatic model creation framework
Syntax_sugar_python
⭐
730
A library adding some anti-Pythonic syntatic sugar to Python
Rain
⭐
713
Framework for large distributed pipelines
Neumai
⭐
693
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
Trankit
⭐
693
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing
Evalml
⭐
679
EvalML is an AutoML library written in python.
Sklearn2pmml
⭐
674
Python library for converting Scikit-Learn pipelines to PMML
Houdini
⭐
649
Houdini pipeline and learning database
Openisp
⭐
633
Image Signal Processor
Image_pipeline
⭐
624
An image processing pipeline for ROS.
Covalent
⭐
608
Pythonic tool for running machine-learning/high performance/quantum-computing workflows in heterogeneous environments.
Goodreads_etl_pipeline
⭐
593
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Data Pipelines With Apache Airflow
⭐
587
Code for Data Pipelines with Apache Airflow
Baikal
⭐
573
A graph-based functional API for building complex scikit-learn pipelines.
Socorro
⭐
573
Socorro is the Mozilla crash ingestion pipeline. It accepts and processes Breakpad-style crash reports. It provides analysis tools.
Devsecopsguideline
⭐
567
The OWASP DevSecOps Guideline can help us to embedding security as a part of the development pipeline.
Pylivetrader
⭐
563
Python live trade execution library with zipline interface.
Pypyr
⭐
560
pypyr task-runner cli & api for automation pipelines. Automate anything by combining commands, different scripts in different languages & applications into one pipeline process.
Paddleocr2pytorch
⭐
553
PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/Paddle
Monk_object_detection
⭐
550
A one-stop repository for low-code easily-installable object detection pipelines.
Zr Obp
⭐
542
Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation
Python Opengl
⭐
538
An open access book on Python, OpenGL and Scientific Visualization, Nicolas P. Rougier, 2018
Neuraxle
⭐
533
The world's cleanest AutoML library ✨ - Do hyperparameter tuning with the right pipeline abstractions to write clean deep learning production pipelines. Let your pipeline steps have hyperparameter spaces. Design steps in your pipeline like components. Compatible with Scikit-Learn, TensorFlow, and most other libraries, frameworks and MLOps environments.
Pipelinec
⭐
519
A C-like hardware description language (HDL) adding high level synthesis(HLS)-like automatic pipelining as a language construct/compiler feature.
Kale
⭐
517
Kubeflow’s superfood for Data Scientists
Vehicle Detection
⭐
509
Created vehicle detection pipeline with two approaches: (1) deep neural networks (YOLO framework) and (2) support vector machines ( OpenCV + HOG).
Sklearn Onnx
⭐
491
Convert scikit-learn models and pipelines to ONNX
Mario
⭐
487
Powerful Python pipelines for your shell
Torchgpipe
⭐
479
A GPipe implementation in PyTorch
Openfda
⭐
471
openFDA is a research project to provide open APIs, raw data downloads, documentation and examples, and a developer community for an important collection of FDA public datasets.
Whispers
⭐
457
Identify hardcoded secrets in static structured text
Open Solution Home Credit
⭐
444
Open solution to the Home Credit Default Risk challenge 🏡
Cnvkit
⭐
435
Copy number variant detection from targeted DNA sequencing
Aspect Based Sentiment Analysis
⭐
413
💭 Aspect-Based-Sentiment-Analysis: Transformer & Explainable ML (TensorFlow)
Wordbatch
⭐
413
Python library for distributed AI processing pipelines, using swappable scheduler backends.
Aws Lambda Handler Cookbook
⭐
399
This repository provides a working, deployable, open source-based, serverless service template with an AWS Lambda function and AWS CDK Python code with all the best practices and a complete CI/CD pipeline.
Versatile Data Kit
⭐
389
One framework to develop, deploy and operate data workflows with Python and SQL.
Singlehdr
⭐
385
[CVPR 2020] Single-Image HDR Reconstruction by Learning to Reverse the Camera Pipeline
Model Card Toolkit
⭐
381
A toolkit that streamlines and automates the generation of model cards
Megflow
⭐
380
Efficient ML solution for long-tailed demands.
Recon Pipeline
⭐
374
An automated target reconnaissance pipeline.
Cookiecutter Fastapi
⭐
370
Cookiecutter template for FastAPI projects using: Machine Learning, Poetry, Github Actions and Pytests
Open Solution Mapping Challenge
⭐
363
Open solution to the Mapping Challenge 🌎
Pyterrier
⭐
359
A Python framework for performing information retrieval experiments, building on http://terrier.org/
Bodywork Core
⭐
358
ML pipeline orchestration and model deployments on Kubernetes, made really easy.
Karton
⭐
353
Distributed malware processing framework based on Python, Redis and S3.
Related Searches
Python Machine Learning (20,195)
Python Jupyter Notebook (17,055)
Python Docker (14,113)
Python Tensorflow (13,736)
Python Deep Learning (13,092)
Python Database (10,521)
Python Natural Language Processing (9,064)
Python Artificial Intelligence (8,580)
Python Pytorch (7,877)
Python Server (7,793)
1-100 of 1,518 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.