Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for dataset pipeline
dataset
x
pipeline
x
51 search results found
Zr Obp
⭐
542
Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation
Building Machine Learning Pipelines
⭐
403
Code repository for the O'Reilly publication "Building Machine Learning Pipelines" by Hannes Hapke & Catherine Nelson
Bactopia
⭐
281
A flexible pipeline for complete analysis of bacterial genomes
Dffml
⭐
235
The easiest way to use Machine Learning. Mix and match underlying ML libraries and data set sources. Generate new datasets or modify existing ones with ease.
Batchflow
⭐
195
BatchFlow helps you conveniently work with random or sequential batches of your data and define data processing and machine learning workflows even for datasets that do not fit into memory.
Whylogs Java
⭐
179
Profile and monitor your ML data pipeline end-to-end
Pureml
⭐
174
Developer platform for production ML.
Setl
⭐
173
A simple Spark-powered ETL framework that just works 🍺
Mlx
⭐
155
Machine Learning eXchange (MLX). Data and AI Assets Catalog and Execution Engine
Emotion_dataset
⭐
140
😄 Dataset for Emotion Classification
Biojupies
⭐
94
Automated generation of tailored bioinformatics Jupyter Notebooks via a user interface.
Autoalbument
⭐
89
AutoML for image augmentation. AutoAlbument uses the Faster AutoAugment algorithm to find optimal augmentation policies. Documentation - https://albumentations.ai/docs/autoalbument/
Keras Lstm Trajectory Prediction
⭐
81
A Keras multi-input multi-output LSTM-based RNN for object trajectory forecasting
Test Datasets
⭐
81
Test data to be used for automated testing with the nf-core pipelines
Dlp Dataflow Deidentification
⭐
80
Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP
Understaing Datasets Estimators Tfrecords
⭐
75
Try to use tf.estimator and tf.data together to train a cnn model.
Oboe
⭐
70
An AutoML pipeline selection system to quickly select a promising pipeline for a new dataset.
Bigquery
⭐
62
BigQuery import and processing pipelines
M2g
⭐
56
NeuroData's MRI to Graphs (m2g) - structural connectome estimation package and pipeline
Tensorflow Input Pipeline
⭐
54
TensorFlow Input Pipeline Examples based on multi-thread and FIFOQueue
Tensorflow Input Pipelines
⭐
53
TensorFlow input pipelines for multiple datasets for easy data fetching
Creativeflow
⭐
47
Code accompanying the Creative Flow+ Dataset, CVPR 2019.
Street View House Numbers Svhn Detection And Classification Using Cnn
⭐
38
A 2-CNN pipeline to do both detection (using bounding box regression) and classification of numbers on SVHN dataset.
Dlinputs
⭐
37
Input pipelines for large scale, sharded training of deep learning models.
Preprocessy
⭐
36
Python package for Customizable Data Preprocessing Pipelines
Faze_preprocess
⭐
35
Preprocessing pipeline for the MPIIGaze and GazeCapture datasets for evaluations for Faze
Classification
⭐
35
Catalyst.Classification
Cifar 10
⭐
32
Use the famous CIFAR-10 dataset to train a multi-layer neural network to recognize images of cats, dogs, and other things.
Dreem Learning Open
⭐
32
Benchmark code for the paper: "Dreem Open Datasets: Multi-Scored Sleep Datasets to compare Human and Automated sleep staging"
Carrada_dataset
⭐
29
Mnist
⭐
27
Handwritten digit recognizer using a feed-forward neural network and the MNIST dataset of 70,000 human-labeled handwritten digits.
Kraps Haskell
⭐
26
Experimental Haskell bindings to Spark Datasets and DataFrames
Smashed
⭐
26
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batching, and more. Supports datasets from Huggingface, torchdata iterables, or simple lists of dictionaries.
Sa Babi
⭐
21
sa-bAbI is a software assurance dataset generator similar to the natural language dataset generator
Gutenberg Dialog
⭐
20
Build a dialog dataset from online books in many languages
Driblet
⭐
20
Setka
⭐
18
Utilities for Neural Network training
Credit
⭐
18
An example project that predicts risk of credit card default using a Logistic Regression classifier and a 30,000 sample dataset.
Har
⭐
18
Recognize one of six human activities such as standing, sitting, and walking using a Softmax Classifier trained on mobile phone sensor data.
Kedro Introduction Tutorial
⭐
17
It's the Complete Beginner's Guide to Kedro! See the video here: https://youtu.be/x97ChYDd12U
Simpleml
⭐
17
Machine learning that just works, for effortless production applications
Obfuscated Code2vec
⭐
16
Code for the paper "Embedding Java Classes with code2vec: Improvements from Variable Obfuscation" in MSR 2020
Nnfabrik
⭐
15
A generalized model fitting pipeline that houses models, trainers, and datasets in datajoint and returns as well as stores trained models.
Juice
⭐
15
Code for generating the JuICe dataset.
Vgg_face_search
⭐
15
(MIRROR) Face finding engine that runs on a local service. Includes a pipeline for preprocessing a user-defined image dataset.
L1000 Bayesian
⭐
13
L1000 peak deconvolution based on Bayesian analysis
Kata Clean Machine Learning From Dirty Code
⭐
13
A coding exercise: let's convert dirty machine learning code into clean code using a Pipeline - which is the Pipe and Filter Design Pattern applied to Machine Learning.
Apolloscape Sfm
⭐
11
C++ Structure from Motion (SfM) pipeline with OpenGL visualization for Apolloscape Dataset
Actk
⭐
11
Automated Cell Toolkit
Maven
⭐
11
Maven provides easy access to open datasets in both raw and model-ready formats.
Face Recognition Pipeline
⭐
11
Pipeline for training face recognition models (based on pytorch 1.1)
Dsbox Ta2
⭐
10
The DSBox TA2 component
Kedro Wings
⭐
10
Kedro Wings automatically creates catalog entries to simplify Kedro pipeline writing.
Assnake
⭐
10
Snakemake based framework for NGS data analysis and management
Smart
⭐
9
Semi-manual alignment to reference templates (SMART): An open source pipeline in R for whole brain mapping
Deepframework
⭐
9
A python framework for Deep Learning projects
Xanthus
⭐
9
System traces dataset generation tool.
Spectral_jaccard_similarity
⭐
8
Analysis Driver
⭐
8
Pipelines for Illumina HiSeqX demultiplexing, sequence QC and variant calling.
Ebola Predictor
⭐
8
Prediction pipeline to generate prognosis predictors for Ebola Virus Disease
Bowinversion
⭐
8
Visualizing quantization effects in common bag-of-visual-words representations fro images.
Pytorch Pipeline
⭐
8
🎯Simple ETL Framework for PyTorch
Expanda
⭐
8
The universal integrated corpus-building environment.
Datapackage Pipelines Datahub
⭐
8
Datahub Extensions for datapackage-pipelines
Nba Attendance Prediction
⭐
7
Attendance prediction tool for NBA games using machine learning. Full pipeline implemented in Python from data ingestion to prediction. Attained mean absolute error of around 800 people (about 5% capacity) on test set.
Medip Seq
⭐
7
A set of scripts and functions for analyzing MeDIP-seq datasets
Magie
⭐
7
Interpret all the models - a genetic optimization approach to model agnostic black box explanations based on MAGIX.
Am Pipeline
⭐
7
Simple data pipeline showing how to create full text search over some dataset
Multiassayexperiment.tcga
⭐
7
Sordi Data Pipeline Reader
⭐
7
SORDI dataset has per frame annotation file in json format. Following tools create a COCO style annotation out of it. Thus the SORDI data can be easily fed into COCO style training pipelines.
Colombia_covid_19_pipe
⭐
7
Pipeline to get data sources from Instituto Nacional de Salud - INS related to Covid19 cases daily report in Colombia to create datasets.
Article Microarrays
⭐
6
Direct integration of microarray data — R scripts for comparing different microarray annotations and probesets selection for cross-platform direct data integration
Scrna_pipelines_paper
⭐
6
Handdetection_maskrcnn
⭐
6
An ML project to train and use a Mask R CNN pipeline for detecting hands in pictures
Lc Ms Pachyderm
⭐
6
Start-to-end LC-MS-analysis workflow definition on Pachyderm
Sql To Bigquery Dataflow
⭐
6
This project contains a basic pipeline for migrating a MS SQL Server catalog to a BigQuery dataset.
Isetimagepipeline
⭐
6
Bayesian reconstruction analysis of the initial visual encoding, using ISETBio as the visual system model.
Ml Pipelines
⭐
6
Application for managing machine learning pipelines and human workflows around them.
Ciml
⭐
5
a machine learning pipeline for analyzing CI results.
Lnisks
⭐
5
Mambo
⭐
5
A simple in-memory, configuration driven, data processing pipeline for Apache Spark.
Polya_analysis
⭐
5
Analysis pipeline of poly(A) tails lengths from ONT poly(A) standards dataset. Accompanies our forthcoming publication.
Pipelinr
⭐
5
Real-Time Visualization of Big Data - Master Thesis of Robin Wieruch - 2014
Droprna
⭐
5
Processing 10X,Drop-seq and inDrop RNA-Seq dataset.
360audiovisual
⭐
5
This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality
Scenic Nf
⭐
5
DEPRECATED | pySCENIC pipeline implemented in Nextflow using containers
Neuropipe
⭐
5
Easy scaffolding for machine learning pipelines in Scikit-Learn
Beam_summit
⭐
5
Workshop for 2020 Apache Beam Summit: using Beam to build data pipelines for deep learning.
Factory
⭐
5
Datahub factory for dataflows
Related Searches
Python Dataset (14,792)
Jupyter Notebook Dataset (6,824)
Python Pipeline (4,255)
Deep Learning Dataset (2,364)
Machine Learning Dataset (2,279)
Dataset Pytorch (1,847)
Dataset Tensorflow (1,583)
Dataset Classification (1,500)
Javascript Pipeline (1,369)
Dataset Convolutional Neural Networks (1,264)
1-51 of 51 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.