Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python data preprocessing
data-preprocessing
x
python
x
53 search results found
Automatic_speech_recognition
⭐
2,743
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Klib
⭐
446
Easy to use Python library of customized functions for cleaning and analyzing data.
Transbigdata
⭐
351
A Python package develop for transportation spatio-temporal big data processing, analysis and visualization.
Nonechucks
⭐
315
Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
100 Days Of Ml Code
⭐
201
A day to day plan for this challenge. Covers both theoritical and practical aspects
Customizable Gpt Chatbot
⭐
186
A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs, documents, websites, and YouTube videos. Leveraging OpenAI's GPT-3.5, Pinecone, FAISS, and Celery for seamless integration and performance.
Convtools Ita
⭐
176
convtools is a python library to declaratively define conversions for processing collections, doing complex aggregations and joins.
Semsegpipeline
⭐
145
A simpler way of reading and augmenting image segmentation data into TensorFlow
Pandas Tutorial
⭐
124
Jupyter Notebooks and Data Sets for Pandas Library
Smmt
⭐
118
Social Media Mining Toolkit (SMMT) main repository
Mzutils
⭐
109
Cocosplit
⭐
108
Simple tool to split COCO annotations into train/test datasets.
Dali_backend
⭐
104
The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.
Tensormsa
⭐
103
Deep learning GUI frame work for enterprise
Segan Pytorch
⭐
83
SEGAN pytorch implementation https://arxiv.org/abs/1703.09452
Prosto
⭐
53
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
25daysinmachinelearning
⭐
48
I will update this repository to learn Machine learning with python with statistics content and materials
Sciblox
⭐
46
sciblox - Easier Data Science and Machine Learning
Candock
⭐
41
A time series signal analysis and classification framework
Modelscript
⭐
40
REPO MOVED TO https://github.com/repetere/jsonstack-data - Data Science and Machine learning in JavaScript
Data Science Using Python University Course Module
⭐
40
“Data science” is just about as broad of a term as they come. It may be easiest to describe what it is by listing its more concrete components: Data exploration & analysis. Included here: Pandas; NumPy; SciPy; a helping hand from Python's Standard Library.
Nuts Ml
⭐
29
Flow-based data pre-processing for deep learning
Data Purifier
⭐
26
A Python library for Automated Exploratory Data Analysis, Automated Data Cleaning, and Automated Data Preprocessing For Machine Learning and Natural Language Processing Applications in Python.
Yandexcatboost Python Demo
⭐
26
Demo on the capability of Yandex CatBoost gradient boosting classifier on a fictitious IBM HR dataset obtained from Kaggle. Data exploration, cleaning, preprocessing and model tuning are performed on the dataset
Cereja
⭐
21
Cereja is a bundle of useful functions we don't want to rewrite and .. just pure fun!
Stock Predictor V4
⭐
21
A reinforcement learning model specialized in stock prediction utilizing deep learning techniques, incorporating reward mechanisms, compatible with any machine equipped with Python.
Gpuparallel
⭐
21
Joblib-like interface for parallel GPU computations (e.g. data preprocessing)
Sumstatsrehab
⭐
20
GWAS summary statistics files QC tool
Machinera 2020
⭐
19
This is an AI Series where we will cover Machine Learning and Deep Learning topics from the very basics.
Learn2clean
⭐
18
Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning
Stock Trading Using Machine Learning
⭐
17
A comprehensive approach for stock trading implemented using Neural Network and Reinforcement Learning separately.
Ptrail
⭐
16
PTRAIL is a state-of-the art parallel computation library for Mobility Data Preprocessing and feature extraction.
Sparklanes
⭐
16
A lightweight data processing framework for Apache Spark
Automated Data Preprocessing
⭐
14
A command-line utility program for automating the trivial, frequently occurring data preparation tasks: missing value interpolation, outlier removal, and encoding categorical variables.
Teal
⭐
14
Library of TensorFlow layers for audio data processing and data augmentation
Klar Eda
⭐
14
A python library for automated exploratory data analysis
Xplore
⭐
13
A python package built for data scientist/analysts, AI/ML engineers for exploring features of a dataset in minimal number of lines of code for quick analysis before data wrangling and feature extraction.
Hr Analytics
⭐
12
Analyzing the HR Criteria of a Company and how they promote their Employees and keep Balance between them using Data Analytics, Data Visualizations, and Machine Learning Models for Classification Purposes.
Split Markdown4gpt
⭐
11
A Python tool for splitting large Markdown files into smaller sections based on a specified token limit. This is particularly useful for processing large Markdown files with GPT models, as it allows the models to handle the data in manageable chunks.
Android App Malware Detector
⭐
11
A Deep Learning Model for detecting Malware Applications
Linked Eed
⭐
10
Aim is to come up with a job recommender system, which takes the skills from LinkedIn and jobs from Indeed and throws the best jobs available for you according to your skills.
Knead
⭐
10
A command line tool for preprocessing, manipulating and serializing font files for deep learning applications.
Atlantic
⭐
10
Atlantic - Automated Data Preprocessing Framework for Supervised Machine Learning
Data Modori
⭐
10
Monotonic Optimal Binning
⭐
9
Monotonic Optimal Binning algorithm is a statistical approach to transform continuous variables into optimal and monotonic categorical variables.
Luciferml
⭐
8
Semi-Auto Machine Learning Library by d4rk-lucif3r
Pypreprocessing
⭐
8
Especially useful for preprocessing of datasets like Raman spectra, infrared spectra, UV/Vis spectra, but also HPLC data and many other types of data. pyPreprocessing includes baseline correction, smoothing, filtering, normalization and transformation.
Customizable Web Crawler
⭐
8
This web crawler can be customized to scrape almost all types of websites.
Data Preprocessing Template
⭐
7
This repository includes all the Data Preprocessing required before using a dataset on a Machine Learning Model. Please refer README on how to use.
Ceemdan Ewt Lstm
⭐
7
Wind Power Forecasting Based on Hybrid CEEMDAN-EWT Deep Learning Method
Eeg_signalsclassification
⭐
7
Preprocessing, analysis and classification of EEG signals into 4 classes.
Predict Blog Author Features
⭐
7
Predicts gender, age, label, and zodiac sign of the writer from the given text.
Pyhelpers
⭐
7
PyHelpers: an open-source toolkit for facilitating Python users' data manipulation tasks
Deep Learning For Data Science
⭐
6
Deep Learning Case Studies with Tensorflow and Keras for Beginners-Advanced: ANN, CNN, RNN, Self-Organizing Maps, Boltzmann Machines, Stacked Autoencoders
Machinelearninginhealthcare
⭐
6
This repository focuses on two machine learning projects in the healthcare domain.
Step Detection Using Machine Learning
⭐
6
Implements an entire machine learning pipeline to train and evaluate a Random Forest Classifier on labeled gait data for walking. Data generated during the experiment has led to helpful insights in to the problem domain.
Loren_frank_data_processing
⭐
6
Python tools for reading in data from Loren Frank's lab
Kaggle Brainnetprediction Toolbox
⭐
6
A Python toolbox for predicting brain network (graph) evolution over time from a single observation. The codes of the 20 competing Kaggle teams along with the competition datasets are made available.
Docx Content Modify
⭐
5
Python编写的处理法务邮单自动批量生成的脚本小工具-提取判决书内容免去手输填充邮单-Legal agency postal receipt automatically generate app
Mlimputer
⭐
5
MLimputer - Null Imputation Framework for Supervised Machine Learning
A Z Machine Learning
⭐
5
This repository contains the code related to machine learning knowledge. Each code has been provided from start to end with systematical vew of each concept that you will need in your journey of learning ML.
Img_colorization
⭐
5
This project uses Keras and Python to convert a grayscale image to color without any additional information.
Beijing Multi Site Air Quality Data Data Set
⭐
5
The present project aims to predict air pollution in Beijing, China, using the data set "Beijing Multi-Site Air-Quality Data Data Set"
Ml Toolkit Project
⭐
5
A general-purpose toolkit for data preprocessing, machine learning modeling, and visualization.
Machine Learning In Python
⭐
5
My learnings on different algorithms of Machine Learning with Python .
Related Searches
Python Django (28,897)
Python Machine Learning (20,195)
Python Dataset (14,792)
Python Docker (14,113)
Python Tensorflow (13,736)
Python Deep Learning (13,092)
Python Jupyter Notebook (12,976)
Python Network (11,495)
Python Html (10,924)
Python Algorithms (10,033)
1-53 of 53 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.