Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for machine learning data cleaning
data-cleaning
x
machine-learning
x
38 search results found
Dat8
⭐
1,549
General Assembly's 2015 Data Science course in Washington, DC
Optimus
⭐
1,446
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Skrub
⭐
1,010
Prepping tables for machine learning
Encord Active
⭐
385
The toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling.
Voicebook
⭐
325
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
Nonechucks
⭐
315
Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
Feature Engineering Tutorials
⭐
217
Data Science Feature Engineering and Selection Tutorials
Allie
⭐
126
🤖 An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python 3.6 required.
Mzutils
⭐
109
Holoclean Legacy Deprecated
⭐
74
A Machine Learning System for Data Enrichment.
Opendataval
⭐
60
OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)
Pydvl
⭐
52
pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation
Sliceguard
⭐
43
A library for detecting problematic data segments in structured and unstructured data with few lines of code.
Bunkatopics
⭐
43
🗺️ Data Cleaning and Textual Data Visualization 🗺️
Amora Data Build Tool
⭐
37
Amora Data Build Tool enables analysts and engineers to transform data on the data warehouse (BigQuery) by writing Amora Models that describe the data schema using Python's "PEP484 - Type Hints" and select statements with SQLAlchemy. Amora is able to transform Python code into SQL data transformation jobs that run inside the warehouse.
Drugs Recommendation Using Reviews
⭐
27
Analyzing the Drugs Descriptions, conditions, reviews and then recommending it using Deep Learning Models, for each Health Condition of a Patient.
Fifa 2019 Analysis
⭐
21
This is a project based on the FIFA World Cup 2019 and Analyzes the Performance and Efficiency of Teams, Players, Countries and other related things using Data Analysis and Data Visualizations
Learn2clean
⭐
18
Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning
Natural Language Processing With Machine Learning
⭐
18
This repository builds a basic understanding of Natural Language Processing and Machine Learning tasks around it.
Cleanlab Studio
⭐
16
Client interface for all things Cleanlab Studio
World Food Production
⭐
14
Comparing Top food and feed Producers around the globe and also seeking some interesting answers, solutions, patterns, hints and warnings through the power of Data Analysis and Data Visualization using Machine Learning.
Flight_delay_prediction
⭐
12
A two-stage predictive machine learning engine that forecasts the on-time performance of flights for 15 different airports in the USA based on data collected in 2016 and 2017.
Twitter Sentiment Analysis
⭐
12
It is a Natural Language Processing Problem where Sentiment Analysis is done by Classifying the Positive tweets from negative tweets by machine learning models for classification, text mining, text analysis, data analysis and data visualization
Awesome Ml Monitoring
⭐
11
A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profiling data 🚀
Churn Modelling Dataset
⭐
10
Predicting which set of the customers are gong to churn out from the organization by looking into some of the important attributes and applying Machine Learning and Deep Learning on it.
Aqua
⭐
9
AQuA: A Benchmarking Tool for Label Quality Assessment
Google Job Skills
⭐
9
Having an Exploratory Analysis at what kind of Jobs and Job Locations are provided by Google and Youtube, also we look into some specific details which are important to get hired by youtube and google.
Scikit Clean
⭐
9
A collection of algorithms for detecting and handling label noise
Black Friday Regression Analysis
⭐
9
Predicting Prices for the products to be sold on Black Friday in US using Regression Analysis, Feature Engineering, Feature Selection, Feature Extraction and Data analysis - Data Visualizations.
Ai Ml Jupyter Notebooks
⭐
8
A collection of Jupyter notebooks for AI and ML tasks. Explore, learn, and contribute to advance your skills in artificial intelligence and machine learning. #Hacktoberfest friendly!
Graduate Admissions Analysis
⭐
8
Analyzing the Factors on which Graduates get Admissions in Abroad and Visualizing some of the most intriguing and interesting patterns followed onto it using Data Analysis and Data Visualizations Using Machine Learning.
Hackathon_motorica_2022
⭐
8
3 этапа хакатона, совместно проведенного Motorica и Skillfactory (numpy, tensorflow)
Allstate Claims Severity
⭐
7
Udacity Machine Learning Engineer Nanodegree capstone proposal.
Av_ultimate_student_hunt
⭐
7
Solution for the Ultimate Student Hunt Challenge (1st place).
Big Mart Sales Prediction
⭐
7
Using Machine Learning Algorithms for Regression Analysis to predict the sales pattern and Using Data Analysis and Data Visualizations to Support it.
Kaggle
⭐
7
Kaggle Courses - All Exercises of the respective courses.
Pakistan Suicide Bombing Dataset
⭐
6
Analyzing the Suicide Bombing Patterns and seeking some of the most tangled questions with good visualizations with the help of Machine Learning and Data Science.
Loan Default Prediction
⭐
5
L&T Financial Services & Analytics Vidhya presents ‘DataScience FinHack’. where I have predicted whether the customer will be defaulter in the first EMI payment using different algorithms from machine learning
Titanic Passenger Survival Prediction
⭐
5
Using Classification Techniques, Data reprocessing, Feature Engineering, Feature Extraction and Classification Algorithms from Machine Learning to Predict who can Survive the attack of Tsunami.
Malware Classification
⭐
5
A Z Machine Learning
⭐
5
This repository contains the code related to machine learning knowledge. Each code has been provided from start to end with systematical vew of each concept that you will need in your journey of learning ML.
Related Searches
Python Machine Learning (14,099)
Jupyter Notebook Machine Learning (12,247)
Machine Learning Neural Network (4,397)
Machine Learning Tensorflow (4,050)
Machine Learning Natural Language Processing (3,891)
Machine Learning Artificial Intelligence (3,877)
Machine Learning Data Science (3,802)
Machine Learning Pytorch (2,910)
Machine Learning Dataset (2,298)
Machine Learning Classification (2,091)
1-38 of 38 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.