Awesome Open Source
Search results for machine learning data quality
data-quality
machine-learning
27 search results found
Made With Ml
⭐
35,496
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Applied Ml
⭐
24,828
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Ydata Profiling
⭐
11,983
One-line data quality profiling and exploratory data analysis for Pandas and Spark DataFrames.
Feast
⭐
5,053
Feature Store for Machine Learning
Whylogs
⭐
2,533
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
Mlops Course
⭐
2,427
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Feathr
⭐
1,886
Feathr – A scalable, unified data and AI engineering platform for enterprise
Featureform
⭐
1,670
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
Data Centric Ai
⭐
892
A curated, but incomplete, list of data-centric AI resources.
Zingg
⭐
828
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Chaos_genius
⭐
671
ML powered analytics engine for outlier detection and root cause analysis.
Failed Ml
⭐
585
Compilation of high-profile real-world examples of failed machine learning projects
Awesome Data Catalogs
⭐
441
📙 Awesome Data Catalogs and Observability Platforms.
Encord Active
⭐
385
The toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling.
Lale
⭐
321
Library for Semi-Automated Data Science
Awesome Data Centric Ai
⭐
282
Open-Source Software, Tutorials, and Research on Data-Centric AI 🤖
Feathub
⭐
255
FeatHub - A stream-batch unified feature store for real-time machine learning
Whylogs Java
⭐
179
Profile and monitor your ML data pipeline end-to-end
Pandas_dq
⭐
101
Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.
Pydvl
⭐
52
pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation
Awesome Python For Data Science
⭐
51
A curated list of awesome resources such as books, tutorials, courses, open-source libraries, exercises, and other materials that support Pythonistas in the making, and Pythonistas migrating into Data Science! 📊
Dqlab Career Track
⭐
42
A collection of scripts written to complete DQLab Data Analyst Career Track 📊
Amora Data Build Tool
⭐
37
Amora Data Build Tool enables analysts and engineers to transform data on the data warehouse (BigQuery) by writing Amora Models that describe the data schema using Python's "PEP484 - Type Hints" and select statements with SQLAlchemy. Amora is able to transform Python code into SQL data transformation jobs that run inside the warehouse.
Acharya
⭐
35
A Data Centric annotation tool for your Named Entity Recognition projects
Osm Data Classification
⭐
24
OpenStreetMap Data Classification
Redflag
⭐
19
Safety net for machine learning pipelines. Plays nice with sklearn and pandas.
Cleanlab Studio
⭐
16
Client interface for all things Cleanlab Studio
Iau Course
⭐
12
Intelligent Data Analysis (IAU_B) @ FIIT STU in Bratislava
Awesome Ml Monitoring
⭐
11
A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profiling data 🚀
Nlp Data Readiness
⭐
11
This is a document concerning Data Readiness in the context of machine learning and Natural Language Processing.
Data Iq
⭐
5
Data-IQ: Characterizing subgroups with heterogeneous outcomes in tabular data (NeurIPS 2022)
Data Suite
⭐
5
Data-SUITE: Data-centric identification of in-distribution incongruous examples (ICML 2022)
Geoscience Data Quality For Machine Learning
⭐
5
Looking at the problems associated with geoscience datasets for data science
Data Imputation Paper
⭐
5
Research code for the paper "A Benchmark for Data Imputation Methods".
Copyright 2018-2024 Awesome Open Source. All rights reserved.