Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for data science feature engineering
data-science
x
feature-engineering
x
81 search results found
Nni
⭐
13,725
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
Tpot
⭐
9,516
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
Featuretools
⭐
7,109
An open source python library for automated feature engineering
Mljar Supervised
⭐
2,867
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
Metarank
⭐
1,949
A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagement. A friendly Learn-to-Rank engine
Feathr
⭐
1,886
Feathr – A scalable, unified data and AI engineering platform for enterprise
Featureform
⭐
1,716
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
Feature_engine
⭐
1,651
Feature engineering package with sklearn like functionality
Hamilton
⭐
1,538
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.
Auto_ml
⭐
1,442
[UNMAINTAINED] Automated machine learning for analytics & production
Deep_learning_machine_learning_stock
⭐
1,169
Deep Learning and Machine Learning stocks represent promising opportunities for both long-term and short-term investors and traders.
Hopsworks
⭐
1,041
Hopsworks - Data-Intensive AI platform with a Feature Store
Autodl
⭐
999
Automated Deep Learning without ANY human intervention. 1'st Solution for AutoDL challenge@NeurIPS.
Hamilton
⭐
877
A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton
Lightautoml
⭐
769
LAMA - automatic model creation framework
Tsfel
⭐
758
An intuitive library to extract features from time series.
Evalml
⭐
679
EvalML is an AutoML library written in python.
Featexp
⭐
656
Feature exploration for supervised learning
Hyperparameter_hunter
⭐
635
Easy hyperparameter optimization and automatic result saving across machine learning algorithms and libraries
Complete Life Cycle Of A Data Science Project
⭐
499
Complete-Life-Cycle-of-a-Data-Science-Project
Amazing Feature Engineering
⭐
485
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Open_source_demos
⭐
478
A collection of demos showcasing automated feature engineering and machine learning in diverse use cases
Hyperactive
⭐
475
An optimization and data collection toolbox for convenient and fast prototyping of computationally expensive models.
Feature Selection
⭐
475
Features selector based on the self selected-algorithm, loss function and validation method
Deltapy
⭐
411
DeltaPy - Tabular Data Augmentation (by @firmai)
Tsflex
⭐
340
Flexible time series feature extraction & processing
Awesome Feature Engineering
⭐
316
A curated list of resources dedicated to Feature Engineering Techniques for Machine Learning
Feature Engineering For Machine Learning
⭐
314
Code repository for the online course Feature Engineering for Machine Learning
Upgini
⭐
272
Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & commercial LLMs
My Data Competition Experience
⭐
271
本人多次机器学习与大数据竞赛Top5的经验总结,满满的干货,拿好不谢
Feathub
⭐
255
FeatHub - A stream-batch unified feature store for real-time machine learning
Feature Engineering Tutorials
⭐
217
Data Science Feature Engineering and Selection Tutorials
The Data Science Workshop
⭐
156
A New, Interactive Approach to Learning Data Science
Datasist
⭐
137
A Python library for easy data analysis, visualization, exploration and modeling
Raptor
⭐
136
Transform your pythonic research to an artifact that engineers can deploy easily.
Tpot2
⭐
118
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
Evolutionaryforest
⭐
108
An open source python library for automated feature engineering based on Genetic Programming
Nba_betting
⭐
93
Using data analytics and machine learning to create a comprehensive and profitable system for predicting the outcomes of NBA games.
Featurehub
⭐
80
A collaborative feature engineering system built on JupyterHub
Gallia Core
⭐
79
A schema-aware Scala library for data transformation
Anovos
⭐
78
Anovos - An Open Source Library for Scalable feature engineering Using Apache-Spark
Caafe
⭐
55
Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering" by Hollmann, Müller, and Hutter (2023).
Desbordante
⭐
54
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
Mindware
⭐
54
An efficient open-source AutoML system for automating machine learning lifecycle, including feature engineering, neural architecture search, and hyper-parameter tuning.
Prosto
⭐
53
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Datacon
⭐
49
🏆DataCon大数据安全分析大赛,2019年方向二(恶意代码检测)冠军源码、2020年方向五(恶
Autotabular
⭐
45
Automatic machine learning for tabular data. ⚡🔥⚡
Data Science Regular Bootcamp
⭐
39
Regular practice on Data Science, Machien Learning, Deep Learning, Solving ML Project problem, Analytical Issue. Regular boost up my knowledge. The goal is to help learner with learning resource on Data Science filed.
Datacook
⭐
38
Machine Learning and Data Analysis in JavaScript.
Ds2
⭐
37
Easiest way to use AI models without coding (Web UI & API support)
Cooka
⭐
36
A lightweight and visual AutoML system
Kaggle Berlin
⭐
36
Material of the Kaggle Berlin meetup group!
Feagen
⭐
33
(deprecated) A fast and memory-efficient Python data engineering framework for machine learning.
Bytehub
⭐
22
ByteHub: making feature stores simple
Data Science End To End
⭐
22
A Respository to get you job ready as a Data Scientist
Skrobot
⭐
19
skrobot is a Python module for designing, running and tracking Machine Learning experiments / tasks. It is built on top of scikit-learn framework.
Tricentis Lead Scoring
⭐
18
Lead Scoring: Optimizing SaaS Marketing-Sales Funnel by Extracting the Best Leads with Applied Machine Learning
Predict Household Poverty
⭐
18
Predict the poverty of households in Costa Rica using automated feature engineering.
Bubble_plot
⭐
18
Visualize linear and non-linear connections between numerical/categorical features (2D histogram with bubbles)
Data Science
⭐
17
Utilizing Kaggle Data and Real-World Data for Data Science and Prediction in Python, R, Excel, Power BI, and Tableau.
Insolver
⭐
16
Low code machine learning library, specified for insurance tasks: prepare data, build model, implement into production.
Rheoceros
⭐
15
Cloud-based AI / ML workflow and data application development framework
Cortana Intelligence Customer360
⭐
13
This repository contains instructions and code to deploy a customer 360 profile solution on Azure stack using the Cortana Intelligence Suite.
Data_analysis
⭐
12
Notebooks on some of the past data analysis and data science projects I've done
Featuretools_sql
⭐
11
Automated creation of EntitySets from relational data stored in SQL databases
Autolearn
⭐
11
AutoLearn, a domain independent regression-based feature learning algorithm.
Feature Selection Techniques
⭐
11
Python code source for features selection 👨🔬 series on medium website. 📰
Mercury Dataschema
⭐
11
Utility package that, given a Pandas DataFrame, it uses the DataSchema class which auto-infers feature types and automatically calculates different statistics depending on the types.
Zoish
⭐
11
Zoish is a Python package that streamlines machine learning by leveraging SHAP values for feature selection and interpretability, making model development more efficient and user-friendly
Musigan
⭐
10
Music generation with GAN!
Kts
⭐
10
Interactive ML Toolset
Avito Demand Prediction Challenge
⭐
7
It is a Competition for Regression Challenge held by Kaggle, It is based on a Avito Dataset whose size is 123GB which can be accessed from Kaggle, I have done Data Pre-processing, feature engineering, feature extraction, data visualization, machine learning, stacking and boosting
Kaggle
⭐
7
Kaggle Courses - All Exercises of the respective courses.
Fepipeline
⭐
7
A easy to get start, scalable, distributed feature engineering framework based on Spark.
Sk Transformers
⭐
7
A collection of pandas & scikit-learn compatible transformers for preprocessing and feature engineering 🛠
Predicting Kickstarter Campaign Outcomes Using Nlp Feature Engineering
⭐
6
Predicting Kickstarter Campaign Outcomes Using NLP - Springboard - Capstone 2
Featurehub
⭐
6
The most comprehensive library of AI/ML features across multiple domains. Our goal is to create a dataset that serves as a valuable resource for researchers and data scientists worldwide
Data Scientist
⭐
6
A Minimalist RoadMap to the Data Science World
Introduction To Data Analyst And Data Science
⭐
6
Introduction to Data Analyst and Data Science for Beginners
Codes And Presentations
⭐
5
Materiais Meetup de Machine Learning de BH
Titanic Passenger Survival Prediction
⭐
5
Using Classification Techniques, Data reprocessing, Feature Engineering, Feature Extraction and Classification Algorithms from Machine Learning to Predict who can Survive the attack of Tsunami.
Related Searches
Machine Learning Data Science (5,390)
Jupyter Notebook Data Science (4,295)
Python Data Science (4,282)
Deep Learning Data Science (1,250)
R Data Science (1,164)
Html Data Science (872)
Data Science Pandas (794)
Artificial Intelligence Data Science (749)
Data Science Scikit Learn (432)
Visualization Data Science (422)
1-81 of 81 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.