Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python feature engineering
feature-engineering
x
python
x
161 search results found
Nni
⭐
13,725
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
Tpot
⭐
9,463
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
Featuretools
⭐
7,009
An open source python library for automated feature engineering
Mljar Supervised
⭐
2,867
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
Fe4ml Zh
⭐
2,467
📖 [译] 面向机器学习的特征工程
Featureform
⭐
1,670
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
Feature_engine
⭐
1,651
Feature engineering package with sklearn like functionality
Auto_ml
⭐
1,442
[UNMAINTAINED] Automated machine learning for analytics & production
Hamilton
⭐
1,272
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.
Sgx Full Orderbook Tick Data Trading Strategy
⭐
1,151
Providing the solutions for high-frequency trading (HFT) strategies using data science approaches (Machine Learning) on Full Orderbook Tick Data.
Hopsworks
⭐
1,041
Hopsworks - Data-Intensive AI platform with a Feature Store
Autodl
⭐
999
Automated Deep Learning without ANY human intervention. 1'st Solution for AutoDL challenge@NeurIPS.
Autots
⭐
935
Automated Time Series Forecasting
Hamilton
⭐
877
A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton
Feature Engineering And Feature Selection
⭐
798
A Guide for Feature Engineering and Feature Selection, with implementations and examples in Python.
Pyfm
⭐
770
Factorization machines in python
Lightautoml
⭐
769
LAMA - automatic model creation framework
Functime
⭐
768
Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.
Tsfel
⭐
758
An intuitive library to extract features from time series.
Kaggler
⭐
723
Code for Kaggle Data Science Competitions
Kaggle Quora Question Pairs
⭐
684
Kaggle:Quora Question Pairs, 4th/3396 (https://www.kaggle.com/c/quora-question-pairs)
Evalml
⭐
679
EvalML is an AutoML library written in python.
Intelligent Trading Bot
⭐
650
Intelligent Trading Bot: Automatically generating signals and trading based on machine learning and feature engineering
Hyperparameter_hunter
⭐
635
Easy hyperparameter optimization and automatic result saving across machine learning algorithms and libraries
Featurewiz
⭐
516
Use advanced feature engineering strategies and select best features from your data set with a single line of code.
Complete Life Cycle Of A Data Science Project
⭐
499
Complete-Life-Cycle-of-a-Data-Science-Project
Open_source_demos
⭐
478
A collection of demos showcasing automated feature engineering and machine learning in diverse use cases
Hyperactive
⭐
475
An optimization and data collection toolbox for convenient and fast prototyping of computationally expensive models.
Feature Selection
⭐
475
Features selector based on the self selected-algorithm, loss function and validation method
Temporian
⭐
461
Temporian is an open-source Python library for preprocessing ⚡ and feature engineering 🛠 temporal data 📈 for machine learning applications 🤖
Open Solution Home Credit
⭐
444
Open solution to the Home Credit Default Risk challenge 🏡
Gan For Tabular Data
⭐
442
We well know GANs for success in the realistic image generation. However, they can be applied in tabular data generation. We will review and examine some recent papers about tabular GANs in action.
Autofeat
⭐
410
Linear Prediction Model with Automated Feature Engineering and Selection Capabilities
Deep Ctr Prediction
⭐
389
CTR prediction models based on deep learning(基于深度学习的广告推荐CTR预估模型)
Tsflex
⭐
340
Flexible time series feature extraction & processing
Mistql
⭐
331
A query / expression language for performing computations on JSON-like structures. Tuned for clientside ML feature extraction.
Hrv Analysis
⭐
318
Package for Heart Rate Variability analysis in Python
Feature Engineering For Machine Learning
⭐
314
Code repository for the online course Feature Engineering for Machine Learning
Nlpython
⭐
302
This repository contains the code related to Natural Language Processing using python scripting language. All the codes are related to my book entitled "Python Natural Language Processing"
Nyaggle
⭐
276
Code for Kaggle and Offline Competitions
Upgini
⭐
272
Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & commercial LLMs
My Data Competition Experience
⭐
271
本人多次机器学习与大数据竞赛Top5的经验总结,满满的干货,拿好不谢
Feathub
⭐
255
FeatHub - A stream-batch unified feature store for real-time machine learning
Joblib Spark
⭐
226
Joblib Apache Spark Backend
Feature Engineering Tutorials
⭐
217
Data Science Feature Engineering and Selection Tutorials
Geomancer
⭐
194
Automated feature engineering for geospatial data
Hanzi_char_featurizer
⭐
185
汉字字符特征提取器 (featurizer),提取汉字的特征(发音特征、字形特征)用做深度学习的特征 | A Chinese character feature extractor, which extracts the features of Chinese characters (pronunciation features, glyph features) as features for deep learning
The Data Science Workshop
⭐
156
A New, Interactive Approach to Learning Data Science
Albedo
⭐
142
A recommender system for discovering GitHub repos, built with Apache Spark
Datasist
⭐
137
A Python library for easy data analysis, visualization, exploration and modeling
Dominance Analysis
⭐
128
This package can be used for dominance analysis or Shapley Value Regression for finding relative importance of predictors on given dataset. This library can be used for key driver analysis or marginal resource allocation models.
Tpot2
⭐
118
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
Evolutionaryforest
⭐
108
An open source python library for automated feature engineering based on Genetic Programming
Nba_betting
⭐
93
Using data analytics and machine learning to create a comprehensive and profitable system for predicting the outcomes of NBA games.
Nitrofe
⭐
84
NitroFE is a Python feature engineering engine which provides a variety of modules designed to internally save past dependent values for providing continuous calculation.
Arfs
⭐
80
All Relevant Feature Selection
Featurehub
⭐
80
A collaborative feature engineering system built on JupyterHub
Sklearn Feature Engineering
⭐
78
使用sklearn做特征工程
Anovos
⭐
78
Anovos - An Open Source Library for Scalable feature engineering Using Apache-Spark
Autogbt Alt
⭐
73
An experimental Python package that reimplements AutoGBT using LightGBM and Optuna.
Ads Recsys Datasets
⭐
67
This repository collects some datasets for Ads & RecSys uses, and provide easy-to-use hdf5 iterative access.
Home Credit Default Risk
⭐
59
Default risk prediction for Home Credit competition - Fast, scalable and maintainable SQL-based feature engineering pipeline
Caafe
⭐
55
Semi-automatic feature engineering process using Language Models and your dataset descriptions. Based on the paper "LLMs for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering" by Hollmann, Müller, and Hutter (2023).
Mindware
⭐
54
An efficient open-source AutoML system for automating machine learning lifecycle, including feature engineering, neural architecture search, and hyper-parameter tuning.
Prosto
⭐
53
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Fxy
⭐
53
Security-Scenes-Feature-Engineering-Toolkit, Continuous Integration.一款安全数据特征化工具
Eeg_riemannian
⭐
49
Accepted in IEEE Transactions on Emerging Topics in Computational Intelligence
Numer.ai
⭐
49
Build And Deploy Real Time Feature Pipeline
⭐
45
Develop and deploy a real-time feature pipeline in Python, using Bytewax 🐝 and Hopsworks Feature Store.
Autotabular
⭐
45
Automatic machine learning for tabular data. ⚡🔥⚡
Ayniy
⭐
40
Ayniy, All You Need is YAML
Data Science Regular Bootcamp
⭐
39
Regular practice on Data Science, Machien Learning, Deep Learning, Solving ML Project problem, Analytical Issue. Regular boost up my knowledge. The goal is to help learner with learning resource on Data Science filed.
Ds2
⭐
37
Easiest way to use AI models without coding (Web UI & API support)
Bytewax Hopsworks Example
⭐
37
Compute and store real-time features for crypto trading using Bytwax (stream processing) and Hopsworks (Feature Store)
Ml Forecast Features Eng
⭐
37
Machine Learning for Retail Sales Forecasting — Features Engineering
Cooka
⭐
36
A lightweight and visual AutoML system
Msda
⭐
34
multi-dimensional, multi-sensor, multivariate time series data analysis, unsupervised feature selection, unsupervised deep anomaly detection, and prototype of explainable AI for anomaly detector
Feagen
⭐
33
(deprecated) A fast and memory-efficient Python data engineering framework for machine learning.
Tubular
⭐
32
Python package implementing transformers for pre processing steps for machine learning.
Edaspy
⭐
30
Estimation of Distribution algorithms Python package
Feng
⭐
27
feng - feature engineering for machine-learning champions
Pic2vec
⭐
27
Lightweight Image Featurization Made Easy
Rtbcontrol
⭐
26
A feedback controller for stabilizing RTB performance to a target value.
Guided Machine Learning
⭐
26
Self learning guide for machine learning
Ballet
⭐
25
☀️🦶 A lightweight framework for collaborative, open-source feature engineering
Predicting Transportation Modes Of Gps Trajectories
⭐
24
Understanding transportation mode from GPS (Global Positioning System) traces is an essential topic in the data mobility domain. In this paper, a framework is proposed to predict transportation modes. This framework follows a sequence of five steps: (i) data preparation, where GPS points are grouped in trajectory samples; (ii) point features generation; (iii) trajectory features extraction; (iv) noise removal; (v) normalization. We show that the extraction of the new point features: bearing rate
Spotify_song_recommender
⭐
24
This project leverages spotify's api and provided user playlists to create and tune a neural network model that generates song recommendations based off of song data in provided playlists.
Disentangled Attribution Curves
⭐
23
Using / reproducing DAC from the paper "Disentangled Attribution Curves for Interpreting Random Forests and Boosted Trees"
Bytehub
⭐
22
ByteHub: making feature stores simple
Data Science End To End
⭐
22
A Respository to get you job ready as a Data Scientist
Quora Paraphrase Question Identification
⭐
21
Paraphrase question identification using Feature Fusion Network (FFN).
Kivyandroidclassification
⭐
21
Image Classification for Android using Artificial Neural Network using NumPy and Kivy.
Zca
⭐
21
ZCA whitening in python
Feature_engineering
⭐
20
(Under Development) Extract features from text and links. Useful for machine learning algorithms.
Som
⭐
19
Self-Organizing Map for unsupervised feature engineering and dimensionality reduction
2019 Sohu Contest
⭐
19
2019年4月8日,第三届搜狐校园内容识别算法大赛。
Skrobot
⭐
19
skrobot is a Python module for designing, running and tracking Machine Learning experiments / tasks. It is built on top of scikit-learn framework.
Kaggle Vsb Baseline
⭐
19
Predict Household Poverty
⭐
18
Predict the poverty of households in Costa Rica using automated feature engineering.
Real Time Technical Indicators
⭐
18
Learn to build a modular real-time feature pipeline, so you avoid Offline-Online Feature Skew, and your deployed ML models work as expected.
Related Searches
Python Python35 (791,354)
Python Machine Learning (20,195)
Python Dataset (14,792)
Python Tensorflow (13,736)
Python Deep Learning (13,092)
Python Jupyter Notebook (12,976)
Python Algorithms (10,033)
Python Artificial Intelligence (8,580)
Python Pytorch (7,877)
Python Keras (6,821)
1-100 of 161 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.