Project Name	Stars	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
Alink	3,479	16	3 months ago	19	November 03, 2023	53	apache-2.0	Java
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.
Disentangled Attribution Curves	23		3 years ago				mit	Python
Using / reproducing DAC from the paper "Disentangled Attribution Curves for Interpreting Random Forests and Boosted Trees"
Data Science End To End	22		a year ago				mit	Jupyter Notebook
A repository to get you job-ready as a Data Scientist
50 Days Of Statistics For Data Science	15		2 years ago					Jupyter Notebook
This repository consists of a 50-day program. All the statistics required for a complete understanding of data science will be uploaded to this repository.
Aps2020	11		2 years ago				gpl-3.0	R
Code for the paper 'Variable Selection with Copula Entropy', published in the Chinese Journal of Applied Probability and Statistics
Monotonic Optimal Binning	9		6 months ago	4	August 03, 2023		mit	Python
Monotonic Optimal Binning algorithm is a statistical approach to transform continuous variables into optimal and monotonic categorical variables.
Describer_ml	8		8 months ago	28	January 17, 2023	2	mit	Python
A set of descriptive statistics and hypothesis tests across different types of data
Target Likelihood Encoding	8		5 years ago					Python
Generate target statistics
Outrank	8		5 months ago			4	bsd-3-clause	Python
A Python library for efficient feature ranking and selection on sparse data sets.
Avito Demand Prediction Challenge	7		5 years ago				gpl-3.0	Jupyter Notebook
A regression competition held by Kaggle, based on the Avito dataset (123 GB), which can be accessed from Kaggle. I did data pre-processing, feature engineering, feature extraction, data visualization, machine learning, stacking, and boosting.

Alternatives To Monotonic Optimal Binning

Popular Statistics Projects

Scikit Learn ⭐ 57,160

scikit-learn: machine learning in Python

dependent packages 11,480; total releases 73; latest release October 23, 2023; most recent commit 5 months ago

Probabilistic Programming And Bayesian Methods For Hackers ⭐ 26,097

aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)

most recent commit 7 months ago

Umami ⭐ 19,745

Umami is a simple, fast, privacy-focused alternative to Google Analytics.

dependent packages 2; total releases 2; latest release July 24, 2020; most recent commit 2 months ago

Analytics ⭐ 18,140

Simple, open source, lightweight (< 1 KB) and privacy-friendly web analytics alternative to Google Analytics.

most recent commit 3 months ago

Excelize ⭐ 17,554

Go language library for reading and writing Microsoft Excel™ (XLAM / XLSM / XLSX / XLTM / XLTX) spreadsheets

dependent packages 479; total releases 187; latest release August 27, 2023; most recent commit 9 days ago

Popular Feature Engineering Projects

Nni ⭐ 13,725

An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyper-parameter tuning.

dependent packages 27; total releases 55; latest release September 14, 2023; most recent commit 4 months ago

Tpot ⭐ 9,516

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

dependent packages 22; total releases 62; latest release August 15, 2023; most recent commit 2 months ago

Featuretools ⭐ 7,109

An open source python library for automated feature engineering

dependent packages 43; total releases 103; latest release October 26, 2023; most recent commit 10 days ago

Mljar Supervised ⭐ 2,867

Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation

dependent packages 2; total releases 84; latest release September 26, 2023; most recent commit 5 months ago

Fe4ml Zh ⭐ 2,469

[Translation] Feature Engineering for Machine Learning

dependent packages 2; total releases 1; latest release September 19, 2020; most recent commit 10 months ago

Popular Data Processing Categories