Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Smile | 5,736 | 121 | 30 | a month ago | 30 | December 05, 2020 | 10 | other | Java | |
Statistical Machine Intelligence & Learning Engine | ||||||||||
Alink | 3,343 | 1 | 2 months ago | 16 | September 08, 2022 | 48 | apache-2.0 | Java | ||
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform. | ||||||||||
Machine Learning With Python | 2,712 | 14 days ago | 8 | bsd-2-clause | Jupyter Notebook | |||||
Practice and tutorial-style notebooks covering wide variety of machine learning techniques | ||||||||||
Awesome_time_series_in_python | 1,811 | 4 months ago | 4 | |||||||
This curated list contains python packages for time series analysis | ||||||||||
Mlr | 1,595 | 65 | 36 | 4 months ago | 22 | October 05, 2020 | 9 | other | R | |
Machine Learning in R | ||||||||||
Mlj.jl | 1,589 | 22 days ago | 90 | other | Julia | |||||
A Julia machine learning framework | ||||||||||
Pycm | 1,382 | 5 | 8 | 6 days ago | 39 | April 27, 2022 | 12 | mit | Python | |
Multi-class confusion matrix library in Python | ||||||||||
Machine_learning_and_deep_learning | 425 | 19 days ago | gpl-3.0 | Jupyter Notebook | ||||||
Uci Ml Api | 189 | 2 years ago | 3 | mit | Python | |||||
Simple API for UCI Machine Learning Dataset Repository (search, download, analyze) | ||||||||||
Data Science Toolkit | 185 | a year ago | 1 | HTML | ||||||
Collection of stats, modeling, and data science tools in Python and R. |
This repository contains the prediction of baseball statistics using MLB Statcast Metrics.
Goals
Classification
Build and train models to predict home runs and extra-base hits implementing the following approaches:
Implement over-sampling for imbalanced data to improve the quality of predictive modeling (i.e., generalizability).
Apply regularization and cross-validation techniques for model evaluation, selection, and optimization.
Regression
Build and train models to predict hit distance implementing the following approaches:
Apply regularization (Ridge, Lasso, Elastic Net) and cross-validation (k-fold) techniques for model evaluation, selection, and optimization.