Pycaret

An open-source, low-code machine learning library in Python
Alternatives To Pycaret
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Learnopencv18,698
21 hours ago251Jupyter Notebook
Learn OpenCV : C++ and Python Examples
Turicreate11,02817212 days ago31September 30, 2020514bsd-3-clauseC++
Turi Create simplifies the development of custom machine learning models.
Pytorch Tutorial7,372
3 months ago28mitJupyter Notebook
Build your neural network easy and fast, 莫烦Python中文教学
Pycaret7,367133 days ago83June 06, 2022284mitJupyter Notebook
An open-source, low-code machine learning library in Python
Awesome Project Ideas6,856
3 months ago1mit
Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
Jetson Inference6,449
3 days ago217mitC++
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
Tensorflow_cookbook6,085
2 months ago28mitJupyter Notebook
Code for Tensorflow Machine Learning Cookbook
Cleanlab6,045317 hours ago18April 20, 202257agpl-3.0Python
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Smile5,73612130a month ago30December 05, 202010otherJava
Statistical Machine Intelligence & Learning Engine
Pyaudioanalysis4,9731189 months ago23February 07, 2022184apache-2.0Python
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
Alternatives To Pycaret
Select To Compare


Alternative Project Comparisons
Readme
drawing

An open-source, low-code machine learning library in Python

🎉🎉🎉 PyCaret 3.0 is now available. 🎉🎉🎉

pip install --upgrade pycaret

DocsTutorialsBlogLinkedInYouTubeSlack

Overview
CI/CD pytest on push Documentation Status
Code !pypi !python-versions !black
Downloads Downloads Downloads Downloads
License License
Community Slack

alt text

Welcome to PyCaret

PyCaret is an open-source, low-code machine learning library in Python that automates machine learning workflows. It is an end-to-end machine learning and model management tool that speeds up the experiment cycle exponentially and makes you more productive.

In comparison with the other open-source machine learning libraries, PyCaret is an alternate low-code library that can be used to replace hundreds of lines of code with few lines only. This makes experiments exponentially fast and efficient. PyCaret is essentially a Python wrapper around several machine learning libraries and frameworks such as scikit-learn, XGBoost, LightGBM, CatBoost, Optuna, Hyperopt, Ray, and few more.

The design and simplicity of PyCaret are inspired by the emerging role of citizen data scientists, a term first used by Gartner. Citizen Data Scientists are power users who can perform both simple and moderately sophisticated analytical tasks that would previously have required more technical expertise. PyCaret was inspired by the caret library in R programming language.

🚀 Installation

🌐 Option 1: Install via PyPi

PyCaret is tested and supported on 64-bit systems with:

  • Python 3.7, 3.8, 3.9, and 3.10
  • Ubuntu 16.04 or later
  • Windows 7 or later

You can install PyCaret with Python's pip package manager:

# install pycaret
pip install pycaret

PyCaret's default installation will not install all the optional dependencies automatically. Depending on the use case, you may be interested in one or more extras:

# install analysis extras
pip install pycaret[analysis]

# models extras
pip install pycaret[models]

# install tuner extras
pip install pycaret[tuner]

# install mlops extras
pip install pycaret[mlops]

# install parallel extras
pip install pycaret[parallel]

# install test extras
pip install pycaret[test]

##

# install multiple extras together
pip install pycaret[analysis,models]

Check out all optional dependencies. If you want to install everything including all the optional dependencies:

# install full version
pip install pycaret[full]

📄 Option 2: Build from Source

Install the development version of the library directly from the source. The API may be unstable. It is not recommended for production use.

pip install git+https://github.com/pycaret/[email protected] --upgrade

📦 Option 3: Docker

Docker creates virtual environments with containers that keep a PyCaret installation separate from the rest of the system. PyCaret docker comes pre-installed with a Jupyter notebook. It can share resources with its host machine (access directories, use the GPU, connect to the Internet, etc.). The PyCaret Docker images are always tested for the latest major releases.

# default version
docker run -p 8888:8888 pycaret/slim

# full version
docker run -p 8888:8888 pycaret/full

🏃‍♂️ Quickstart

1. Functional API

# Classification Functional API Example

# loading sample dataset
from pycaret.datasets import get_data
data = get_data('juice')

# init setup
from pycaret.classification import *
s = setup(data, target = 'Purchase', session_id = 123)

# model training and selection
best = compare_models()

# evaluate trained model
evaluate_model(best)

# predict on hold-out/test set
pred_holdout = predict_model(best)

# predict on new data
new_data = data.copy().drop('Purchase', axis = 1)
predictions = predict_model(best, data = new_data)

# save model
save_model(best, 'best_pipeline')

2. OOP API

# Classification OOP API Example

# loading sample dataset
from pycaret.datasets import get_data
data = get_data('juice')

# init setup
from pycaret.classification import ClassificationExperiment
s = ClassificationExperiment()
s.setup(data, target = 'Purchase', session_id = 123)

# model training and selection
best = s.compare_models()

# evaluate trained model
s.evaluate_model(best)

# predict on hold-out/test set
pred_holdout = s.predict_model(best)

# predict on new data
new_data = data.copy().drop('Purchase', axis = 1)
predictions = s.predict_model(best, data = new_data)

# save model
s.save_model(best, 'best_pipeline')

📁 Modules

Classification

Functional API OOP API

Regression

Functional API OOP API

Time Series

Functional API OOP API

Clustering

Functional API OOP API

Anomaly Detection

Functional API OOP API

👥 Who should use PyCaret?

PyCaret is an open source library that anybody can use. In our view the ideal target audience of PyCaret is:

  • Experienced Data Scientists who want to increase productivity.
  • Citizen Data Scientists who prefer a low code machine learning solution.
  • Data Science Professionals who want to build rapid prototypes.
  • Data Science and Machine Learning students and enthusiasts.

🎮 Training on GPUs

To train models on the GPU, simply pass use_gpu = True in the setup function. There is no change in the use of the API; however, in some cases, additional libraries have to be installed. The following models can be trained on GPUs:

  • Extreme Gradient Boosting
  • CatBoost
  • Light Gradient Boosting Machine requires GPU installation
  • Logistic Regression, Ridge Classifier, Random Forest, K Neighbors Classifier, K Neighbors Regressor, Support Vector Machine, Linear Regression, Ridge Regression, Lasso Regression requires cuML >= 0.15

🖥️ PyCaret Intel sklearnex support

You can apply Intel optimizations for machine learning algorithms and speed up your workflow. To train models with Intel optimizations use sklearnex engine. There is no change in the use of the API, however, installation of Intel sklearnex is required:

pip install scikit-learn-intelex

🤝 Contributors

📝 License

PyCaret is completely free and open-source and licensed under the MIT license.

ℹ️ More Information

Important Links Description
⭐️ Tutorials Tutorials developed and maintained by core developers
📋 Example Notebooks Example notebooks created by community
📙 Blog Official blog by creator of PyCaret
📚 Documentation API docs
📺 Videos Video resources
✈️ Cheat sheet Community Cheat sheet
📢 Discussions Community Discussion board on GitHub
🛠 Release Notes Release Notes
Popular Machine Learning Projects
Popular Classification Projects
Popular Machine Learning Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Python
Jupyter Notebook
Machine Learning
Classification
Data Science
Gpu
Time Series
Anomaly Detection