Awesome Open Source
Awesome Open Source

Best-of Machine Learning with Python

  A ranked list of awesome machine learning Python libraries. Updated weekly.

This curated list contains 910 awesome open-source projects with a total of 3.6M stars grouped into 34 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome!


  Discover other best-of lists or create your own.
  Subscribe to our newsletter for updates and trending projects.


Contents

Explanation

  •   Combined project-quality score
  •   Star count from GitHub
  •   New project (less than 6 months old)
  •   Inactive project (6 months no activity)
  •   Dead project (12 months no activity)
  •   Project is trending up or down
  •   Project was recently added
  •   Warning (e.g. missing/risky license)
  •   Contributors count from GitHub
  •   Fork count from GitHub
  •   Issue count from GitHub
  •   Last update timestamp on package manager
  •   Download count from package manager
  •   Number of dependent projects
  •   Tensorflow related project
  •   Sklearn related project
  •   PyTorch related project
  •   MxNet related project
  •   Apache Spark related project
  •   Jupyter related project
  •   PaddlePaddle related project
  •   Pandas related project
  •   Jax related project

Machine Learning Frameworks

Back to top

General-purpose machine learning and deep learning frameworks.

Tensorflow (55 170K) - An Open Source Machine Learning Framework for Everyone. Apache-2
  • GitHub ( 4.3K 88K 240K 37K - 6% open 02.02.2023):

     git clone https://github.com/tensorflow/tensorflow
    
  • PyPi ( 15M / month 14K 18.11.2022):

     pip install tensorflow
    
  • Conda ( 4M 08.01.2023):

     conda install -c conda-forge tensorflow
    
  • Docker Hub ( 71M 2.1K 02.02.2023):

     docker pull tensorflow/tensorflow
    
scikit-learn (52 53K) - scikit-learn: machine learning in Python. BSD-3
  • GitHub ( 2.8K 24K 840 450K 11K - 20% open 02.02.2023):

     git clone https://github.com/scikit-learn/scikit-learn
    
  • PyPi ( 34M / month 26K 24.01.2023):

     pip install scikit-learn
    
  • Conda ( 18M 25.01.2023):

     conda install -c conda-forge scikit-learn
    
PyTorch (50 62K) - Tensors and Dynamic neural networks in Python with strong GPU.. BSD-3
  • GitHub ( 3.8K 17K 10K 32K - 34% open 02.02.2023):

     git clone https://github.com/pytorch/pytorch
    
  • PyPi ( 8.3M / month 7.6K 28.06.2022):

     pip install torch
    
  • Conda ( 22M 15.12.2022):

     conda install -c pytorch pytorch
    
Keras (45 57K ) - Deep Learning for humans. Apache-2
  • GitHub ( 1.1K 19K 12K - 3% open 01.02.2023):

     git clone https://github.com/keras-team/keras
    
  • PyPi ( 10M / month 340 14.11.2022):

     pip install keras
    
  • Conda ( 2.9M 21.11.2022):

     conda install -c conda-forge keras
    
StatsModels (45 8.1K) - Statsmodels: statistical modeling and econometrics in Python. BSD-3
  • GitHub ( 400 2.6K 26 79K 5.1K - 48% open 29.01.2023):

     git clone https://github.com/statsmodels/statsmodels
    
  • PyPi ( 9.1M / month 4.7K 02.11.2022):

     pip install statsmodels
    
  • Conda ( 8.6M 04.11.2022):

     conda install -c conda-forge statsmodels
    
PySpark (44 35K) - Apache Spark Python API. Apache-2
  • GitHub ( 2.8K 26K 02.02.2023):

     git clone https://github.com/apache/spark
    
  • PyPi ( 26M / month 840 15.06.2022):

     pip install pyspark
    
  • Conda ( 2.3M 20.12.2022):

     conda install -c conda-forge pyspark
    
XGBoost (44 24K) - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or.. Apache-2
  • GitHub ( 590 8.5K 5.8K 43K 4.7K - 7% open 31.01.2023):

     git clone https://github.com/dmlc/xgboost
    
  • PyPi ( 8.7M / month 1.4K 09.05.2022):

     pip install xgboost
    
  • Conda ( 3.6M 07.11.2022):

     conda install -c conda-forge xgboost
    
jax (43 22K) - Composable transformations of Python+NumPy programs: differentiate,.. Apache-2
  • GitHub ( 500 2K 7.1K 4.1K - 31% open 02.02.2023):

     git clone https://github.com/google/jax
    
  • PyPi ( 780K / month 330 28.06.2022):

     pip install jax
    
  • Conda ( 590K 14.12.2022):

     conda install -c conda-forge jaxlib
    
LightGBM (43 15K) - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT,.. MIT
  • GitHub ( 280 3.7K 180K 18K 2.9K - 8% open 01.02.2023):

     git clone https://github.com/microsoft/LightGBM
    
  • PyPi ( 6.2M / month 640 07.01.2022):

     pip install lightgbm
    
  • Conda ( 1.5M 24.01.2023):

     conda install -c conda-forge lightgbm
    
PaddlePaddle (42 19K) - PArallel Distributed Deep LEarning: Machine Learning.. Apache-2
  • GitHub ( 910 4.8K 15K 190 18K - 16% open 02.02.2023):

     git clone https://github.com/PaddlePaddle/Paddle
    
  • PyPi ( 79K / month 50 05.07.2022):

     pip install paddlepaddle
    
Catboost (42 6.9K) - A fast, scalable, high performance Gradient Boosting on Decision.. Apache-2
  • GitHub ( 1.1K 1.1K 120K 2K - 24% open 19.01.2023):

     git clone https://github.com/catboost/catboost
    
  • PyPi ( 2M / month 240 19.05.2022):

     pip install catboost
    
  • Conda ( 1.2M 30.01.2023):

     conda install -c conda-forge catboost
    
pytorch-lightning (41 21K) - Deep learning framework to train, deploy, and ship AI.. Apache-2
  • GitHub ( 830 2.7K 10K 5.8K - 10% open 02.02.2023):

     git clone https://github.com/Lightning-AI/lightning
    
  • PyPi ( 3M / month 400 01.06.2022):

     pip install pytorch-lightning
    
  • Conda ( 680K 28.01.2023):

     conda install -c conda-forge pytorch-lightning
    
Fastai (39 23K) - The fastai deep learning library. Apache-2
  • GitHub ( 650 7.3K 12K 1.7K - 8% open 21.01.2023):

     git clone https://github.com/fastai/fastai
    
  • PyPi ( 370K / month 300 04.07.2022):

     pip install fastai
    
MXNet (39 20K) - Lightweight, Portable, Flexible Distributed/Mobile Deep Learning.. Apache-2
  • GitHub ( 980 6.9K 26K 3.8K 9.5K - 18% open 26.01.2023):

     git clone https://github.com/apache/incubator-mxnet
    
  • PyPi ( 430K / month 280 17.05.2022):

     pip install mxnet
    
  • Conda ( 8.8K 5 24.10.2022):

     conda install -c anaconda mxnet
    
Jina (39 17K) - Build multimodal AI services via cloud native technologies Neural Search.. Apache-2
  • GitHub ( 160 2K 450 1.8K - 2% open 02.02.2023):

     git clone https://github.com/jina-ai/jina
    
  • PyPi ( 57K / month 2 06.07.2022):

     pip install jina
    
  • Conda ( 36K 16.08.2022):

     conda install -c conda-forge jina-core
    
  • Docker Hub ( 1.2M 7 02.02.2023):

     docker pull jinaai/jina
    
Theano (38 9.7K) - Theano was a Python library that allows you to define, optimize, and.. BSD-3
  • GitHub ( 390 2.5K 13K 2.8K - 24% open 20.12.2022):

     git clone https://github.com/Theano/Theano
    
  • PyPi ( 280K / month 2.8K 27.07.2020):

     pip install theano
    
  • Conda ( 2.3M 16.03.2022):

     conda install -c conda-forge theano
    
PyFlink (37 21K) - Apache Flink Python API. Apache-2
  • GitHub ( 1.7K 12K 02.02.2023):

     git clone https://github.com/apache/flink
    
  • PyPi ( 71K / month 18 05.07.2022):

     pip install apache-flink
    
Vowpal Wabbit (37 8.1K) - Vowpal Wabbit is a machine learning system which pushes the.. BSD-3
  • GitHub ( 330 1.8K 1.2K - 10% open 01.02.2023):

     git clone https://github.com/VowpalWabbit/vowpal_wabbit
    
  • PyPi ( 95K / month 31 06.04.2022):

     pip install vowpalwabbit
    
  • Conda ( 99K 09.11.2022):

     conda install -c conda-forge vowpalwabbit
    
Flax (36 4K) - Flax is a neural network library for JAX that is designed for.. Apache-2
  • GitHub ( 190 460 42 2K 680 - 18% open 02.02.2023):

     git clone https://github.com/google/flax
    
  • PyPi ( 400K / month 75 21.06.2022):

     pip install flax
    
  • Conda ( 18K 04.10.2022):

     conda install -c conda-forge flax
    
Thinc (36 2.7K) - A refreshing functional take on deep learning, compatible with your favorite.. MIT
  • GitHub ( 60 260 28K 140 - 19% open 27.01.2023):

     git clone https://github.com/explosion/thinc
    
  • PyPi ( 4.2M / month 620 22.06.2022):

     pip install thinc
    
  • Conda ( 2.3M 13.01.2023):

     conda install -c conda-forge thinc
    
einops (34 6.3K) - Deep learning operations reinvented (for pytorch, tensorflow, jax and.. MIT
  • GitHub ( 22 280 6.8K 140 - 27% open 18.01.2023):

     git clone https://github.com/arogozhnikov/einops
    
  • PyPi ( 1.7M / month 260 04.03.2022):

     pip install einops
    
  • Conda ( 68K 09.12.2022):

     conda install -c conda-forge einops
    
ivy (33 8.7K) - The Unified Machine Learning Framework. Apache-2
  • GitHub ( 720 2.9K 5.3K - 16% open 02.02.2023):

     git clone https://github.com/unifyai/ivy
    
  • PyPi ( 160 / month 6 01.06.2022):

     pip install ivy-core
    
Ludwig (33 8.7K) - Data-centric declarative deep learning framework. Apache-2
  • GitHub ( 130 1K 150 900 - 25% open 02.02.2023):

     git clone https://github.com/ludwig-ai/ludwig
    
  • PyPi ( 2.2K / month 9 25.06.2022):

     pip install ludwig
    
Chainer (33 5.8K) - A flexible framework of neural networks for deep learning. MIT
  • GitHub ( 320 1.4K 2.9K 2K - 0% open 17.10.2022):

     git clone https://github.com/chainer/chainer
    
  • PyPi ( 21K / month 400 05.01.2022):

     pip install chainer
    
  • Conda ( 12K 21.01.2022):

     conda install -c conda-forge chainer
    
mlpack (33 4.2K) - mlpack: a fast, header-only C++ machine learning library. BSD-3
  • GitHub ( 300 1.5K 1.5K - 2% open 02.02.2023):

     git clone https://github.com/mlpack/mlpack
    
  • PyPi ( 12K / month 1 28.10.2020):

     pip install mlpack
    
  • Conda ( 130K 30.12.2022):

     conda install -c conda-forge mlpack
    
tensorflow-upstream (33 620) - TensorFlow ROCm port. Apache-2
  • GitHub ( 4.3K 74 21 340 - 20% open 27.01.2023):

     git clone https://github.com/ROCmSoftwarePlatform/tensorflow-upstream
    
  • PyPi ( 1.5K / month 5 05.06.2022):

     pip install tensorflow-rocm
    
Sonnet (32 9.5K) - TensorFlow-based neural network library. Apache-2
  • GitHub ( 56 1.3K 990 190 - 17% open 15.12.2022):

     git clone https://github.com/deepmind/sonnet
    
  • PyPi ( 23K / month 53 27.03.2020):

     pip install dm-sonnet
    
  • Conda ( 20K 14.11.2020):

     conda install -c conda-forge sonnet
    
tensorpack (32 6.3K) - A Neural Net Training Interface on TensorFlow, with focus.. Apache-2
  • GitHub ( 58 1.8K 140 1.2K 1.4K - 0% open 26.11.2022):

     git clone https://github.com/tensorpack/tensorpack
    
  • PyPi ( 13K / month 46 22.01.2021):

     pip install tensorpack
    
  • Conda ( 6.9K 06.02.2022):

     conda install -c conda-forge tensorpack
    
skorch (31 5K) - A scikit-learn compatible neural network library that wraps.. BSD-3
  • GitHub ( 54 330 690 460 - 11% open 19.01.2023):

     git clone https://github.com/skorch-dev/skorch
    
  • PyPi ( 79K / month 41 31.10.2021):

     pip install skorch
    
  • Conda ( 700K 30.11.2021):

     conda install -c conda-forge skorch
    
Haiku (31 2.3K) - JAX-based neural network library. Apache-2
  • GitHub ( 69 200 740 250 - 38% open 01.02.2023):

     git clone https://github.com/deepmind/dm-haiku
    
  • PyPi ( 99K / month 42 04.07.2022):

     pip install dm-haiku
    
  • Conda ( 8.9K 21.09.2022):

     conda install -c conda-forge dm-haiku
    
CNTK (30 17K) - Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit. MIT
  • GitHub ( 270 4.4K 14K 3.4K - 24% open 23.09.2022):

     git clone https://github.com/microsoft/CNTK
    
  • PyPi ( 770 / month 18 09.12.2020):

     pip install cntk
    
Ignite (30 4.2K) - High-level library to help with training and evaluating neural.. BSD-3
  • GitHub ( 180 570 1.2K - 12% open 02.02.2023):

     git clone https://github.com/pytorch/ignite
    
  • PyPi ( 85K / month 5 08.11.2022):

     pip install pytorch-ignite
    
  • Conda ( 120K 05.09.2022):

     conda install -c pytorch ignite
    
Neural Network Libraries (30 2.6K) - Neural Network Libraries. Apache-2
  • GitHub ( 71 330 540 88 - 37% open 01.02.2023):

     git clone https://github.com/sony/nnabla
    
  • PyPi ( 5.3K / month 51 19.06.2022):

     pip install nnabla
    
ktrain (30 1.1K) - ktrain is a Python library that makes deep learning and AI more.. Apache-2
  • GitHub ( 15 260 380 440 - 0% open 14.01.2023):

     git clone https://github.com/amaiya/ktrain
    
  • PyPi ( 21K / month 3 20.05.2022):

     pip install ktrain
    
dyNET (29 3.3K) - DyNet: The Dynamic Neural Network Toolkit. Apache-2
  • GitHub ( 160 700 8.9K 230 930 - 28% open 14.08.2022):

     git clone https://github.com/clab/dynet
    
  • PyPi ( 4.8K / month 28 21.10.2020):

     pip install dyNET
    
Neural Tangents (27 1.9K) - Fast and Easy Infinite Neural Networks in Python. Apache-2
  • GitHub ( 23 210 260 61 130 - 38% open 09.01.2023):

     git clone https://github.com/google/neural-tangents
    
  • PyPi ( 4.3K / month 1 23.02.2022):

     pip install neural-tangents
    
Geomstats (27 900) - Computations and statistics on manifolds with geometric structures. MIT
  • GitHub ( 76 200 490 - 37% open 31.01.2023):

     git clone https://github.com/geomstats/geomstats
    
  • PyPi ( 1.4K / month 22.04.2022):

     pip install geomstats
    
  • Conda ( 970 01.06.2022):

     conda install -c conda-forge geomstats
    
Towhee (26 1.8K) - Towhee is a framework that is dedicated to making neural data.. Apache-2
  • GitHub ( 29 180 1.1K 540 - 2% open 02.02.2023):

     git clone https://github.com/towhee-io/towhee
    
  • PyPi ( 830 / month 07.07.2022):

     pip install towhee
    
xLearn (25 3K ) - High performance, easy-to-use, and scalable machine learning (ML).. Apache-2
  • GitHub ( 30 530 3.7K 100 310 - 62% open 05.06.2022):

     git clone https://github.com/aksnzhy/xlearn
    
  • PyPi ( 4.1K / month 12 04.12.2018):

     pip install xlearn
    
fklearn (25 1.4K) - fklearn: Functional Machine Learning. Apache-2
  • GitHub ( 50 160 13 59 - 61% open 21.10.2022):

     git clone https://github.com/nubank/fklearn
    
  • PyPi ( 5.8K / month 30.12.2021):

     pip install fklearn
    
NeuPy (24 730) - NeuPy is a Tensorflow based python library for prototyping and building.. MIT
  • GitHub ( 8 160 140 270 - 12% open 03.01.2023):

     git clone https://github.com/itdxer/neupy
    
  • PyPi ( 4.5K / month 12 04.04.2019):

     pip install neupy
    
mace (22 4.7K ) - MACE is a deep learning inference framework optimized for mobile.. Apache-2
  • GitHub ( 64 810 1.5K 670 - 8% open 30.05.2022):

     git clone https://github.com/XiaoMi/mace
    
Objax (21 730) - Objax is a machine learning framework that provides an Object.. Apache-2
  • GitHub ( 24 69 31 110 - 43% open 30.08.2022):

     git clone https://github.com/google/objax
    
  • PyPi ( 660 / month 3 31.01.2022):

     pip install objax
    
NeoML (21 710) - Machine learning framework for both deep learning and traditional.. Apache-2
  • GitHub ( 33 110 78 - 37% open 02.02.2023):

     git clone https://github.com/neoml-lib/neoml
    
  • PyPi ( 52 / month 31.05.2022):

     pip install neoml
    
ThunderSVM (20 1.5K ) - ThunderSVM: A Fast SVM Library on GPUs and CPUs. Apache-2
  • GitHub ( 34 200 2.6K 210 - 29% open 09.04.2022):

     git clone https://github.com/Xtra-Computing/thundersvm
    
  • PyPi ( 270 / month 13.03.2020):

     pip install thundersvm
    
elegy (19 440 ) - A High Level API for Deep Learning in JAX. MIT
  • GitHub ( 18 31 26 100 - 37% open 23.05.2022):

     git clone https://github.com/poets-ai/elegy
    
  • PyPi ( 470 / month 22.04.2022):

     pip install elegy
    
chefboost (19 380) - A Lightweight Decision Tree Framework supporting regular algorithms:.. MIT
  • GitHub ( 6 92 31 32 - 15% open 25.01.2023):

     git clone https://github.com/serengil/chefboost
    
  • PyPi ( 1.2K / month 16.02.2022):

     pip install chefboost
    
ThunderGBM (17 650) - ThunderGBM: Fast GBDTs and Random Forests on GPUs. Apache-2
  • GitHub ( 10 84 2 76 - 44% open 13.09.2022):

     git clone https://github.com/Xtra-Computing/thundergbm
    
  • PyPi ( 54 / month 01.05.2020):

     pip install thundergbm
    
Show 10 hidden projects...
  • dlib (38 12K) - A toolkit for making real world machine learning and data analysis.. BSL-1.0
  • MindsDB (33 13K) - In-Database Machine Learning. GPL-3.0
  • Turi Create (32 11K ) - Turi Create simplifies the development of custom machine.. BSD-3
  • TFlearn (32 9.6K ) - Deep learning library featuring a higher-level API for TensorFlow. MIT
  • NuPIC (28 6.3K ) - Numenta Platform for Intelligent Computing is an implementation.. AGPL-3.0
  • Lasagne (28 3.8K ) - Lightweight library to build and train neural networks in Theano. MIT
  • SHOGUN (26 2.9K ) - Unified and efficient Machine Learning. BSD-3
  • neon (23 3.9K ) - Intel Nervana reference deep learning framework committed to best.. Apache-2
  • Torchbearer (21 630 ) - torchbearer: A model fitting library for PyTorch. MIT
  • StarSpace (17 3.8K ) - Learning embeddings for classification, retrieval and ranking. MIT

Data Visualization

Back to top

General-purpose and task-specific data visualization libraries.

Matplotlib (50 17K) - matplotlib: plotting with Python. Python-2.0
  • GitHub ( 1.5K 6.7K 720K 9.6K - 19% open 01.02.2023):

     git clone https://github.com/matplotlib/matplotlib
    
  • PyPi ( 31M / month 55K 25.01.2023):

     pip install matplotlib
    
  • Conda ( 16M 27.01.2023):

     conda install -c conda-forge matplotlib
    
Seaborn (43 10K) - Statistical data visualization in Python. BSD-3
  • GitHub ( 180 1.7K 240 210K 2.2K - 4% open 02.02.2023):

     git clone https://github.com/mwaskom/seaborn
    
  • PyPi ( 9.3M / month 9.4K 27.06.2022):

     pip install seaborn
    
  • Conda ( 5.8M 31.12.2022):

     conda install -c conda-forge seaborn
    
Bokeh (42 17K) - Interactive Data Visualization in the browser, from Python. BSD-3
  • GitHub ( 630 4K 190 7.2K - 9% open 01.02.2023):

     git clone https://github.com/bokeh/bokeh
    
  • PyPi ( 3.9M / month 3.6K 20.12.2022):

     pip install bokeh
    
  • Conda ( 9.9M 12.12.2022):

     conda install -c conda-forge bokeh
    
Plotly (42 13K) - The interactive graphing library for Python (includes Plotly Express). MIT
  • GitHub ( 220 2.3K 18 2.6K - 51% open 31.01.2023):

     git clone https://github.com/plotly/plotly.py
    
  • PyPi ( 7.1M / month 4.1K 24.06.2022):

     pip install plotly
    
  • Conda ( 3.7M 24.01.2023):

     conda install -c conda-forge plotly
    
  • npm ( 41K / month 5 12.01.2021):

     npm install plotlywidget
    
Altair (41 8K ) - Declarative statistical visualization library for Python. BSD-3
  • GitHub ( 150 700 2 44K 1.8K - 14% open 27.01.2023):

     git clone https://github.com/altair-viz/altair
    
  • PyPi ( 9M / month 420 29.12.2021):

     pip install altair
    
  • Conda ( 1.6M 31.01.2023):

     conda install -c conda-forge altair
    
dash (40 18K) - Data Apps & Dashboards for Python. No JavaScript Required. MIT
  • GitHub ( 130 1.8K 40K 1.5K - 50% open 30.01.2023):

     git clone https://github.com/plotly/dash
    
  • PyPi ( 1.1M / month 1.2K 13.06.2022):

     pip install dash
    
  • Conda ( 800K 31.01.2023):

     conda install -c conda-forge dash
    
pyecharts (37 13K) - Python Echarts Plotting Library. MIT
  • GitHub ( 38 2.8K 2.8K 1.7K - 0% open 08.01.2023):

     git clone https://github.com/pyecharts/pyecharts
    
  • PyPi ( 100K / month 220 16.11.2021):

     pip install pyecharts
    
pandas-profiling (36 10K ) - Create HTML profiling reports from pandas DataFrame.. MIT
  • GitHub ( 110 1.4K 3 630 - 20% open 30.01.2023):

     git clone https://github.com/ydataai/pandas-profiling
    
  • PyPi ( 1.1M / month 170 27.09.2021):

     pip install pandas-profiling
    
  • Conda ( 330K 25.01.2023):

     conda install -c conda-forge pandas-profiling
    
FiftyOne (36 2.5K) - Visualize, create, and debug image and video datasets.. Apache-2
  • GitHub ( 72 290 240 1.1K - 36% open 02.02.2023):

     git clone https://github.com/voxel51/fiftyone
    
  • PyPi ( 80K / month 2 24.06.2022):

     pip install fiftyone
    
HoloViews (36 2.4K ) - With Holoviews, your data visualizes itself. BSD-3
  • GitHub ( 130 370 3K - 32% open 01.02.2023):

     git clone https://github.com/holoviz/holoviews
    
  • PyPi ( 440K / month 220 07.07.2022):

     pip install holoviews
    
  • Conda ( 1.1M 17.01.2023):

     conda install -c conda-forge holoviews
    
  • npm ( 880 / month 24.05.2020):

     npm install @pyviz/jupyterlab_pyviz
    
PyQtGraph (35 3.1K) - Fast data visualization and GUI tools for scientific / engineering.. MIT
  • GitHub ( 250 980 1.1K - 30% open 30.01.2023):

     git clone https://github.com/pyqtgraph/pyqtgraph
    
  • PyPi ( 120K / month 820 04.03.2022):

     pip install pyqtgraph
    
  • Conda ( 350K 03.10.2022):

     conda install -c conda-forge pyqtgraph
    
UMAP (34 6K) - Uniform Manifold Approximation and Projection. BSD-3
  • GitHub ( 110 700 7.5K 680 - 54% open 30.01.2023):

     git clone https://github.com/lmcinnes/umap
    
  • PyPi ( 770K / month 330 13.04.2022):

     pip install umap-learn
    
  • Conda ( 1.7M 14.04.2022):

     conda install -c conda-forge umap-learn
    
VisPy (34 3K) - High-performance interactive 2D/3D data visualization library. BSD-3
  • GitHub ( 180 600 980 1.4K - 22% open 23.01.2023):

     git clone https://github.com/vispy/vispy
    
  • PyPi ( 42K / month 100 04.07.2022):

     pip install vispy
    
  • Conda ( 340K 14.11.2022):

     conda install -c conda-forge vispy
    
  • npm ( 12 / month 15.03.2020):

     npm install vispy
    
PyVista (34 1.6K) - 3D plotting and mesh analysis through a streamlined interface for.. MIT
  • GitHub ( 120 300 690 1.2K 1.1K - 28% open 02.02.2023):

     git clone https://github.com/pyvista/pyvista
    
  • PyPi ( 87K / month 120 02.11.2022):

     pip install pyvista
    
  • Conda ( 280K 03.11.2022):

     conda install -c conda-forge pyvista
    
wordcloud (33 9.2K) - A little word cloud generator in Python. MIT
  • GitHub ( 67 2.3K 500 - 25% open 04.01.2023):

     git clone https://github.com/amueller/word_cloud
    
  • PyPi ( 800K / month 740 27.06.2022):

     pip install wordcloud
    
  • Conda ( 370K 25.08.2022):

     conda install -c conda-forge wordcloud
    
datashader (33 2.9K) - Quickly and accurately render even the largest data. BSD-3
  • GitHub ( 51 360 1.8K 530 - 23% open 02.02.2023):

     git clone https://github.com/holoviz/datashader
    
  • PyPi ( 58K / month 97 21.06.2022):

     pip install datashader
    
  • Conda ( 480K 18.11.2022):

     conda install -c conda-forge datashader
    
Graphviz (33 1.3K) - Simple Python interface for Graphviz. MIT
  • GitHub ( 19 180 40K 150 - 4% open 30.01.2023):

     git clone https://github.com/xflr6/graphviz
    
  • PyPi ( 8.8M / month 3K 16.04.2022):

     pip install graphviz
    
  • Conda ( 33K 10.08.2022):

     conda install -c anaconda python-graphviz
    
plotnine (32 3.3K) - A grammar of graphics for Python. MIT
  • GitHub ( 99 180 540 - 13% open 01.02.2023):

     git clone https://github.com/has2k1/plotnine
    
  • PyPi ( 270K / month 220 01.07.2022):

     pip install plotnine
    
  • Conda ( 240K 10.10.2022):

     conda install -c conda-forge plotnine
    
bqplot (31 3.4K) - Plotting library for IPython/Jupyter notebooks. Apache-2
  • GitHub ( 59 470 38 600 - 39% open 02.02.2023):

     git clone https://github.com/bqplot/bqplot
    
  • PyPi ( 130K / month 97 11.02.2022):

     pip install bqplot
    
  • Conda ( 1.1M 02.09.2022):

     conda install -c conda-forge bqplot
    
  • npm ( 3.5K / month 11 02.09.2022):

     npm install bqplot
    
Perspective (30 5.2K) - A data visualization and analytics component, especially.. Apache-2
  • GitHub ( 76 570 7 610 - 16% open 02.02.2023):

     git clone https://github.com/finos/perspective
    
  • PyPi ( 4.1K / month 9 06.06.2022):

     pip install perspective-python
    
  • Conda ( 250K 19.01.2023):

     conda install -c conda-forge perspective
    
  • npm ( 1.3K / month 1 20.01.2023):

     npm install @finos/perspective-jupyterlab
    
D-Tale (30 3.8K) - Visualizer for pandas data structures. LGPL-2.1
  • GitHub ( 29 320 610 490 - 9% open 23.01.2023):

     git clone https://github.com/man-group/dtale
    
  • PyPi ( 130K / month 12 17.06.2022):

     pip install dtale
    
  • Conda ( 190K 23.01.2023):

     conda install -c conda-forge dtale
    
hvPlot (30 710) - A high-level plotting API for pandas, dask, xarray, and networkx built on.. BSD-3
  • GitHub ( 39 79 2.3K 590 - 40% open 29.12.2022):

     git clone https://github.com/holoviz/hvplot
    
  • PyPi ( 190K / month 65 23.06.2022):

     pip install hvplot
    
  • Conda ( 300K 25.11.2022):

     conda install -c conda-forge hvplot
    
Facets Overview (29 7.1K ) - Visualizations for machine learning datasets. Apache-2
  • GitHub ( 29 900 160 160 - 52% open 30.01.2023):

     git clone https://github.com/pair-code/facets
    
  • PyPi ( 390K / month 7 24.07.2019):

     pip install facets-overview
    
missingno (29 3.4K ) - Missing data visualization module for Python. MIT
  • GitHub ( 17 440 10K 130 - 7% open 27.02.2022):

     git clone https://github.com/ResidentMario/missingno
    
  • PyPi ( 360K / month 120 27.02.2022):

     pip install missingno
    
  • Conda ( 250K 15.02.2020):

     conda install -c conda-forge missingno
    
mpld3 (29 2.2K) - D3 Renderings of Matplotlib Graphics. BSD-3
  • GitHub ( 51 350 36 360 - 59% open 10.12.2022):

     git clone https://github.com/mpld3/mpld3
    
  • PyPi ( 280K / month 400 27.05.2022):

     pip install mpld3
    
  • Conda ( 170K 10.12.2022):

     conda install -c conda-forge mpld3
    
  • npm ( 620 / month 5 10.12.2022):

     npm install mpld3
    
data-validation (29 690) - Library for exploring and validating machine learning.. Apache-2
  • GitHub ( 25 140 370 620 170 - 21% open 31.01.2023):

     git clone https://github.com/tensorflow/data-validation
    
  • PyPi ( 860K / month 27 29.06.2022):

     pip install tensorflow-data-validation
    
AutoViz (27 1.2K) - Automatically Visualize any dataset, any size with a single line of.. Apache-2
  • GitHub ( 14 160 340 65 - 4% open 30.12.2022):

     git clone https://github.com/AutoViML/AutoViz
    
  • PyPi ( 120K / month 6 17.06.2022):

     pip install autoviz
    
  • Conda ( 25K 03.10.2022):

     conda install -c conda-forge autoviz
    
openTSNE (27 1.1K) - Extensible, parallel implementations of t-SNE. BSD-3
  • GitHub ( 10 130 480 120 - 5% open 02.02.2023):

     git clone https://github.com/pavlin-policar/openTSNE
    
  • PyPi ( 68K / month 11 18.03.2022):

     pip install opentsne
    
  • Conda ( 180K 06.12.2022):

     conda install -c conda-forge opentsne
    
pythreejs (27 850) - A Jupyter - Three.js bridge. BSD-3
  • GitHub ( 30 180 24 220 - 25% open 25.08.2022):

     git clone https://github.com/jupyter-widgets/pythreejs
    
  • PyPi ( 74K / month 42 26.02.2021):

     pip install pythreejs
    
  • Conda ( 460K 06.09.2022):

     conda install -c conda-forge pythreejs
    
  • npm ( 4.3K / month 7 24.08.2022):

     npm install jupyter-threejs
    
lets-plot (27 830) - An open-source plotting library for statistical data. MIT
  • GitHub ( 18 42 380 22 310 - 23% open 02.02.2023):

     git clone https://github.com/JetBrains/lets-plot
    
  • PyPi ( 1.4K / month 1 20.06.2022):

     pip install lets-plot
    
Chartify (26 3.3K) - Python library that makes it easy for data scientists to create.. Apache-2
  • GitHub ( 23 300 71 75 - 58% open 13.12.2022):

     git clone https://github.com/spotify/chartify
    
  • PyPi ( 7.7K / month 9 02.11.2020):

     pip install chartify
    
  • Conda ( 24K 07.11.2020):

     conda install -c conda-forge chartify
    
HyperTools (25 1.8K ) - A Python toolbox for gaining geometric insights into high-.. MIT
  • GitHub ( 21 160 24 240 190 - 33% open 12.02.2022):

     git clone https://github.com/ContextLab/hypertools
    
  • PyPi ( 550 / month 10 12.02.2022):

     pip install hypertools
    
Pandas-Bokeh (24 830 ) - Bokeh Plotting Backend for Pandas and GeoPandas. MIT
  • GitHub ( 14 100 420 100 - 33% open 25.03.2022):

     git clone https://github.com/PatrikHlobil/Pandas-Bokeh
    
  • PyPi ( 23K / month 11 11.04.2021):

     pip install pandas-bokeh
    
HiPlot (23 2.4K ) - HiPlot makes understanding high dimensional data easy. MIT
  • GitHub ( 8 120 7 84 - 15% open 05.12.2022):

     git clone https://github.com/facebookresearch/hiplot
    
  • PyPi ( 39K / month 12 05.07.2022):

     pip install hiplot
    
  • Conda ( 120K 31.05.2022):

     conda install -c conda-forge hiplot
    
Sweetviz (23 2.3K ) - Visualize and compare datasets, target values and associations, with.. MIT
  • GitHub ( 6 220 100 - 31% open 08.06.2022):

     git clone https://github.com/fbdesignpro/sweetviz
    
  • PyPi ( 70K / month 8 14.06.2022):

     pip install sweetviz
    
  • Conda ( 18K 15.06.2022):

     conda install -c conda-forge sweetviz
    
Plotly-Resampler (23 640) - Visualize large time series data with plotly.py. MIT
  • GitHub ( 9 40 83 90 - 33% open 02.02.2023):

     git clone https://github.com/predict-idlab/plotly-resampler
    
  • PyPi ( 41K / month 1 29.06.2022):

     pip install plotly-resampler
    
  • Conda ( 12K 31.01.2023):

     conda install -c conda-forge plotly-resampler
    
vega (23 340) - IPython/Jupyter notebook module for Vega and Vega-Lite. BSD-3
  • GitHub ( 11 59 2 95 - 13% open 01.12.2022):

     git clone https://github.com/vega/ipyvega
    
  • PyPi ( 11K / month 84 10.02.2022):

     pip install vega
    
  • Conda ( 530K 05.12.2022):

     conda install -c conda-forge vega
    
python-ternary (22 620) - Ternary plotting library for python with matplotlib. MIT
  • GitHub ( 27 140 18 120 140 - 27% open 31.12.2022):

     git clone https://github.com/marcharper/python-ternary
    
  • PyPi ( 43K / month 21 17.02.2021):

     pip install python-ternary
    
  • Conda ( 71K 17.02.2021):

     conda install -c conda-forge python-ternary
    
Popmon (22 410) - Monitor the stability of a Pandas or Spark dataframe. MIT
  • GitHub ( 16 31 38 18 46 - 28% open 19.10.2022):

     git clone https://github.com/ing-bank/popmon
    
  • PyPi ( 33K / month 2 21.06.2022):

     pip install popmon
    
PyWaffle (19 530 ) - Make Waffle Charts in Python. MIT
  • GitHub ( 6 99 190 19 - 26% open 08.06.2022):

     git clone https://github.com/gyli/PyWaffle
    
  • PyPi ( 3.6K / month 2 08.06.2022):

     pip install pywaffle
    
  • Conda ( 8.3K 05.06.2022):

     conda install -c conda-forge pywaffle
    
Show 14 hidden projects...
  • cartopy (32 1.2K) - Cartopy - a cartographic python library with matplotlib support. LGPL-3.0
  • Cufflinks (29 2.7K ) - Productivity Tools for Plotly + Pandas. MIT
  • PandasGUI (23 2.8K ) - A GUI for Pandas DataFrames. MIT-0
  • Multicore-TSNE (23 1.8K ) - Parallel t-SNE implementation with Python and Torch.. BSD-3
  • PDPbox (22 730 ) - python partial dependence plot toolbox. MIT
  • pivottablejs (22 520 ) - Dragndrop Pivot Tables and Charts for Jupyter/IPython.. MIT
  • joypy (21 470 ) - Joyplots in Python with matplotlib & pandas. MIT
  • ivis (20 290) - Dimensionality reduction in very large datasets using Siamese.. Apache-2
  • vegafusion (20 150) - Serverside scaling for the Vega and Altair visualizations. BSD-3
  • animatplot (18 400 ) - A python package for animating plots build on matplotlib. MIT
  • data-describe (18 290 ) - datadescribe: Pythonic EDA Accelerator for Data Science. Apache-2
  • pdvega (17 340 ) - Interactive plotting for Pandas using Vega-Lite. MIT
  • nx-altair (15 210 ) - Draw interactive NetworkX graphs with Altair. MIT
  • nptsne (12 30 ) - nptsne is a numpy compatible python binary package that offers a.. Apache-2

Text Data & NLP

Back to top

Libraries for processing, cleaning, manipulating, and analyzing text data as well as libraries for NLP tasks such as language detection, fuzzy matching, classification, seq2seq learning, conversational AI, keyword extraction, and translation.

transformers (49 79K) - Transformers: State-of-the-art Machine Learning for.. Apache-2
  • GitHub ( 1.7K 18K 640 48K 11K - 4% open 02.02.2023):

     git clone https://github.com/huggingface/transformers
    
  • PyPi ( 9.1M / month 970 21.06.2022):

     pip install transformers
    
  • Conda ( 790K 25.01.2023):

     conda install -c conda-forge transformers
    
nltk (45 11K ) - Suite of libraries and programs for symbolic and statistical natural.. Apache-2
  • GitHub ( 440 2.7K 170K 1.7K - 13% open 02.02.2023):

     git clone https://github.com/nltk/nltk
    
  • PyPi ( 11M / month 12K 09.02.2022):

     pip install nltk
    
  • Conda ( 1.7M 02.01.2023):

     conda install -c conda-forge nltk
    
spaCy (44 25K) - Industrial-strength Natural Language Processing (NLP) in Python. MIT
  • GitHub ( 720 4K 50K 5.4K - 1% open 02.02.2023):

     git clone https://github.com/explosion/spaCy
    
  • PyPi ( 4.8M / month 2.4K 05.04.2022):

     pip install spacy
    
  • Conda ( 2.9M 24.01.2023):

     conda install -c conda-forge spacy
    
Rasa (41 16K) - Open source machine learning framework to automate text- and voice-.. Apache-2
  • GitHub ( 570 4.2K 3K 6.8K - 1% open 02.02.2023):

     git clone https://github.com/RasaHQ/rasa
    
  • PyPi ( 100K / month 60 06.07.2022):

     pip install rasa
    
gensim (41 14K) - Topic Modelling for Humans. LGPL-2.1
  • GitHub ( 440 4.3K 4.2K 41K 1.8K - 21% open 21.12.2022):

     git clone https://github.com/RaRe-Technologies/gensim
    
  • PyPi ( 4.3M / month 2.9K 01.05.2022):

     pip install gensim
    
  • Conda ( 970K 21.12.2022):

     conda install -c conda-forge gensim
    
flair (39 12K) - A very simple framework for state-of-the-art Natural Language Processing.. MIT
  • GitHub ( 240 2K 1.9K 2.1K - 4% open 01.02.2023):

     git clone https://github.com/flairNLP/flair
    
  • PyPi ( 84K / month 76 20.05.2022):

     pip install flair
    
  • Conda ( 15K 21.05.2022):

     conda install -c conda-forge python-flair
    
fairseq (36 21K) - Facebook AI Research Sequence-to-Sequence Toolkit written in Python. MIT
  • GitHub ( 410 5.2K 290 1.2K 3.8K - 23% open 26.01.2023):

     git clone https://github.com/facebookresearch/fairseq
    
  • PyPi ( 53K / month 39 27.06.2022):

     pip install fairseq
    
  • Conda ( 23K 13.07.2022):

     conda install -c conda-forge fairseq
    
sentence-transformers (36 9.3K) - Multilingual Sentence & Image Embeddings with BERT. Apache-2
  • GitHub ( 110 1.8K 5.5K 1.6K - 53% open 26.01.2023):

     git clone https://github.com/UKPLab/sentence-transformers
    
  • PyPi ( 1.8M / month 130 26.06.2022):

     pip install sentence-transformers
    
  • Conda ( 74K 27.06.2022):

     conda install -c conda-forge sentence-transformers
    
spark-nlp (36 3.1K) - State of the Art Natural Language Processing. Apache-2
  • GitHub ( 140 630 230 770 - 4% open 02.02.2023):

     git clone https://github.com/JohnSnowLabs/spark-nlp
    
  • PyPi ( 2.8M / month 13 01.07.2022):

     pip install spark-nlp
    
fastText (35 24K ) - Library for fast text representation and classification. MIT
  • GitHub ( 59 4.5K 3.8K 1.1K - 46% open 04.03.2022):

     git clone https://github.com/facebookresearch/fastText
    
  • PyPi ( 1.1M / month 190 28.04.2020):

     pip install fasttext
    
  • Conda ( 47K 01.11.2022):

     conda install -c conda-forge fasttext
    
AllenNLP (35 11K ) - An open-source NLP research library, built on PyTorch. Apache-2
  • GitHub ( 260 2.2K 53 3.1K 2.6K - 3% open 22.11.2022):

     git clone https://github.com/allenai/allennlp
    
  • PyPi ( 110K / month 180 14.04.2022):

     pip install allennlp
    
  • Conda ( 96K 15.07.2022):

     conda install -c conda-forge allennlp
    
sentencepiece (35 6.5K) - Unsupervised text tokenizer for Neural Network-based text.. Apache-2
  • GitHub ( 73 840 24K 22K 580 - 5% open 24.01.2023):

     git clone https://github.com/google/sentencepiece
    
  • PyPi ( 7.7M / month 410 18.06.2021):

     pip install sentencepiece
    
  • Conda ( 320K 22.01.2023):

     conda install -c conda-forge sentencepiece
    
ParlAI (34 9.8K) - A framework for training and evaluating AI models on a variety of.. MIT
  • GitHub ( 200 1.9K 100 1.5K - 5% open 19.01.2023):

     git clone https://github.com/facebookresearch/ParlAI
    
  • PyPi ( 2.7K / month 3 30.03.2022):

     pip install parlai
    
haystack (34 6.6K) - Haystack is an open source NLP framework that leverages pre-trained.. Apache-2
  • GitHub ( 160 1K 15 290 1.9K - 16% open 02.02.2023):

     git clone https://github.com/deepset-ai/haystack
    
  • PyPi ( 1.5K / month 85 15.12.2021):

     pip install haystack
    
OpenNMT (34 5.9K) - Open Source Neural Machine Translation in PyTorch. MIT
  • GitHub ( 180 2.1K 160 1.4K - 3% open 30.01.2023):

     git clone https://github.com/OpenNMT/OpenNMT-py
    
  • PyPi ( 3.5K / month 9 14.09.2021):

     pip install OpenNMT-py
    
torchtext (34 3.2K) - Data loaders and abstractions for text and NLP. BSD-3
  • GitHub ( 140 760 770 - 38% open 02.02.2023):

     git clone https://github.com/pytorch/text
    
  • PyPi ( 530K / month 440 28.06.2022):

     pip install torchtext
    
Tokenizers (33 6.4K) - Fast State-of-the-Art Tokenizers optimized for Research and.. Apache-2
  • GitHub ( 66 530 56 720 - 32% open 23.01.2023):

     git clone https://github.com/huggingface/tokenizers
    
  • PyPi ( 8.4M / month 130 13.04.2022):

     pip install tokenizers
    
  • Conda ( 740K 26.01.2023):

     conda install -c conda-forge tokenizers
    
NeMo (33 5.5K) - NeMo: a toolkit for conversational AI. Apache-2
  • GitHub ( 200 1.4K 22K 1.5K - 5% open 01.02.2023):

     git clone https://github.com/NVIDIA/NeMo
    
  • PyPi ( 23K / month 8 01.07.2022):

     pip install nemo-toolkit
    
TensorFlow Text (33 1K) - Making text a first-class citizen in TensorFlow. Apache-2
  • GitHub ( 99 260 2.8K 260 - 41% open 02.02.2023):

     git clone https://github.com/tensorflow/text
    
  • PyPi ( 3.4M / month 84 18.05.2022):

     pip install tensorflow-text
    
stanza (32 6.5K) - Official Stanford NLP Python Library for Many Human Languages. Apache-2
  • GitHub ( 54 820 1.5K 750 - 10% open 15.09.2022):

     git clone https://github.com/stanfordnlp/stanza
    
  • PyPi ( 160K / month 70 23.04.2022):

     pip install stanza
    
  • Conda ( 6.2K 14.09.2022):

     conda install -c stanfordnlp stanza
    
Dedupe (32 3.6K) - A python library for accurate and scalable fuzzy matching, record.. MIT
  • GitHub ( 69 490 250 790 - 8% open 29.01.2023):

     git clone https://github.com/dedupeio/dedupe
    
  • PyPi ( 240K / month 48 06.07.2022):

     pip install dedupe
    
  • Conda ( 22K 12.12.2022):

     conda install -c conda-forge dedupe
    
snowballstemmer (32 610) - Snowball compiler and stemming algorithms. BSD-3
  • GitHub ( 30 160 4 78 - 35% open 16.11.2022):

     git clone https://github.com/snowballstem/snowball
    
  • PyPi ( 8.6M / month 6.7K 16.11.2021):

     pip install snowballstemmer
    
  • Conda ( 5.9M 17.11.2021):

     conda install -c conda-forge snowballstemmer
    
DeepPavlov (31 6K) - An open source library for deep learning end-to-end dialog.. Apache-2
  • GitHub ( 73 1.1K 320 620 - 8% open 10.01.2023):

     git clone https://github.com/deepmipt/DeepPavlov
    
  • PyPi ( 7.3K / month 6 31.05.2022):

     pip install deeppavlov
    
jellyfish (31 1.8K) - a python library for doing approximate and phonetic matching of.. BSD-2
  • GitHub ( 28 150 5K 120 - 10% open 31.12.2022):

     git clone https://github.com/jamesturk/jellyfish
    
  • PyPi ( 2.3M / month 410 07.01.2022):

     pip install jellyfish
    
  • Conda ( 450K 28.10.2022):

     conda install -c conda-forge jellyfish
    
GluonNLP (30 2.5K) - Toolkit that enables easy text preprocessing, datasets loading.. Apache-2
  • GitHub ( 84 520 1.1K 560 - 46% open 25.12.2022):

     git clone https://github.com/dmlc/gluon-nlp
    
  • PyPi ( 200K / month 22 13.08.2020):

     pip install gluonnlp
    
nlpaug (29 3.8K ) - Data augmentation for NLP. MIT
  • GitHub ( 33 420 540 200 - 25% open 07.07.2022):

     git clone https://github.com/makcedward/nlpaug
    
  • PyPi ( 150K / month 19 07.07.2022):

     pip install nlpaug
    
  • Conda ( 6.9K 30.01.2023):

     conda install -c conda-forge nlpaug
    
ftfy (29 3.4K) - Fixes mojibake and other glitches in Unicode text, after the fact. MIT
  • GitHub ( 18 110 8.7K 130 - 9% open 25.10.2022):

     git clone https://github.com/rspeer/python-ftfy
    
  • PyPi ( 3.1M / month 490 09.02.2022):

     pip install ftfy
    
  • Conda ( 220K 13.03.2022):

     conda install -c conda-forge ftfy
    
Sumy (29 3K) - Module for automatic summarization of text documents and HTML pages. Apache-2
  • GitHub ( 24 480 1.6K 110 - 15% open 23.10.2022):

     git clone https://github.com/miso-belica/sumy
    
  • PyPi ( 27K / month 100 21.04.2022):

     pip install sumy
    
  • Conda ( 3.8K 25.10.2022):

     conda install -c conda-forge sumy
    
SciSpacy (29 1.3K) - A full spaCy pipeline and models for scientific/biomedical documents. Apache-2
  • GitHub ( 27 180 600 280 - 10% open 06.11.2022):

     git clone https://github.com/allenai/scispacy
    
  • PyPi ( 29K / month 19 10.03.2022):

     pip install scispacy
    
spacy-transformers (29 1.2K) - Use pretrained transformers like BERT, XLNet and GPT-2.. MIT spacy
  • GitHub ( 20 160 830 30.01.2023):

     git clone https://github.com/explosion/spacy-transformers
    
  • PyPi ( 190K / month 22 05.07.2022):

     pip install spacy-transformers
    
  • Conda ( 8.6K 27.01.2023):

     conda install -c conda-forge spacy-transformers
    
TextDistance (28 3K) - Compute distance between sequences. 30+ algorithms, pure python.. MIT
  • GitHub ( 13 240 880 3.5K 18.09.2022):

     git clone https://github.com/life4/textdistance
    
  • PyPi ( 470K / month 43 29.06.2022):

     pip install textdistance
    
  • Conda ( 310K 18.09.2022):

     conda install -c conda-forge textdistance
    
scattertext (28 2K) - Beautiful visualizations of how language differs among document.. Apache-2
  • GitHub ( 13 270 350 94 - 20% open 01.02.2023):

     git clone https://github.com/JasonKessler/scattertext
    
  • PyPi ( 8.2K / month 10 26.03.2022):

     pip install scattertext
    
  • Conda ( 74K 08.12.2022):

     conda install -c conda-forge scattertext
    
textacy (28 2K ) - NLP, before and after spaCy. Apache-2
  • GitHub ( 32 240 260 - 11% open 06.03.2022):

     git clone https://github.com/chartbeat-labs/textacy
    
  • PyPi ( 57K / month 100 06.12.2021):

     pip install textacy
    
  • Conda ( 120K 06.02.2022):

     conda install -c conda-forge textacy
    
CLTK (28 760) - The Classical Language Toolkit. MIT
  • GitHub ( 120 310 25 220 540 - 5% open 16.10.2022):

     git clone https://github.com/cltk/cltk
    
  • PyPi ( 1.4K / month 42 09.06.2022):

     pip install cltk
    
english-words (27 8.6K) - A text file containing 479k English words for all your.. Unlicense
  • GitHub ( 30 1.6K 120 - 72% open 08.11.2022):

     git clone https://github.com/dwyl/english-words
    
  • PyPi ( 220K / month 7 29.01.2022):

     pip install english-words
    
T5 (27 4.7K) - Code for the paper Exploring the Limits of Transfer Learning with a.. Apache-2
  • GitHub ( 54 640 140 440 - 21% open 17.01.2023):

     git clone https://github.com/google-research/text-to-text-transfer-transformer
    
  • PyPi ( 11K / month 2 18.10.2021):

     pip install t5
    
vaderSentiment (27 3.9K ) - VADER Sentiment Analysis. VADER (Valence Aware Dictionary.. MIT
  • GitHub ( 11 930 4.8K 120 - 37% open 01.04.2022):

     git clone https://github.com/cjhutto/vaderSentiment
    
  • PyPi ( 120K / month 180 22.05.2020):

     pip install vadersentiment
    
  • Conda ( 11K 22.03.2021):

     conda install -c conda-forge vadersentiment
    
rubrix (27 1.6K) - Open-source tool for data-centric NLP. Argilla helps domain experts.. Apache-2
  • GitHub ( 36 140 8 860 - 18% open 01.02.2023):

     git clone https://github.com/recognai/rubrix
    
  • PyPi ( 990 / month 08.06.2022):

     pip install rubrix
    
  • Conda ( 17K 06.10.2022):

     conda install -c conda-forge rubrix
    
Ciphey (26 11K) - Automatically decrypt encryptions without knowing the key or cipher, decode.. MIT
  • GitHub ( 46 730 310 - 16% open 05.12.2022):

     git clone https://github.com/Ciphey/Ciphey
    
  • PyPi ( 18K / month 06.06.2021):

     pip install ciphey
    
  • Docker Hub ( 17K 9 17.12.2022):

     docker pull remnux/ciphey
    
fastNLP (26 2.8K) - fastNLP: A Modularized and Extensible NLP Framework. Currently still.. Apache-2
  • GitHub ( 61 440 66 120 210 - 25% open 13.12.2022):

     git clone https://github.com/fastnlp/fastNLP
    
  • PyPi ( 6.3K / month 3 04.02.2019):

     pip install fastnlp
    
PyTextRank (26 2K ) - Python implementation of TextRank algorithms (textgraphs) for.. MIT
  • GitHub ( 18 340 340 92 - 21% open 27.07.2022):

     git clone https://github.com/DerwenAI/pytextrank
    
  • PyPi ( 95K / month 13 06.03.2022):

     pip install pytextrank
    
DeepKE (26 1.6K) - An Open Toolkit for Knowledge Graph Extraction and Construction.. MIT
  • GitHub ( 16 410 12 200 - 1% open 28.01.2023):

     git clone https://github.com/zjunlp/deepke
    
  • PyPi ( 670 / month 05.07.2022):

     pip install deepke
    
sense2vec (25 1.5K) - Contextually-keyed word vectors. MIT
  • GitHub ( 18 230 46K 220 110 - 18% open 08.12.2022):

     git clone https://github.com/explosion/sense2vec
    
  • PyPi ( 3.5K / month 11 19.04.2021):

     pip install sense2vec
    
  • Conda ( 31K 14.07.2021):

     conda install -c conda-forge sense2vec
    
detoxify (25 550) - Trained models & code to predict toxic comments on all 3 Jigsaw.. Apache-2
  • GitHub ( 9 75 120K 200 47 - 57% open 19.12.2022):

     git clone https://github.com/unitaryai/detoxify
    
  • PyPi ( 55K / month 2 12.04.2022):

     pip install detoxify
    
PyText (23 6.4K) - A natural language modeling framework based on PyTorch. BSD-3
  • GitHub ( 230 800 310 220 - 66% open 17.10.2022):

     git clone https://github.com/facebookresearch/pytext
    
  • PyPi ( 170 / month 1 08.06.2020):

     pip install pytext-nlp
    
qdrant (23 3.9K) - Qdrant - Vector Search Engine and Database for the next generation of.. Apache-2
  • GitHub ( 30 170 380 - 9% open 13.01.2023):

     git clone https://github.com/qdrant/qdrant
    
gpt-2-simple (23 3.1K ) - Python package to easily retrain OpenAIs GPT-2 text-.. MIT
  • GitHub ( 21 630 380 270 - 63% open 22.05.2022):

     git clone https://github.com/minimaxir/gpt-2-simple
    
  • PyPi ( 5.2K / month 6 18.10.2021):

     pip install gpt-2-simple
    
Sockeye (23 1.1K) - Sequence-to-sequence framework with a focus on Neural Machine.. Apache-2
  • GitHub ( 59 330 16 03.01.2023):

     git clone https://github.com/awslabs/sockeye
    
  • PyPi ( 840 / month 2 05.05.2022):

     pip install sockeye
    
promptsource (23 1.1K) - Toolkit for creating, sharing and using natural language.. Apache-2
  • GitHub ( 59 190 25 160 - 14% open 20.01.2023):

     git clone https://github.com/bigscience-workshop/promptsource
    
  • PyPi ( 3.7K / month 18.04.2022):

     pip install promptsource
    
Texthero (22 2.7K) - Text preprocessing, representation and visualization from zero to hero. MIT
  • GitHub ( 20 230 98 140 - 55% open 28.10.2022):

     git clone https://github.com/jbesomi/texthero
    
  • PyPi ( 27K / month 6 01.07.2021):

     pip install texthero
    
OpenPrompt (22 2.3K) - An Open-Source Framework for Prompt-Learning. Apache-2
  • GitHub ( 18 270 31 200 - 25% open 09.11.2022):

     git clone https://github.com/thunlp/OpenPrompt
    
  • PyPi ( 1.3K / month 06.07.2022):

     pip install openprompt
    
fast-bert (22 1.8K) - Super easy library for BERT based NLP models. Apache-2
  • GitHub ( 36 340 250 - 62% open 27.09.2022):

     git clone https://github.com/utterworks/fast-bert
    
  • PyPi ( 2.4K / month 2 03.06.2022):

     pip install fast-bert
    
FARM (22 1.6K) - Fast & easy transfer learning for NLP. Harvesting language models.. Apache-2
  • GitHub ( 37 230 440 - 8% open 31.08.2022):

     git clone https://github.com/deepset-ai/FARM
    
  • PyPi ( 3.4K / month 2 10.06.2021):

     pip install farm
    
  • Conda ( 2.1K 14.06.2021):

     conda install -c conda-forge farm
    
lightseq (21 2.6K) - LightSeq: A High Performance Library for Sequence Processing and.. Apache-2
  • GitHub ( 15 280 650 250 - 58% open 01.02.2023):

     git clone https://github.com/bytedance/lightseq
    
  • PyPi ( 930 / month 1 26.01.2022):

     pip install lightseq
    
jiant (21 1.5K) - jiant is an nlp toolkit. MIT
  • GitHub ( 59 280 2 550 - 11% open 17.10.2022):

     git clone https://github.com/nyu-mll/jiant
    
  • PyPi ( 72 / month 10.05.2021):

     pip install jiant
    
happy-transformer (21 390) - A package built on top of Hugging Faces transformers.. Apache-2 huggingface
  • GitHub ( 14 49 120 110 - 16% open 31.10.2022):

     git clone https://github.com/EricFillion/happy-transformer
    
  • PyPi ( 6.1K / month 2 06.02.2022):

     pip install happytransformer
    
TextBox (20 880) - TextBox 2.0 is a text generation library with pre-trained language models. MIT
  • GitHub ( 18 94 5 41 - 7% open 02.02.2023):

     git clone https://github.com/RUCAIBox/TextBox
    
  • PyPi ( 4 / month 15.04.2021):

     pip install textbox
    
NLP Architect (19 2.9K) - A model library for exploring state-of-the-art deep learning.. Apache-2
  • GitHub ( 38 450 9 130 - 16% open 07.11.2022):

     git clone https://github.com/IntelLabs/nlp-architect
    
  • PyPi ( 99 / month 12.04.2020):

     pip install nlp-architect
    
finetune (19 670) - Scikit-learn style model finetuning for NLP. MPL-2.0
  • GitHub ( 20 72 9 140 - 16% open 26.01.2023):

     git clone https://github.com/IndicoDataSolutions/finetune
    
  • PyPi ( 75 / month 2 20.12.2021):

     pip install finetune
    
small-text (19 390) - Active Learning for Text Classification in Python. MIT
  • GitHub ( 2 41 15 24 - 25% open 27.11.2022):

     git clone https://github.com/webis-de/small-text
    
  • PyPi ( 280 / month 14.06.2022):

     pip install small-text
    
  • Conda ( 1.6K 14.10.2022):

     conda install -c conda-forge small-text
    
textaugment (18 300 ) - TextAugment: Text Augmentation Library. MIT
  • GitHub ( 6 54 54 39 20 - 35% open 17.05.2022):

     git clone https://github.com/dsfsi/textaugment
    
  • PyPi ( 2.9K / month 1 05.11.2020):

     pip install textaugment
    
fastT5 (17 410 ) - boost inference speed of T5 models by 5x & reduce the model size.. Apache-2
  • GitHub ( 5 51 27 55 - 27% open 05.04.2022):

     git clone https://github.com/Ki6an/fastT5
    
  • PyPi ( 1K / month 05.04.2022):

     pip install fastt5
    
OpenNRE (16 3.9K) - An Open-Source Package for Neural Relation Extraction (NRE). MIT
  • GitHub ( 12 1K 360 - 1% open 03.01.2023):

     git clone https://github.com/thunlp/OpenNRE
    
Translate (15 770 ) - Translate - a PyTorch Language Library. BSD-3
  • GitHub ( 88 200 55 - 50% open 10.06.2022):

     git clone https://github.com/pytorch/translate
    
  • PyPi ( 3 / month 01.05.2018):

     pip install pytorch-translate
    
VizSeq (15 410) - An Analysis Toolkit for Natural Language Generation (Translation,.. MIT
  • GitHub ( 3 52 6 16 - 43% open 02.01.2023):

     git clone https://github.com/facebookresearch/vizseq
    
  • PyPi ( 89 / month 07.08.2020):

     pip install vizseq
    
Show 35 hidden projects...
  • ChatterBot (35 13K ) - ChatterBot is a machine learning, conversational dialog engine.. BSD-3
  • TextBlob (35 8.4K ) - Simple, Pythonic, text processing--Sentiment analysis, part-of-.. MIT
  • fuzzywuzzy (33 8.8K ) - Fuzzy String Matching in Python. GPL-2.0
  • neuralcoref (28 2.6K ) - Fast Coreference Resolution in spaCy with Neural Networks. MIT
  • langid (27 2K ) - Stand-alone language identification system. BSD-3
  • flashtext (26 5.3K ) - Extract Keywords from sentence or Replace keywords in sentences. MIT
  • polyglot (26 2.1K ) - Multilingual text (NLP) processing toolkit. GPL-3.0
  • underthesea (26 1.1K) - Underthesea - Vietnamese NLP Toolkit. GPL-3.0
  • textgenrnn (25 4.9K ) - Easily train your own text-generating neural network of any.. MIT
  • Snips NLU (25 3.7K ) - Snips Python library to extract meaning from text. Apache-2
  • MatchZoo (24 3.7K ) - Facilitating the design, comparison and sharing of deep.. Apache-2
  • pytorch-nlp (24 2.1K ) - Basic Utilities for PyTorch Natural Language Processing.. BSD-3
  • Kashgari (23 2.3K ) - Kashgari is a production-level NLP Transfer learning.. Apache-2
  • YouTokenToMe (23 850 ) - Unsupervised text tokenizer focused on computational efficiency. MIT
  • whoosh (23 360 ) - Pure-Python full-text search library. BSD-1-Clause
  • DeepMatcher (22 4.5K ) - Python package for performing Entity and Text Matching using.. BSD-3
  • Texar (22 2.3K ) - Toolkit for Machine Learning, Natural Language Processing, and.. Apache-2
  • anaGo (22 1.5K ) - Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition,.. MIT
  • pySBD (22 560 ) - pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence.. MIT
  • DELTA (21 1.5K ) - DELTA is a deep learning based natural language and speech.. Apache-2
  • stop-words (21 140 ) - Get list of common stop words in various languages in Python. BSD-3
  • pyfasttext (20 230 ) - Yet another Python binding for fastText. GPL-3.0
  • textpipe (19 300 ) - Textpipe: clean and extract metadata from text. MIT
  • Camphr (18 340 ) - Camphr - NLP libary for creating pipeline components. Apache-2 spacy
  • numerizer (18 200) - A Python module to convert natural language numerics into ints and.. MIT
  • NeuroNER (17 1.6K ) - Named-entity recognition using neural networks. Easy-to-use and.. MIT
  • nboost (17 650 ) - NBoost is a scalable, search-api-boosting platform for deploying.. Apache-2
  • skift (17 230 ) - scikit-learn wrappers for Python fastText. MIT
  • NeuralQA (15 220 ) - NeuralQA: A Usable Library for Question Answering on Large Datasets.. MIT
  • BLINK (14 990 ) - Entity Linker solution. MIT
  • Headliner (14 230 ) - Easy training and deployment of seq2seq models. MIT
  • spacy-dbpedia-spotlight (14 85) - A spaCy wrapper for DBpedia Spotlight. MIT spacy
  • TransferNLP (13 290 ) - NLP library designed for reproducible experimentation.. MIT
  • ONNX-T5 (13 220 ) - Summarization, translation, sentiment-analysis, text-generation.. Apache-2
  • textvec (13 190 ) - Text vectorization tool to outperform TFIDF for classification.. MIT

Image Data

Back to top

Libraries for image & video processing, manipulation, and augmentation as well as libraries for computer vision tasks such as facial recognition, object detection, and classification.

Pillow (47 10K) - Python Imaging Library (Fork). PIL
  • GitHub ( 420 2K 1M 2.7K - 4% open 01.02.2023):

     git clone https://github.com/python-pillow/Pillow
    
  • PyPi ( 53M / month 63K 02.01.2023):

     pip install Pillow
    
  • Conda ( 23M 02.01.2023):

     conda install -c conda-forge pillow
    
scikit-image (43 5.2K) - Image processing in Python. BSD-2
  • GitHub ( 590 2.1K 130K 2.6K - 25% open 02.02.2023):

     git clone https://github.com/scikit-image/scikit-image
    
  • PyPi ( 5.2M / month 9.4K 12.06.2022):

     pip install scikit-image
    
  • Conda ( 4.5M 30.10.2022):

     conda install -c conda-forge scikit-image
    
torchvision (41 13K) - Datasets, Transforms and Models specific to Computer Vision. BSD-3
  • GitHub ( 520 6.5K 18K 2.9K - 29% open 02.02.2023):

     git clone https://github.com/pytorch/vision
    
  • PyPi ( 6.5M / month 3.8K 28.06.2022):

     pip install torchvision
    
  • Conda ( 510K 22.01.2023):

     conda install -c conda-forge torchvision
    
PyTorch Image Models (39 23K) - PyTorch image models, scripts, pretrained weights --.. Apache-2
  • GitHub ( 92 3.8K 3M 7.5K 680 - 14% open 27.01.2023):

     git clone https://github.com/rwightman/pytorch-image-models
    
  • PyPi ( 1.4M / month 110 15.05.2022):

     pip install timm
    
  • Conda ( 46K 24.11.2022):

     conda install -c conda-forge timm
    
MMDetection (39 23K) - OpenMMLab Detection Toolbox and Benchmark. Apache-2
  • GitHub ( 390 8.3K 910 6.8K - 8% open 01.02.2023):

     git clone https://github.com/open-mmlab/mmdetection
    
  • PyPi ( 90K / month 13 01.06.2022):

     pip install mmdet
    
imageio (38 1.2K) - Python library for reading and writing image data. BSD-2
  • GitHub ( 96 240 470 78K 520 - 15% open 23.01.2023):

     git clone https://github.com/imageio/imageio
    
  • PyPi ( 12M / month 2.7K 23.01.2023):

     pip install imageio
    
  • Conda ( 4.3M 23.01.2023):

     conda install -c conda-forge imageio
    
MoviePy (37 10K) - Video editing with Python. MIT
  • GitHub ( 150 1.3K 21K 1.3K - 22% open 10.10.2022):

     git clone https://github.com/Zulko/moviepy
    
  • PyPi ( 770K / month 780 05.10.2020):

     pip install moviepy
    
  • Conda ( 150K 07.10.2022):

     conda install -c conda-forge moviepy
    
Albumentations (35 11K) - Fast image augmentation library and an easy-to-use wrapper.. MIT
  • GitHub ( 120 1.5K 12K 750 - 45% open 31.01.2023):

     git clone https://github.com/albumentations-team/albumentations
    
  • PyPi ( 530K / month 220 15.06.2022):

     pip install albumentations
    
  • Conda ( 89K 20.09.2022):

     conda install -c conda-forge albumentations
    
Wand (35 1.2K) - The ctypes-based simple ImageMagick binding for Python. MIT
  • GitHub ( 100 200 9K 14K 390 - 4% open 14.01.2023):

     git clone https://github.com/emcconville/wand
    
  • PyPi ( 550K / month 690 17.08.2021):

     pip install wand
    
  • Conda ( 21K 22.08.2022):

     conda install -c conda-forge wand
    
detectron2 (34 23K) - Detectron2 is a platform for object detection, segmentation.. Apache-2
  • GitHub ( 230 6.2K 870 3.2K - 9% open 02.02.2023):

     git clone https://github.com/facebookresearch/detectron2
    
  • PyPi ( 3 06.02.2020):

     pip install detectron2
    
  • Conda ( 120K 22.01.2023):

     conda install -c conda-forge detectron2
    
InsightFace (34 13K) - State-of-the-art 2D and 3D Face Analysis Project. MIT
  • GitHub ( 51 4.1K 240 2.1K - 56% open 24.01.2023):

     git clone https://github.com/deepinsight/insightface
    
  • PyPi ( 43K / month 5 29.01.2022):

     pip install insightface
    
Kornia (34 7.7K) - Open Source Differentiable Computer Vision Library. Apache-2
  • GitHub ( 200 770 510 740 - 31% open 02.02.2023):

     git clone https://github.com/kornia/kornia
    
  • PyPi ( 1.1M / month 60 17.05.2022):

     pip install kornia
    
  • Conda ( 61K 22.12.2022):

     conda install -c conda-forge kornia
    
deepface (34 5.4K) - A Lightweight Face Recognition and Facial Attribute Analysis (Age,.. MIT
  • GitHub ( 34 1.2K 1.1K 600 - 1% open 02.02.2023):

     git clone https://github.com/serengil/deepface
    
  • PyPi ( 69K / month 4 10.05.2022):

     pip install deepface
    
opencv-python (34 3.2K) - Automated CI toolchain to produce precompiled opencv-python,.. MIT
  • GitHub ( 43 630 620 - 10% open 26.01.2023):

     git clone https://github.com/opencv/opencv-python
    
  • PyPi ( 7.2M / month 9.4K 02.01.2023):

     pip install opencv-python
    
PaddleDetection (33 9.4K) - Object Detection toolkit based on PaddlePaddle. It.. Apache-2
  • GitHub ( 120 2.4K 52 4.5K - 20% open 18.01.2023):

     git clone https://github.com/PaddlePaddle/PaddleDetection
    
  • PyPi ( 1.8K / month 24.04.2022):

     pip install paddledet
    
imageai (33 7.5K) - A python library built to empower developers to build applications and.. MIT
  • GitHub ( 16 2K 820K 1.3K 720 - 39% open 30.12.2022):

     git clone https://github.com/OlafenwaMoses/ImageAI
    
  • PyPi ( 8.2K / month 17 05.01.2021):

     pip install imageai
    
  • Conda ( 4.9K 30.04.2021):

     conda install -c conda-forge imageai
    
Face Recognition (32 47K ) - The worlds simplest facial recognition api for Python.. MIT
  • GitHub ( 54 13K 630 1.3K - 54% open 10.06.2022):

     git clone https://github.com/ageitgey/face_recognition
    
  • PyPi ( 48K / month 46 20.02.2020):

     pip install face_recognition
    
  • Conda ( 13K 30.04.2021):

     conda install -c conda-forge face_recognition
    
PaddleSeg (32 6.3K) - Easy-to-use image segmentation library with awesome pre-.. Apache-2
  • GitHub ( 95 1.3K 800 1.6K - 11% open 18.01.2023):

     git clone https://github.com/PaddlePaddle/PaddleSeg
    
  • PyPi ( 4.7K / month 2 20.04.2022):

     pip install paddleseg
    
GluonCV (32 5.4K) - Gluon CV Toolkit. Apache-2
  • GitHub ( 120 1.2K 1K 840 - 7% open 19.01.2023):

     git clone https://github.com/dmlc/gluon-cv
    
  • PyPi ( 560K / month 59 07.07.2022):

     pip install gluoncv
    
ImageHash (32 2.6K) - A Python Perceptual Image Hashing Module. BSD-2
  • GitHub ( 24 320 7.3K 120 - 8% open 30.01.2023):

     git clone https://github.com/JohannesBuchner/imagehash
    
  • PyPi ( 1.3M / month 340 15.07.2021):

     pip install ImageHash
    
  • Conda ( 280K 28.09.2022):

     conda install -c conda-forge imagehash
    
vit-pytorch (29 13K) - Implementation of Vision Transformer, a simple way to achieve.. MIT
  • GitHub ( 17 2.1K 190 210 - 47% open 05.12.2022):

     git clone https://github.com/lucidrains/vit-pytorch
    
  • PyPi ( 29K / month 3 30.06.2022):

     pip install vit-pytorch
    
vidgear (29 2.6K) - A High-performance cross-platform Video Processing Python framework.. Apache-2
  • GitHub ( 13 210 720 280 250 - 1% open 26.01.2023):

     git clone https://github.com/abhiTronix/vidgear
    
  • PyPi ( 4.2K / month 5 05.07.2022):

     pip install vidgear
    
lightly (29 2.1K) - A python library for self-supervised learning on images. MIT
  • GitHub ( 24 170 87 350 - 21% open 02.02.2023):

     git clone https://github.com/lightly-ai/lightly
    
  • PyPi ( 5.2K / month 2 05.07.2022):

     pip install lightly
    
CellProfiler (29 740) - An open-source application for biological image analysis. BSD-3
  • GitHub ( 130 340 4.5K 11 3.1K - 6% open 05.01.2023):

     git clone https://github.com/CellProfiler/CellProfiler
    
  • PyPi ( 430 / month 22.07.2021):

     pip install cellprofiler
    
sahi (28 2.3K) - Framework agnostic sliced/tiled inference + interactive ui + error analysis.. MIT
  • GitHub ( 20 360 12K 280 15.01.2023):

     git clone https://github.com/obss/sahi
    
  • PyPi ( 90K / month 5 25.06.2022):

     pip install sahi
    
  • Conda ( 27K 15.01.2023):

     conda install -c conda-forge sahi
    
Norfair (28 1.7K) - Lightweight Python library for adding real-time multi-object tracking.. BSD-3
  • GitHub ( 24 170 240 75 120 - 3% open 20.01.2023):

     git clone https://github.com/tryolabs/norfair
    
  • PyPi ( 3.2K / month 1 30.05.2022):

     pip install norfair
    
doctr (28 1.5K) - docTR (Document Text Recognition) - a seamless, high-.. Apache-2
  • GitHub ( 27 200 1M 56 230 - 19% open 17.01.2023):

     git clone https://github.com/mindee/doctr
    
  • PyPi ( 5.3K / month 1 22.03.2022):

     pip install python-doctr
    
Augmentor (26 4.8K) - Image augmentation library in Python for machine learning. MIT
  • GitHub ( 23 850 540 200 - 63% open 24.01.2023):

     git clone https://github.com/mdbloice/Augmentor
    
  • PyPi ( 16K / month 29 27.04.2022):

     pip install Augmentor
    
mahotas (26 790) - Computer Vision in Python. MIT
  • GitHub ( 32 150 970 82 - 23% open 14.11.2022):

     git clone https://github.com/luispedro/mahotas
    
  • PyPi ( 7.4K / month 120 28.06.2022):

     pip install mahotas
    
  • Conda ( 350K 05.11.2022):

     conda install -c conda-forge mahotas
    
MMF (25 5.1K) - A modular framework for vision & language multimodal research from.. BSD-3
  • GitHub ( 110 880 14 720 - 32% open 02.02.2023):

     git clone https://github.com/facebookresearch/mmf
    
  • PyPi ( 180 / month 1 12.06.2020):

     pip install mmf
    
Image Deduplicator (25 4.4K) - Finding duplicate images made easy!. Apache-2
  • GitHub ( 13 400 31 130 - 37% open 10.01.2023):

     git clone https://github.com/idealo/imagededup
    
  • PyPi ( 2.2K / month 4 22.11.2020):

     pip install imagededup
    
layout-parser (25 3.4K) - A Unified Toolkit for Deep Learning Based Document Image.. Apache-2
  • GitHub ( 8 340 130 120 - 55% open 06.08.2022):

     git clone https://github.com/Layout-Parser/layout-parser
    
  • PyPi ( 36K / month 2 06.04.2022):

     pip install layoutparser
    
pyvips (25 480) - python binding for libvips using cffi. MIT
  • GitHub ( 15 45 440 330 - 38% open 31.01.2023):

     git clone https://github.com/libvips/pyvips
    
  • PyPi ( 27K / month 40 12.06.2022):

     pip install pyvips
    
  • Conda ( 46K 29.10.2022):

     conda install -c conda-forge pyvips
    
segmentation_models (24 4.2K ) - Segmentation models with pretrained backbones. Keras.. MIT
  • GitHub ( 14 950 510 - 47% open 29.07.2022):

     git clone https://github.com/qubvel/segmentation_models
    
  • PyPi ( 22K / month 24 10.01.2020):

     pip install segmentation_models
    
pytorchvideo (24 2.8K) - A deep learning library for video understanding research. Apache-2
  • GitHub ( 49 320 170 - 40% open 02.11.2022):

     git clone https://github.com/facebookresearch/pytorchvideo
    
  • PyPi ( 14K / month 5 20.01.2022):

     pip install pytorchvideo
    
deep-daze (23 4.4K ) - Simple command line tool for text to image generation using.. MIT
  • GitHub ( 14 320 50 170 - 56% open 13.03.2022):

     git clone https://github.com/lucidrains/deep-daze
    
  • PyPi ( 1.9K / month 13.03.2022):

     pip install deep-daze
    
tensorflow-graphics (23 2.7K) - TensorFlow Graphics: Differentiable Graphics Layers.. Apache-2
  • GitHub ( 38 360 240 - 61% open 19.01.2023):

     git clone https://github.com/tensorflow/graphics
    
  • PyPi ( 3.4K / month 5 03.12.2021):

     pip install tensorflow-graphics
    
icevision (23 800) - An Agnostic Computer Vision Framework - Pluggable to any Training.. Apache-2
  • GitHub ( 41 130 570 - 10% open 07.12.2022):

     git clone https://github.com/airctic/icevision
    
  • PyPi ( 1.9K / month 5 10.02.2022):

     pip install icevision
    
vissl (22 2.9K) - VISSL is FAIRs library of extensible, modular and scalable components.. MIT
  • GitHub ( 34 300 14 180 - 41% open 28.12.2022):

     git clone https://github.com/facebookresearch/vissl
    
  • PyPi ( 300 / month 1 02.11.2021):

     pip install vissl
    
kubric (22 1.8K) - A data generation pipeline for creating semi-realistic synthetic.. Apache-2
  • GitHub ( 21 160 2 170 - 30% open 10.01.2023):

     git clone https://github.com/google-research/kubric
    
  • PyPi ( 4.2K / month 06.07.2022):

     pip install kubric-nightly
    
Classy Vision (22 1.5K) - An end-to-end PyTorch framework for image and video.. MIT
  • GitHub ( 77 270 110 - 44% open 27.09.2022):

     git clone https://github.com/facebookresearch/ClassyVision
    
  • PyPi ( 1.2K / month 4 09.07.2021):

     pip install classy_vision
    
  • Conda ( 16K 22.03.2022):

     conda install -c conda-forge classy_vision
    
pycls (21 2K ) - Codebase for Image Classification Research, written in PyTorch. MIT
  • GitHub ( 17 230 9 81 - 30% open 12.07.2022):

     git clone https://github.com/facebookresearch/pycls
    
  • PyPi ( 120K / month 05.09.2020):

     pip install pycls
    
detecto (21 580 ) - Build fully-functioning computer vision models with PyTorch. MIT
  • GitHub ( 12 100 120 100 - 40% open 09.02.2022):

     git clone https://github.com/alankbi/detecto
    
  • PyPi ( 3.3K / month 2 02.02.2022):

     pip install detecto
    
  • Conda ( 2.1K 02.02.2022):

     conda install -c conda-forge detecto
    
image-match (20 2.8K) - Quickly search over billions of images. Apache-2
  • GitHub ( 19 400 110 - 57% open 06.12.2022):

     git clone https://github.com/ProvenanceLabs/image-match
    
  • PyPi ( 1.1K / month 4 13.02.2017):

     pip install image_match
    
ffcv (20 2.4K) - FFCV: Fast Forward Computer Vision (and other ML workloads!). Apache-2
  • GitHub ( 18 140 12 210 - 29% open 25.01.2023):

     git clone https://github.com/libffcv/ffcv
    
  • PyPi ( 790 / month 28.01.2022):

     pip install ffcv
    
DETR (19 10K ) - End-to-End Object Detection with Transformers. Apache-2
  • GitHub ( 25 1.9K 490 - 42% open 07.03.2022):

     git clone https://github.com/facebookresearch/detr
    
PySlowFast (19 5.4K) - PySlowFast: video understanding codebase from FAIR for.. Apache-2
  • GitHub ( 29 1.1K 10 600 - 55% open 12.01.2023):

     git clone https://github.com/facebookresearch/SlowFast
    
  • PyPi ( 8 / month 15.01.2020):

     pip install pyslowfast
    
scenic (19 1.7K) - Scenic: A Jax Library for Computer Vision Research and Beyond. Apache-2
  • GitHub ( 58 240 170 - 65% open 01.02.2023):

     git clone https://github.com/google-research/scenic
    
Show 16 hidden projects...
  • imgaug (34 13K ) - Image augmentation for machine learning experiments. MIT
  • glfw (34 10K) - A multi-platform library for OpenGL, OpenGL ES, Vulkan, window and input. Zlib
  • imutils (31 4.3K ) - A series of convenience functions to make basic image processing.. MIT
  • PyTorch3D (29 6.9K) - PyTorch3D is FAIRs library of reusable components for.. Unlicensed
  • chainercv (27 1.5K ) - ChainerCV: a Library for Deep Learning in Computer Vision. MIT
  • Face Alignment (26 6.1K ) - 2D and 3D Face alignment library build using pytorch. BSD-3
  • facenet-pytorch (26 3.3K ) - Pretrained Pytorch face detection (MTCNN) and facial.. MIT
  • Pillow-SIMD (26 1.9K) - The friendly PIL fork. PIL
  • mtcnn (26 1.9K ) - MTCNN face detection implementation for TensorFlow, as a PIP.. MIT
  • Image Super-Resolution (23 4K ) - Super-scale your images and run experiments with.. Apache-2
  • Luminoth (22 2.4K ) - Deep Learning toolkit for Computer Vision. BSD-3
  • nude.py (21 890 ) - Nudity detection with Python. MIT
  • solt (18 260 ) - Streaming over lightweight data transformations. MIT
  • Caer (17 670 ) - A lightweight Computer Vision library. Scale your models, not boilerplate. MIT
  • HugsVision (16 170) - HugsVision is a easy to use huggingface wrapper for state-of-the-.. MIT huggingface
  • Torch Points 3D (16 120 ) - Pytorch framework for doing deep learning on point.. BSD-3

Graph Data

Back to top

Libraries for graph processing, clustering, embedding, and machine learning tasks.

networkx (43 12K) - Network Analysis in Python. BSD-3
  • GitHub ( 640 2.8K 63 130K 3.1K - 12% open 01.02.2023):

     git clone https://github.com/networkx/networkx
    
  • PyPi ( 22M / month 14K 08.01.2023):

     pip install networkx
    
  • Conda ( 9.8M 08.01.2023):

     conda install -c conda-forge networkx
    
PyTorch Geometric (38 17K) - Graph Neural Network Library for PyTorch. MIT
  • GitHub ( 370 3.1K 3K - 23% open 02.02.2023):

     git clone https://github.com/pyg-team/pytorch_geometric
    
  • PyPi ( 100K / month 52 12.03.2022):

     pip install torch-geometric
    
  • Conda ( 16K 04.01.2023):

     conda install -c conda-forge pytorch_geometric
    
dgl (37 11K) - Python package built to ease deep learning on graph, on top of existing DL.. Apache-2
  • GitHub ( 240 2.7K 50 2.1K - 15% open 02.02.2023):

     git clone https://github.com/dmlc/dgl
    
  • PyPi ( 39K / month 46 16.03.2022):

     pip install dgl
    
ogb (29 1.5K) - Benchmark datasets, data loaders, and evaluators for graph machine learning. MIT
  • GitHub ( 24 350 590 240 - 2% open 27.01.2023):

     git clone https://github.com/snap-stanford/ogb
    
  • PyPi ( 23K / month 18 23.02.2022):

     pip install ogb
    
  • Conda ( 18K 03.11.2022):

     conda install -c conda-forge ogb
    
PyKEEN (27 1.1K) - A Python library for learning and evaluating knowledge graph embeddings. MIT
  • GitHub ( 35 150 150 490 - 19% open 31.01.2023):

     git clone https://github.com/pykeen/pykeen
    
  • PyPi ( 2.6K / month 3 24.05.2022):

     pip install pykeen
    
Spektral (26 2.2K) - Graph Neural Networks with Keras and Tensorflow 2. MIT
  • GitHub ( 24 330 180 240 - 20% open 19.10.2022):

     git clone https://github.com/danielegrattarola/spektral
    
  • PyPi ( 5.7K / month 3 09.04.2022):

     pip install spektral
    
pygraphistry (26 1.8K) - PyGraphistry is a Python library to quickly load, shape,.. BSD-3
  • GitHub ( 31 180 85 250 - 47% open 23.12.2022):

     git clone https://github.com/graphistry/pygraphistry
    
  • PyPi ( 1.6K / month 4 02.07.2022):

     pip install graphistry
    
Paddle Graph Learning (26 1.5K) - Paddle Graph Learning (PGL) is an efficient and.. Apache-2
  • GitHub ( 29 290 37 180 - 37% open 03.01.2023):

     git clone https://github.com/PaddlePaddle/PGL
    
  • PyPi ( 810 / month 2 21.04.2022):

     pip install pgl
    
PyTorch-BigGraph (24 3.2K) - Generate embeddings from large-scale graph-structured.. BSD-3
  • GitHub ( 29 420 150 190 - 28% open 08.12.2022):

     git clone https://github.com/facebookresearch/PyTorch-BigGraph
    
  • PyPi ( 190K / month 3 01.05.2019):

     pip install torchbiggraph
    
pytorch_geometric_temporal (23 1.9K) - PyTorch Geometric Temporal: Spatiotemporal Signal.. MIT
  • GitHub ( 25 270 130 - 9% open 20.01.2023):

     git clone https://github.com/benedekrozemberczki/pytorch_geometric_temporal
    
  • PyPi ( 2.2K / month 2 04.04.2022):

     pip install torch-geometric-temporal
    
Node2Vec (23 1K) - Implementation of the node2vec algorithm. MIT
  • GitHub ( 11 220 350 84 - 1% open 19.10.2022):

     git clone https://github.com/eliorc/node2vec
    
  • PyPi ( 91K / month 16 30.04.2022):

     pip install node2vec
    
  • Conda ( 24K 25.04.2020):

     conda install -c conda-forge node2vec
    
torch-cluster (22 600) - PyTorch Extension Library of Optimized Graph Cluster.. MIT
  • GitHub ( 27 110 120 - 19% open 23.01.2023):

     git clone https://github.com/rusty1s/pytorch_cluster
    
  • PyPi ( 8.6K / month 30 11.03.2022):

     pip install torch-cluster
    
  • Conda ( 51K 28.12.2022):

     conda install -c conda-forge pytorch_cluster
    
graph4nlp (21 1.5K) - Graph4nlp is the library for the easy use of Graph Neural.. Apache-2
  • GitHub ( 27 190 170 - 6% open 13.11.2022):

     git clone https://github.com/graph4ai/graph4nlp
    
  • PyPi ( 49 / month 20.01.2022):

     pip install graph4nlp
    
graph-nets (20 5.2K) - Build Graph Nets in Tensorflow. Apache-2
  • GitHub ( 11 780 130 - 3% open 12.12.2022):

     git clone https://github.com/deepmind/graph_nets
    
  • PyPi ( 1K / month 8 29.01.2020):

     pip install graph-nets
    
jraph (20 1.1K) - A Graph Neural Network Library in Jax. Apache-2
  • GitHub ( 17 69 54 41 - 36% open 31.08.2022):

     git clone https://github.com/deepmind/jraph
    
  • PyPi ( 1.7K / month 2 06.06.2022):

     pip install jraph
    
  • Conda ( 1.4K 31.10.2021):

     conda install -c conda-forge jraph
    
AutoGL (19 880) - An autoML framework & toolkit for machine learning on graphs. Apache-2
  • GitHub ( 15 110 27 - 22% open 30.12.2022):

     git clone https://github.com/THUMNLab/AutoGL
    
  • PyPi ( 4 / month 23.12.2020):

     pip install auto-graph-learning
    
OpenKE (16 3.3K) - An Open-Source Package for Knowledge Embedding (KE). MIT
  • GitHub ( 11 920 360 - 2% open 03.11.2022):

     git clone https://github.com/thunlp/OpenKE
    
GraphEmbedding (16 3.2K ) - Implementation and experiments of graph embedding.. MIT
  • GitHub ( 9 910 22 64 - 60% open 21.06.2022):

     git clone https://github.com/shenweichen/GraphEmbedding
    
GraphGym (16 1.2K) - Platform for designing and evaluating Graph Neural Networks (GNN). MIT
  • GitHub ( 5 150 19 2 38 - 28% open 25.08.2022):

     git clone https://github.com/snap-stanford/GraphGym
    
  • PyPi ( 58 / month 24.03.2022):

     pip install graphgym
    
kglib (15 530) - TypeDB-ML is the Machine Learning integrations library for TypeDB. Apache-2
  • GitHub ( 11 94 210 62 - 19% open 09.11.2022):

     git clone https://github.com/vaticle/kglib
    
  • PyPi ( 75 / month 19.08.2020):

     pip install grakn-kglib
    
OpenNE (14 1.6K) - An Open-Source Package for Network Embedding (NE). MIT
  • GitHub ( 11 480 100 - 4% open 02.11.2022):

     git clone https://github.com/thunlp/OpenNE
    
ptgnn (13 360 ) - A PyTorch Graph Neural Network Library. MIT
  • GitHub ( 7 40 2 7 - 28% open 01.02.2022):

     git clone https://github.com/microsoft/ptgnn
    
  • PyPi ( 63 / month 21.10.2021):

     pip install ptgnn
    
Show 14 hidden projects...
  • igraph (32 1.1K) - Python interface for igraph. GPL-2.0
  • StellarGraph (28 2.6K ) - StellarGraph - Machine Learning on Graphs. Apache-2
  • pygal (28 2.5K ) - PYthon svg GrAph plotting Library. LGPL-3.0
  • Karate Club (25 1.8K) - Karate Club: An API Oriented Open-source Python Framework for.. GPL-3.0
  • DeepWalk (22 2.5K ) - DeepWalk - Deep Learning for Graphs. GPL-3.0
  • AmpliGraph (22 1.9K ) - Python library for Representation Learning on Knowledge.. Apache-2
  • DIG (22 1.4K) - A library for graph deep learning research. GPL-3.0
  • pyRDF2Vec (20 180) - Python Implementation and Extension of RDF2Vec. MIT
  • deepsnap (19 450 ) - Python library assists deep learning on graphs. MIT
  • Sematch (17 400 ) - semantic similarity framework for knowledge graph. Apache-2
  • DeepGraph (16 270 ) - Analyze Data with Pandas-based Networks. Documentation:. BSD-3
  • Euler (15 2.8K ) - A distributed graph deep learning framework. Apache-2
  • GraphSAGE (14 3K ) - Representation learning on large graphs using stochastic graph.. MIT
  • GraphVite (13 1.1K ) - GraphVite: A General and High-performance Graph Embedding.. Apache-2

Audio Data

Back to top

Libraries for audio analysis, manipulation, transformation, and extraction, as well as speech recognition and music generation tasks.

espnet (36 6K) - End-to-End Speech Processing Toolkit. Apache-2
  • GitHub ( 330 1.8K 77 120 2.1K - 19% open 01.02.2023):

     git clone https://github.com/espnet/espnet
    
  • PyPi ( 12K / month 4 28.05.2022):

     pip install espnet
    
torchaudio (34 2K) - Data manipulation and transformation for audio signal.. BSD-2
  • GitHub ( 190 500 780 - 26% open 01.02.2023):

     git clone https://github.com/pytorch/audio
    
  • PyPi ( 650K / month 160 28.06.2022):

     pip install torchaudio
    
SpeechRecognition (33 6.7K) - Speech recognition module for Python, supporting several.. BSD-3
  • GitHub ( 49 2.2K 560 - 47% open 12.01.2023):

     git clone https://github.com/Uberi/speech_recognition
    
  • PyPi ( 400K / month 720 05.12.2017):

     pip install SpeechRecognition
    
  • Conda ( 160K 04.12.2022):

     conda install -c conda-forge speechrecognition
    
Pydub (33 6.7K) - Manipulate audio with a simple and easy high level interface. MIT
  • GitHub ( 95 890 17K 540 - 51% open 08.12.2022):

     git clone https://github.com/jiaaro/pydub
    
  • PyPi ( 3.4M / month 900 10.03.2021):

     pip install pydub
    
  • Conda ( 38K 13.03.2021):

     conda install -c conda-forge pydub
    
librosa (33 5.6K) - Python library for audio and music analysis. ISC
  • GitHub ( 110 850 1.1K - 4% open 01.02.2023):

     git clone https://github.com/librosa/librosa
    
  • PyPi ( 1.3M / month 1.2K 27.06.2022):

     pip install librosa
    
  • Conda ( 580K 27.06.2022):

     conda install -c conda-forge librosa
    
speechbrain (33 5.3K) - A PyTorch-based Speech Toolkit. Apache-2
  • GitHub ( 200 980 440 880 - 17% open 17.01.2023):

     git clone https://github.com/speechbrain/speechbrain
    
  • PyPi ( 31K / month 3 26.06.2022):

     pip install speechbrain
    
spleeter (31 22K) - Deezer source separation library including pretrained models. MIT
  • GitHub ( 19 2.4K 2.1M 370 730 - 25% open 25.11.2022):

     git clone https://github.com/deezer/spleeter
    
  • PyPi ( 11K / month 5 10.06.2022):

     pip install spleeter
    
  • Conda ( 73K 30.06.2020):

     conda install -c conda-forge spleeter
    
Magenta (31 18K) - Magenta: Music and Art Generation with Machine Intelligence. Apache-2
  • GitHub ( 160 3.7K 400 940 - 38% open 18.01.2023):

     git clone https://github.com/magenta/magenta
    
  • PyPi ( 7.1K / month 38 12.11.2020):

     pip install magenta
    
python-soundfile (29 520) - SoundFile is an audio library based on libsndfile, CFFI, and.. BSD-3
  • GitHub ( 29 83 11K 17K 190 - 41% open 27.12.2022):

     git clone https://github.com/bastibe/python-soundfile
    
  • PyPi ( 960K / month 27.09.2022):

     pip install soundfile
    
  • Conda:

     conda install -c anaconda pysoundfile
    
Coqui TTS (28 8K) - - a deep learning toolkit for Text-to-Speech, battle-.. MPL-2.0
  • GitHub ( 120 880 500K 500 - 5% open 29.01.2023):

     git clone https://github.com/coqui-ai/TTS
    
  • PyPi ( 12K / month 5 21.06.2022):

     pip install tts
    
  • Conda ( 6.4K 15.12.2021):

     conda install -c conda-forge tts
    
pyAudioAnalysis (28 5.1K) - Python Audio Analysis Library: Feature Extraction,.. Apache-2
  • GitHub ( 27 1.1K 330 310 - 60% open 18.09.2022):

     git clone https://github.com/tyiannak/pyAudioAnalysis
    
  • PyPi ( 23K / month 19 07.02.2022):

     pip install pyAudioAnalysis
    
Porcupine (27 2.9K) - On-device wake word detection powered by deep learning. Apache-2
  • GitHub ( 33 400 14 420 - 0% open 02.02.2023):

     git clone https://github.com/Picovoice/Porcupine
    
  • PyPi ( 1.6K / month 11 28.06.2022):

     pip install pvporcupine
    
audiomentations (27 1.2K) - A Python library for audio data augmentation. Inspired by.. MIT
  • GitHub ( 23 160 210 140 - 28% open 23.01.2023):

     git clone https://github.com/iver56/audiomentations
    
  • PyPi ( 5.6K / month 1 15.06.2022):

     pip install audiomentations
    
audioread (26 420 ) - cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio.. MIT
  • GitHub ( 23 100 86 - 39% open 18.11.2022):

     git clone https://github.com/beetbox/audioread
    
  • PyPi ( 1.3M / month 330 20.10.2020):

     pip install audioread
    
  • Conda ( 570K 29.10.2022):

     conda install -c conda-forge audioread
    
tinytag (25 580) - Read audio and music meta data and duration of MP3, OGG, OPUS, MP4, M4A,.. MIT
  • GitHub ( 25 91 680 98 - 16% open 15.01.2023):

     git clone https://github.com/devsnd/tinytag
    
  • PyPi ( 19K / month 70 12.03.2022):

     pip install tinytag
    
DDSP (24 2.4K) - DDSP: Differentiable Digital Signal Processing. Apache-2
  • GitHub ( 32 280 36 150 - 20% open 22.12.2022):

     git clone https://github.com/magenta/ddsp
    
  • PyPi ( 2.9K / month 1 25.05.2022):

     pip install ddsp
    
  • Conda ( 13K 08.06.2020):

     conda install -c conda-forge ddsp
    
kapre (22 870 ) - kapre: Keras Audio Preprocessors. MIT
  • GitHub ( 13 140 22 2.1K 95 - 13% open 04.07.2022):

     git clone https://github.com/keunwoochoi/kapre
    
  • PyPi ( 3.7K / month 14 21.01.2022):

     pip install kapre
    
nnAudio (21 790) - Audio processing by using pytorch 1D convolution network. MIT
  • GitHub ( 13 78 78 53 - 22% open 09.10.2022):

     git clone https://github.com/KinWaiCheuk/nnAudio
    
  • PyPi ( 2.8K / month 1 24.12.2021):

     pip install nnAudio
    
Julius (18 310) - Fast PyTorch based DSP for audio and 1D signals. MIT
  • GitHub ( 2 20 230 11 - 18% open 19.09.2022):

     git clone https://github.com/adefossez/julius
    
  • PyPi ( 38K / month 5 20.10.2021):

     pip install julius
    
Show 10 hidden projects...
  • DeepSpeech (34 21K ) - DeepSpeech is an open source embedded (offline, on-.. MPL-2.0
  • aubio (27 2.9K ) - a library for audio and music analysis. GPL-3.0
  • Essentia (27 2.3K) - C++ library for audio and music analysis, description and.. AGPL-3.0
  • Madmom (25 1K ) - Python audio and music signal processing library. BSD-3
  • TimeSide (24 340) - scalable audio processing framework and server written in Python. AGPL-3.0
  • TTS (23 6.6K ) - Deep learning for Text to Speech (Discussion forum:.. MPL-2.0
  • python_speech_features (23 2.2K ) - This library provides common speech features for ASR.. MIT
  • Dejavu (22 6K ) - Audio fingerprinting and recognition in Python. MIT
  • Muda (17 210 ) - A library for augmenting annotated audio data. ISC
  • textlesslib (9 380 ) - Library for Textless Spoken Language Processing. MIT

Geospatial Data

Back to top

Libraries to load, process, analyze, and write geographic data as well as libraries for spatial analysis, map visualization, and geocoding.

pydeck (42 11K) - WebGL2 powered visualization framework. MIT
  • GitHub ( 210 1.9K 5.3K 2.6K - 7% open 01.02.2023):

     git clone https://github.com/visgl/deck.gl
    
  • PyPi ( 940K / month 24 25.10.2021):

     pip install pydeck
    
  • Conda ( 280K 04.11.2022):

     conda install -c conda-forge pydeck
    
  • npm ( 370K / month 420 18.01.2023):

     npm install deck.gl
    
folium (40 6.1K) - Python Data. Leaflet.js Maps. MIT
  • GitHub ( 150 2.2K 21K 980 - 4% open 24.01.2023):

     git clone https://github.com/python-visualization/folium
    
  • PyPi ( 850K / month 680 19.11.2021):

     pip install folium
    
  • Conda ( 1.6M 13.12.2022):

     conda install -c conda-forge folium
    
Shapely (38 3.2K) - Manipulation and analysis of geometric objects. BSD-3
  • GitHub ( 140 510 400 38K 1K - 19% open 01.02.2023):

     git clone https://github.com/shapely/shapely
    
  • PyPi ( 8.4M / month 35 03.08.2022):

     pip install shapely
    
  • Conda ( 5.4M 30.01.2023):

     conda install -c conda-forge shapely
    
Rasterio (38 1.9K) - Rasterio reads and writes geospatial raster datasets. BSD-3
  • GitHub ( 140 500 770 6.4K 1.6K - 8% open 26.01.2023):

     git clone https://github.com/rasterio/rasterio
    
  • PyPi ( 870K / month 800 06.07.2022):

     pip install rasterio
    
  • Conda ( 2M 17.11.2022):

     conda install -c conda-forge rasterio
    
GeoPandas (37 3.5K ) - Python tools for geographic data. BSD-3
  • GitHub ( 190 780 1.7K 18K 1.5K - 27% open 27.01.2023):

     git clone https://github.com/geopandas/geopandas
    
  • PyPi ( 2.8M / month 1.2K 21.06.2022):

     pip install geopandas
    
  • Conda ( 2.4M 10.12.2022):

     conda install -c conda-forge geopandas
    
pyproj (37 850) - Python interface to PROJ (cartographic projections and coordinate.. MIT
  • GitHub ( 58 190 18K 540 - 4% open 20.01.2023):

     git clone https://github.com/pyproj4/pyproj
    
  • PyPi ( 4.7M / month 1.7K 22.04.2022):

     pip install pyproj
    
  • Conda ( 5.2M 17.01.2023):

     conda install -c conda-forge pyproj
    
geopy (36 3.9K) - Geocoding library for Python. MIT
  • GitHub ( 130 600 270 - 9% open 13.11.2022):

     git clone https://github.com/geopy/geopy
    
  • PyPi ( 3.6M / month 3.9K 11.07.2021):

     pip install geopy
    
  • Conda ( 930K 13.11.2022):

     conda install -c conda-forge geopy
    
Fiona (34 1K) - Fiona reads and writes geographic data files. BSD-3
  • GitHub ( 65 190 11K 720 - 9% open 31.01.2023):

     git clone https://github.com/Toblerity/Fiona
    
  • PyPi ( 2.9M / month 800 10.06.2022):

     pip install fiona
    
  • Conda ( 4M 31.01.2023):

     conda install -c conda-forge fiona
    
ArcGIS API (32 1.5K) - Documentation and samples for ArcGIS API for Python. Apache-2
  • GitHub ( 85 980 7.1K 520 - 5% open 19.01.2023):

     git clone https://github.com/Esri/arcgis-python-api
    
  • PyPi ( 69K / month 28 02.06.2022):

     pip install arcgis
    
  • Docker Hub ( 7.7K 40 17.06.2022):

     docker pull esridocker/arcgis-api-python-notebook
    
ipyleaflet (32 1.3K) - A Jupyter - Leaflet.js bridge. MIT
  • GitHub ( 81 350 3.6K 550 - 40% open 27.10.2022):

     git clone https://github.com/jupyter-widgets/ipyleaflet
    
  • PyPi ( 150K / month 110 07.07.2022):

     pip install ipyleaflet
    
  • Conda ( 950K 19.10.2022):

     conda install -c conda-forge ipyleaflet
    
  • npm ( 53K / month 3 19.10.2022):

     npm install jupyter-leaflet
    
geojson (30 780) - Python bindings and utilities for GeoJSON. BSD-3
  • GitHub ( 51 110 11K 89 - 23% open 26.01.2023):

     git clone https://github.com/jazzband/geojson
    
  • PyPi ( 890K / month 1.2K 09.08.2019):

     pip install geojson
    
  • Conda ( 650K 11.08.2019):

     conda install -c conda-forge geojson
    
GeoViews (28 460) - Simple, concise geographical visualization in Python. BSD-3
  • GitHub ( 28 67 560 310 - 34% open 17.01.2023):

     git clone https://github.com/holoviz/geoviews
    
  • PyPi ( 6.3K / month 28 08.03.2022):

     pip install geoviews
    
  • Conda ( 150K 17.01.2023):

     conda install -c conda-forge geoviews
    
PySAL (27 1.1K) - PySAL: Python Spatial Analysis Library Meta-Package. BSD-3
  • GitHub ( 77 280 610 - 1% open 28.01.2023):

     git clone https://github.com/pysal/pysal
    
  • PyPi ( 28K / month 32 30.01.2022):

     pip install pysal
    
  • Conda ( 470K 31.01.2023):

     conda install -c conda-forge pysal
    
Show 9 hidden projects...
  • Geocoder (32 1.5K ) - Python Geocoder. MIT
  • Satpy (30 890) - Python package for earth-observing satellite data processing. GPL-3.0
  • Sentinelsat (27 840) - Search and download Copernicus Sentinel satellite images. GPL-3.0
  • EarthPy (26 420 ) - A package built to support working with spatial data using open.. BSD-3
  • pymap3d (25 300) - pure-Python (Numpy optional) 3D coordinate conversions for geospace ecef.. BSD-2
  • prettymaps (23 9K) - A small set of Python functions to draw pretty maps from.. AGPL-3.0
  • gmaps (23 750 ) - Google maps for Jupyter notebooks. BSD-3
  • Mapbox GL (23 620 ) - Use Mapbox GL JS to visualize data in a Python Jupyter notebook. MIT
  • geoplotlib (21 980 ) - python toolbox for visualizing geographical data and making maps. MIT

Financial Data

Back to top

Libraries for algorithmic stock/crypto trading, risk analytics, backtesting, technical analysis, and other tasks on financial data.

yfinance (40 8.8K ) - Download market data from Yahoo! Finances API. Apache-2
  • GitHub ( 73 1.8K 17K 1K - 24% open 01.02.2023):

     git clone https://github.com/ranaroussi/yfinance
    
  • PyPi ( 610K / month 150 16.06.2022):

     pip install yfinance
    
  • Conda ( 70K 10.07.2021):

     conda install -c ranaroussi yfinance
    
Qlib (33 10K) - Qlib is an AI-oriented quantitative investment platform, which aims to.. MIT
  • GitHub ( 110 1.8K 340 37 730 - 32% open 29.01.2023):

     git clone https://github.com/microsoft/qlib
    
  • PyPi ( 14K / month 15.06.2022):

     pip install pyqlib
    
ta (29 3.5K) - Technical Analysis Library using Pandas and Numpy. MIT
  • GitHub ( 29 770 1.7K 220 - 56% open 23.08.2022):

     git clone https://github.com/bukosabino/ta
    
  • PyPi ( 140K / month 34 23.08.2022):

     pip install ta
    
  • Conda ( 11K 23.08.2022):

     conda install -c conda-forge ta
    
IB-insync (29 2K) - Python sync/async framework for Interactive Brokers API. BSD-2
  • GitHub ( 33 520 460 - 2% open 19.01.2023):

     git clone https://github.com/erdewit/ib_insync
    
  • PyPi ( 13K / month 22 28.11.2021):

     pip install ib_insync
    
  • Conda ( 26K 03.01.2023):

     conda install -c conda-forge ib-insync
    
ffn (28 1.4K) - ffn - a financial function library for Python. MIT
  • GitHub ( 29 240 270 110 - 18% open 21.12.2022):

     git clone https://github.com/pmorissette/ffn
    
  • PyPi ( 180K / month 25 21.04.2021):

     pip install ffn
    
  • Conda ( 2.4K 22.04.2021):

     conda install -c conda-forge ffn
    
Alpha Vantage (27 3.8K) - A python wrapper for Alpha Vantage API for financial data. MIT
  • GitHub ( 42 660 270 - 3% open 25.12.2022):

     git clone https://github.com/RomelTorres/alpha_vantage
    
  • PyPi ( 32K / month 25 21.12.2020):

     pip install alpha_vantage
    
  • Conda ( 2.5K 14.01.2021):

     conda install -c conda-forge alpha_vantage
    
bt (27 1.6K) - bt - flexible backtesting for Python. MIT
  • GitHub ( 27 340 170 310 - 22% open 22.11.2022):

     git clone https://github.com/pmorissette/bt
    
  • PyPi ( 9.5K / month 21 21.04.2021):

     pip install bt
    
  • Conda ( 11K 12.11.2022):

     conda install -c conda-forge bt
    
TensorTrade (25 4.1K) - An open source reinforcement learning framework for training,.. Apache-2
  • GitHub ( 61 960 41 240 - 19% open 23.08.2022):

     git clone https://github.com/tensortrade-org/tensortrade
    
  • PyPi ( 600 / month 1 10.05.2021):

     pip install tensortrade
    
  • Conda ( 2.2K 10.05.2021):

     conda install -c conda-forge tensortrade
    
stockstats (24 1.1K) - Supply a wrapper ``StockDataFrame`` based on the.. BSD-3
  • GitHub ( 9 270 610 94 - 17% open 19.11.2022):

     git clone https://github.com/jealous/stockstats
    
  • PyPi ( 8.5K / month 29 07.01.2022):

     pip install stockstats
    
Crypto Signals (23 4.3K) - Github.com/CryptoSignal - Trading & Technical Analysis Bot -.. MIT
  • GitHub ( 28 1.2K 260 - 22% open 09.08.2022):

     git clone https://github.com/CryptoSignal/crypto-signal
    
  • Docker Hub ( 140K 8 03.09.2020):

     docker pull shadowreaver/crypto-signal
    
tf-quant-finance (23 3.6K) - High-performance TensorFlow library for quantitative.. Apache-2
  • GitHub ( 45 480 52 - 50% open 02.02.2023):

     git clone https://github.com/google/tf-quant-finance
    
  • PyPi ( 1.2K / month 2 31.05.2022):

     pip install tf-quant-finance
    
finmarketpy (19 3K) - Python library for backtesting trading strategies & analyzing.. Apache-2
  • GitHub ( 14 450 42 5 27 - 88% open 20.12.2022):

     git clone https://github.com/cuemacro/finmarketpy
    
  • PyPi ( 99 / month 07.10.2021):

     pip install finmarketpy
    
Show 13 hidden projects...

Time Series Data

Back to top

Libraries for forecasting, anomaly detection, feature extraction, and machine learning on time-series and sequential data.

Prophet (36 15K) - Tool for producing high quality forecasts for time series data that has.. MIT
  • GitHub ( 160 4.4K 1.2K 1.9K - 14% open 31.01.2023):

     git clone https://github.com/facebook/prophet
    
  • PyPi ( 1.5M / month 130 05.09.2020):

     pip install fbprophet
    
  • Conda ( 330K 21.01.2023):

     conda install -c conda-forge prophet
    
sktime (36 6.1K) - A unified framework for machine learning with time series. BSD-3
  • GitHub ( 230 1K 76 860 1.6K - 33% open 01.02.2023):

     git clone https://github.com/alan-turing-institute/sktime
    
  • PyPi ( 350K / month 27 29.06.2022):

     pip install sktime
    
  • Conda ( 200K 14.01.2023):

     conda install -c conda-forge sktime-all-extras
    
tsfresh (32 7K) - Automatic extraction of relevant features from time series:. MIT
  • GitHub ( 84 1.1K 510 - 12% open 31.12.2022):

     git clone https://github.com/blue-yonder/tsfresh
    
  • PyPi ( 290K / month 60 21.12.2021):

     pip install tsfresh
    
  • Conda ( 510K 31.12.2022):

     conda install -c conda-forge tsfresh
    
NeuralProphet (32 2.8K) - NeuralProphet: A simple forecasting package. MIT
  • GitHub ( 35 370 170 420 - 27% open 01.02.2023):

     git clone https://github.com/ourownstory/neural_prophet
    
  • PyPi ( 140K / month 22.03.2022):

     pip install neuralprophet
    
STUMPY (32 2.5K) - STUMPY is a powerful and scalable Python library for modern time series.. BSD-3
  • GitHub ( 32 240 350 400 - 13% open 26.01.2023):

     git clone https://github.com/TDAmeritrade/stumpy
    
  • PyPi ( 180K / month 6 31.03.2022):

     pip install stumpy
    
  • Conda ( 230K 31.03.2022):

     conda install -c conda-forge stumpy
    
pmdarima (31 1.3K) - A statistical library designed to fill the void in Pythons time series.. MIT
  • GitHub ( 22 220 3.2K 300 - 11% open 20.12.2022):

     git clone https://github.com/alkaline-ml/pmdarima
    
  • PyPi ( 1.5M / month 57 22.02.2022):

     pip install pmdarima
    
  • Conda ( 260K 30.11.2022):

     conda install -c conda-forge pmdarima
    
Darts (30 5.3K) - A python library for user-friendly forecasting and anomaly detection.. Apache-2
  • GitHub ( 75 580 180 850 - 21% open 26.01.2023):

     git clone https://github.com/unit8co/darts
    
  • PyPi ( 11K / month 3 22.06.2022):

     pip install u8darts
    
  • Conda ( 21K 12.01.2023):

     conda install -c conda-forge u8darts-all
    
  • Docker Hub ( 370 12.01.2023):

     docker pull unit8/darts
    
GluonTS (30 3.3K) - Probabilistic time series modeling in Python. Apache-2
  • GitHub ( 100 670 800 - 30% open 01.02.2023):

     git clone https://github.com/awslabs/gluon-ts
    
  • PyPi ( 130K / month 8 30.06.2022):

     pip install gluonts
    
  • Conda ( 240 14.10.2021):

     conda install -c anaconda gluonts
    
tslearn (30 2.4K) - A machine learning toolkit dedicated to time-series data. BSD-2
  • GitHub ( 40 310 680 290 - 36% open 20.01.2023):

     git clone https://github.com/tslearn-team/tslearn
    
  • PyPi ( 120K / month 26 16.08.2021):

     pip install tslearn
    
  • Conda ( 520K 20.01.2023):

     conda install -c conda-forge tslearn
    
pytorch-forecasting (29 2.6K) - Time series forecasting with PyTorch. MIT
  • GitHub ( 33 420 590 - 55% open 17.11.2022):

     git clone https://github.com/jdb78/pytorch-forecasting
    
  • PyPi ( 100K / month 4 23.05.2022):

     pip install pytorch-forecasting
    
  • Conda ( 36K 23.05.2022):

     conda install -c conda-forge pytorch-forecasting
    
StatsForecast (28 2.2K) - Lightning fast forecasting with statistical and econometric.. Apache-2
  • GitHub ( 17 130 140 130 - 36% open 27.01.2023):

     git clone https://github.com/Nixtla/statsforecast
    
  • PyPi ( 200K / month 5 27.06.2022):

     pip install statsforecast
    
  • Conda ( 24K 01.12.2022):

     conda install -c conda-forge statsforecast
    
Streamz (28 1.1K) - Real-time stream processing for python. BSD-3
  • GitHub ( 48 140 360 260 - 40% open 22.12.2022):

     git clone https://github.com/python-streamz/streamz
    
  • PyPi ( 17K / month 35 04.10.2021):

     pip install streamz
    
  • Conda ( 510K 28.07.2022):

     conda install -c conda-forge streamz
    
uber/orbit (26 1.6K) - A Python package for Bayesian forecasting with object-oriented.. Apache-2
  • GitHub ( 18 120 13 380 - 14% open 29.01.2023):

     git clone https://github.com/uber/orbit
    
  • PyPi ( 120K / month 1 28.04.2022):

     pip install orbit-ml
    
  • Conda ( 2.3K 31.01.2023):

     conda install -c conda-forge orbit-ml
    
pyts (26 1.4K) - A Python package for time series classification. BSD-3
  • GitHub ( 13 150 310 70 - 54% open 05.12.2022):

     git clone https://github.com/johannfaouzi/pyts
    
  • PyPi ( 82K / month 11 31.10.2021):

     pip install pyts
    
  • Conda ( 17K 31.10.2021):

     conda install -c conda-forge pyts
    
NeuralForecast (24 1.2K) - Scalable and user friendly neural forecasting algorithms. Apache-2
  • GitHub ( 23 110 6 160 - 17% open 31.01.2023):

     git clone https://github.com/Nixtla/neuralforecast
    
  • PyPi ( 2.5K / month 02.06.2022):

     pip install neuralforecast
    
  • Conda ( 4.2K 15.12.2022):

     conda install -c conda-forge neuralforecast
    
tick (22 410) - Module for statistical learning, with a particular emphasis on time-.. BSD-3
  • GitHub ( 17 89 240 67 230 - 27% open 18.01.2023):

     git clone https://github.com/X-DataInitiative/tick
    
  • PyPi ( 1.2K / month 1 24.05.2020):

     pip install tick
    
greykite (21 1.7K) - A flexible, intuitive and fast forecasting library. BSD-2
  • GitHub ( 8 88 14 81 - 19% open 31.08.2022):

     git clone https://github.com/linkedin/greykite
    
  • PyPi ( 14K / month 15.12.2021):

     pip install greykite
    
TSFEL (21 610 ) - An intuitive library to extract features from time series. BSD-3
  • GitHub ( 14 100 53 62 - 29% open 16.03.2022):

     git clone https://github.com/fraunhoferportugal/tsfel
    
  • PyPi ( 9.5K / month 14.02.2021):

     pip install tsfel
    
seglearn (20 540) - Python module for machine learning time series:. BSD-3
  • GitHub ( 14 63 18 29 - 20% open 27.08.2022):

     git clone https://github.com/dmbee/seglearn
    
  • PyPi ( 3.6K / month 1 13.03.2021):

     pip install seglearn
    
Auto TS (17 530) - Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost.. Apache-2
  • GitHub ( 6 94 79 - 10% open 16.08.2022):

     git clone https://github.com/AutoViML/Auto_TS
    
  • PyPi ( 5.9K / month 31.01.2022):

     pip install auto-ts
    
Show 9 hidden projects...
  • PyFlux (24 2K ) - Open source time series library for Python. BSD-3
  • luminol (21 1.1K ) - Anomaly Detection and Correlation library. Apache-2
  • matrixprofile-ts (20 700 ) - A Python library for detecting patterns and anomalies.. Apache-2
  • pydlm (20 440 ) - A python library for Bayesian time series modeling. BSD-3
  • ADTK (19 900 ) - A Python toolkit for rule-based/unsupervised anomaly detection in time.. MPL-2.0
  • tsflex (19 210) - Flexible time series feature extraction & processing. MIT
  • tsaug (15 280 ) - A Python package for time series augmentation. Apache-2
  • atspy (13 470 ) - AtsPy: Automated Time Series Models in Python (by @firmai). MIT
  • tslumen (6 45 ) - A library for Time Series EDA (exploratory data analysis). Apache-2

Medical Data

Back to top

Libraries for processing and analyzing medical data such as MRIs, EEGs, genomic data, and other medical imaging formats.

MNE (37 2.1K) - MNE: Magnetoencephalography (MEG) and Electroencephalography (EEG) in Python. BSD-3
  • GitHub ( 320 1.1K 2.2K 4.4K - 10% open 02.02.2023):

     git clone https://github.com/mne-tools/mne-python
    
  • PyPi ( 48K / month 220 12.05.2022):

     pip install mne
    
  • Conda ( 260K 22.12.2022):

     conda install -c conda-forge mne
    
NiBabel (35 520) - Python package to access a cacophony of neuro-imaging file formats. MIT
  • GitHub ( 94 240 9.5K 480 - 30% open 31.01.2023):

     git clone https://github.com/nipy/nibabel
    
  • PyPi ( 190K / month 1K 18.06.2022):

     pip install nibabel
    
  • Conda ( 530K 10.01.2023):

     conda install -c conda-forge nibabel
    
Nilearn (34 930) - Machine learning for NeuroImaging in Python. BSD-3
  • GitHub ( 200 490 86 2K 1.7K - 16% open 27.01.2023):

     git clone https://github.com/nilearn/nilearn
    
  • PyPi ( 31K / month 240 13.04.2022):

     pip install nilearn
    
  • Conda ( 210K 06.01.2023):

     conda install -c conda-forge nilearn
    
NIPYPE (34 660) - Workflows and interfaces for neuroimaging packages. Apache-2
  • GitHub ( 240 510 1.2K 1.3K - 29% open 29.01.2023):

     git clone https://github.com/nipy/nipype
    
  • PyPi ( 62K / month 160 06.06.2022):

     pip install nipype
    
  • Conda ( 530K 06.11.2022):

     conda install -c conda-forge nipype
    
MONAI (33 3.8K) - AI Toolkit for Healthcare Imaging. Apache-2
  • GitHub ( 140 720 710 2.3K - 11% open 02.02.2023):

     git clone https://github.com/Project-MONAI/MONAI
    
  • PyPi ( 55K / month 21 19.12.2022):

     pip install monai
    
  • Conda ( 8.8K 19.12.2022):

     conda install -c conda-forge monai
    
Lifelines (32 2K) - Survival analysis in Python. MIT
  • GitHub ( 100 520 1.3K 890 - 26% open 17.11.2022):

     git clone https://github.com/CamDavidsonPilon/lifelines
    
  • PyPi ( 380K / month 110 26.06.2022):

     pip install lifelines
    
  • Conda ( 240K 17.11.2022):

     conda install -c conda-forge lifelines
    
Hail (32 850) - Scalable genomic data analysis. MIT
  • GitHub ( 82 220 90 2.1K - 2% open 02.02.2023):

     git clone https://github.com/hail-is/hail
    
  • PyPi ( 42K / month 12 07.07.2022):

     pip install hail
    
DIPY (32 570) - DIPY is the paragon 3D/4D+ imaging library in Python. Contains generic.. BSD-3
  • GitHub ( 130 360 690 830 - 18% open 31.01.2023):

     git clone https://github.com/dipy/dipy
    
  • PyPi ( 13K / month 83 11.03.2022):

     pip install dipy
    
  • Conda ( 350K 26.01.2023):

     conda install -c conda-forge dipy
    
NIPY (25 330) - Neuroimaging in Python FMRI analysis package. BSD-3
  • GitHub ( 64 130 160 - 28% open 03.01.2023):

     git clone https://github.com/nipy/nipy
    
  • PyPi ( 1.4K / month 47 19.02.2018):

     pip install nipy
    
  • Conda ( 100K 04.05.2020):

     conda install -c conda-forge nipy
    
pyRiemann (24 460) - Python machine learning package based on sklearn API for multivariate.. BSD-3
  • GitHub ( 24 140 180 91 - 7% open 29.01.2023):

     git clone https://github.com/pyRiemann/pyRiemann
    
  • PyPi ( 25K / month 13 27.06.2021):

     pip install pyriemann
    
DeepVariant (23 2.7K) - DeepVariant is an analysis pipeline that uses a deep neural.. BSD-3
  • GitHub ( 24 650 4.4K 560 - 1% open 17.10.2022):

     git clone https://github.com/google/deepvariant
    
  • Conda ( 49K 05.06.2022):

     conda install -c bioconda deepvariant
    
Medical Detection Toolkit (14 1.2K ) - The Medical Detection Toolkit contains 2D + 3D.. Apache-2
  • GitHub ( 3 290 130 - 33% open 04.04.2022):

     git clone https://github.com/MIC-DKFZ/medicaldetectiontoolkit
    
Show 8 hidden projects...
  • NiftyNet (24 1.3K ) - [unmaintained] An open-source convolutional neural.. Apache-2
  • MedPy (22 460) - Medical image processing in Python. GPL-3.0
  • DLTK (20 1.4K ) - Deep Learning Toolkit for Medical Image Analysis. Apache-2
  • Glow (20 220) - An open-source toolkit for large-scale genomic analysis. Apache-2
  • Brainiak (19 290 ) - Brain Imaging Analysis Kit. Apache-2
  • MedicalTorch (16 800 ) - A medical imaging framework for Pytorch. Apache-2
  • MedicalNet (12 1.5K ) - Many studies have shown that the performance on deep learning is.. MIT
  • DeepNeuro (12 110 ) - A deep learning python package for neuroimaging data. Made by:. MIT

Tabular Data

Back to top

Libraries for processing tabular and structured data.

pytorch_tabular (21 820) - A standard framework for modelling Deep Learning Models.. MIT
  • GitHub ( 15 95 87 - 6% open 22.01.2023):

     git clone https://github.com/manujosephv/pytorch_tabular
    
  • PyPi ( 2.1K / month 1 01.09.2021):

     pip install pytorch_tabular
    
carefree-learn (21 380) - Deep Learning PyTorch. MIT
  • GitHub ( 1 36 3 02.02.2023):

     git clone https://github.com/carefree0910/carefree-learn
    
  • PyPi ( 810 / month 20.06.2022):

     pip install carefree-learn
    
deltapy (12 470 ) - DeltaPy - Tabular Data Augmentation (by @firmai). MIT
  • GitHub ( 4 47 3 3 - 66% open 01.03.2022):

     git clone https://github.com/firmai/deltapy
    
  • PyPi ( 48 / month 09.04.2020):

     pip install deltapy
    
Show 2 hidden projects...
  • miceforest (23 210) - Multiple Imputation with LightGBM in Python. MIT
  • upgini (19 180) - Free automated data enrichment library for machine learning searches.. BSD-3

Optical Character Recognition

Back to top

Libraries for optical character recognition (OCR) and text extraction from images or videos.

PaddleOCR (36 28K ) - Awesome multilingual OCR toolkits based on PaddlePaddle.. Apache-2
  • GitHub ( 140 5.7K 1.1K 6.5K - 21% open 31.01.2023):

     git clone https://github.com/PaddlePaddle/PaddleOCR
    
  • PyPi ( 37K / month 9 10.05.2022):

     pip install paddleocr
    
EasyOCR (35 17K) - Ready-to-use OCR with 80+ supported languages and all popular writing.. Apache-2
  • GitHub ( 110 2.4K 3.5M 2.1K 720 - 22% open 06.01.2023):

     git clone https://github.com/JaidedAI/EasyOCR
    
  • PyPi ( 120K / month 37 20.09.2022):

     pip install easyocr
    
Tesseract (32 4.6K) - Python-tesseract is an optical character recognition (OCR) tool.. Apache-2
  • GitHub ( 43 630 320 - 6% open 26.12.2022):

     git clone https://github.com/madmaze/pytesseract
    
  • PyPi ( 650K / month 960 19.02.2022):

     pip install pytesseract
    
  • Conda ( 550K 15.03.2022):

     conda install -c conda-forge pytesseract
    
OCRmyPDF (29 8.1K) - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them.. MPL-2.0
  • GitHub ( 79 640 930 - 10% open 01.02.2023):

     git clone https://github.com/ocrmypdf/OCRmyPDF
    
  • PyPi ( 25K / month 13 04.07.2022):

     pip install ocrmypdf
    
  • Conda ( 37K 05.01.2023):

     conda install -c conda-forge ocrmypdf
    
MMOCR (29 3.1K) - OpenMMLab Text Detection, Recognition and Understanding Toolbox. Apache-2
  • GitHub ( 73 580 37 770 - 14% open 06.01.2023):

     git clone https://github.com/open-mmlab/mmocr
    
  • PyPi ( 19K / month 1 05.05.2022):

     pip install mmocr
    
tesserocr (28 1.7K) - A Python wrapper for the tesseract-ocr API. MIT
  • GitHub ( 27 230 770 250 - 30% open 26.08.2022):

     git clone https://github.com/sirfz/tesserocr
    
  • PyPi ( 43K / month 67 19.06.2021):

     pip install tesserocr
    
  • Conda ( 100K 06.11.2022):

     conda install -c conda-forge tesserocr
    
keras-ocr (26 1.1K) - A packaged and flexible version of the CRAFT text detector and.. MIT
  • GitHub ( 17 300 440K 180 190 - 40% open 24.12.2022):

     git clone https://github.com/faustomorales/keras-ocr
    
  • PyPi ( 7.2K / month 2 19.05.2022):

     pip install keras-ocr
    
  • Conda ( 140 14.01.2022):

     conda install -c anaconda keras-ocr
    
pdftabextract (21 2K ) - A set of tools for extracting tables from PDF files.. Apache-2
  • GitHub ( 3 350 42 21 - 14% open 24.06.2022):

     git clone https://github.com/WZBSocialScienceCenter/pdftabextract
    
  • PyPi ( 290 / month 1 09.01.2018):

     pip install pdftabextract
    
calamari (20 960) - Line based ATR Engine based on OCRopy. Apache-2
  • GitHub ( 20 200 260 - 20% open 22.11.2022):

     git clone https://github.com/Calamari-OCR/calamari
    
  • PyPi ( 1.3K / month 2 13.11.2018):

     pip install calamari_ocr
    
Mozart (10 430) - An optical music recognition (OMR) system. Converts sheet music.. Apache-2
  • GitHub ( 5 63 13 - 15% open 24.08.2022):

     git clone https://github.com/aashrafh/Mozart
    
Show 2 hidden projects...
  • attention-ocr (21 940 ) - A Tensorflow model for text recognition (CNN + seq2seq.. MIT
  • doc2text (20 1.3K ) - Detect text blocks and OCR poorly scanned PDFs in bulk. Python.. MIT

Data Containers & Structures

Back to top

General-purpose data containers & structures as well as utilities & extensions for pandas.

 best-of-python - Data Containers ( 2.8K) - Collection of data-container, dataframe, and pandas-..


Data Loading & Extraction

Back to top

Libraries for loading, collecting, and extracting data from a variety of data sources and formats.

 best-of-python - Data Extraction ( 2.8K) - Collection of data-loading and -extraction libraries.


Web Scraping & Crawling

Back to top

Libraries for web scraping, crawling, downloading, and mining as well as libraries.

 best-of-web-python - Web Scraping ( 1.8K) - Collection of web-scraping and crawling libraries.


Data Pipelines & Streaming

Back to top

Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.

 best-of-python - Data Pipelines ( 2.8K) - Libraries for data batch- and stream-processing,..


Distributed Machine Learning

Back to top

Libraries that provide capabilities to distribute and parallelize machine learning tasks across large-scale compute infrastructure.

Ray (45 24K) - Ray is a unified framework for scaling AI and Python applications. Ray.. Apache-2
  • GitHub ( 810 4.2K 7.2K 13K - 20% open 02.02.2023):

     git clone https://github.com/ray-project/ray
    
  • PyPi ( 1.9M / month 280 09.06.2022):

     pip install ray
    
  • Conda ( 93K 25.01.2023):

     conda install -c conda-forge ray-tune
    
dask (43 11K) - Parallel computing with task scheduling. BSD-3
  • GitHub ( 580 1.6K 44K 4.8K - 18% open 02.02.2023):

     git clone https://github.com/dask/dask
    
  • PyPi ( 7.7M / month 2.7K 27.01.2023):

     pip install dask
    
  • Conda ( 7.8M 28.01.2023):

     conda install -c conda-forge dask
    
dask.distributed (40 1.4K) - A distributed task scheduler for Dask. BSD-3
  • GitHub ( 310 670 27K 3.4K - 38% open 30.01.2023):

     git clone https://github.com/dask/distributed
    
  • PyPi ( 4.9M / month 1.2K 24.06.2022):

     pip install distributed
    
  • Conda ( 9.3M 28.01.2023):

     conda install -c conda-forge distributed
    
horovod (37 13K) - Distributed training framework for TensorFlow, Keras, PyTorch, and.. Apache-2
  • GitHub ( 160 2.2K 790 2.1K - 15% open 01.02.2023):

     git clone https://github.com/horovod/horovod
    
  • PyPi ( 77K / month 30 21.06.2022):

     pip install horovod
    
DeepSpeed (35 8.7K) - DeepSpeed is a deep learning optimization library that makes.. MIT
  • GitHub ( 160 1K 690 1.3K - 46% open 02.02.2023):

     git clone https://github.com/microsoft/DeepSpeed
    
  • PyPi ( 370K / month 12 25.05.2022):

     pip install deepspeed
    
  • Docker Hub ( 15K 3 02.09.2022):

     docker pull deepspeed/deepspeed
    
BigDL (35 4.1K) - Fast, distributed, secure AI for Big Data. Apache-2
  • GitHub ( 180 1.1K 42 1.8K - 38% open 02.02.2023):

     git clone https://github.com/intel-analytics/BigDL
    
  • PyPi ( 4.1K / month 1 07.07.2022):

     pip install bigdl
    
  • Maven ( 4 20.04.2021):

     <dependency>
     	<groupId>com.intel.analytics.bigdl</groupId>
     	<artifactId>bigdl-SPARK_2.4</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
metrics (34 1.2K) - Machine learning metrics for distributed, scalable PyTorch.. Apache-2
  • GitHub ( 170 270 1.3K 7.3K 540 - 11% open 02.02.2023):

     git clone https://github.com/Lightning-AI/metrics
    
  • PyPi ( 4.6K / month 14 28.04.2018):

     pip install metrics
    
  • Conda ( 670K 31.01.2023):

     conda install -c conda-forge torchmetrics
    
H2O-3 (32 6.1K) - H2O is an Open Source, Distributed, Fast & Scalable Machine Learning.. Apache-2
  • GitHub ( 240 1.9K 02.02.2023):

     git clone https://github.com/h2oai/h2o-3
    
  • PyPi ( 380K / month 76 26.05.2022):

     pip install h2o
    
FairScale (32 2K) - PyTorch extensions for high performance and large scale training. BSD-3
  • GitHub ( 65 200 1.3K 340 - 19% open 12.12.2022):

     git clone https://github.com/facebookresearch/fairscale
    
  • PyPi ( 640K / month 20 09.03.2022):

     pip install fairscale
    
  • Conda ( 87K 12.12.2022):

     conda install -c conda-forge fairscale
    
ColossalAI (29 8.2K) - Colossal-AI: A Unified Deep Learning System for Big Model Era. Apache-2
  • GitHub ( 88 730 160 380 - 38% open 02.02.2023):

     git clone https://github.com/hpcaitech/colossalai
    
SynapseML (29 3.9K) - Simple and Distributed Machine Learning. MIT
  • GitHub ( 100 720 650 - 43% open 02.02.2023):

     git clone https://github.com/microsoft/SynapseML
    
  • PyPi ( 60K / month 1 12.01.2022):

     pip install synapseml
    
Elephas (28 1.6K) - Distributed Deep learning with Keras & Spark. MIT keras
  • GitHub ( 27 310 64 160 - 4% open 31.08.2022):

     git clone https://github.com/maxpumperla/elephas
    
  • PyPi ( 58K / month 3 30.03.2022):

     pip install elephas
    
  • Conda ( 11K 02.06.2021):

     conda install -c conda-forge elephas
    
mpi4py (28 610) - Python bindings for MPI. BSD-2
  • GitHub ( 22 88 8.9K 100 - 7% open 01.02.2023):

     git clone https://github.com/mpi4py/mpi4py
    
  • PyPi ( 190K / month 620 25.11.2021):

     pip install mpi4py
    
  • Conda ( 1.6M 03.11.2022):

     conda install -c conda-forge mpi4py
    
petastorm (27 1.6K) - Petastorm library enables single machine or distributed training.. Apache-2
  • GitHub ( 46 260 350 99 310 - 52% open 30.01.2023):

     git clone https://github.com/uber/petastorm
    
  • PyPi ( 49K / month 7 19.02.2022):

     pip install petastorm
    
Hivemind (27 1.3K) - Decentralized deep learning in PyTorch. Built to train models on.. MIT
  • GitHub ( 27 84 46 140 - 33% open 11.01.2023):

     git clone https://github.com/learning-at-home/hivemind
    
  • PyPi ( 9.1K / month 2 20.06.2022):

     pip install hivemind
    
dask-ml (27 840) - Scalable Machine Learning with Dask. BSD-3
  • GitHub ( 76 240 740 510 - 51% open 11.01.2023):

     git clone https://github.com/dask/dask-ml
    
  • PyPi ( 110K / month 58 27.05.2022):

     pip install dask-ml
    
  • Conda ( 650K 27.05.2022):

     conda install -c conda-forge dask-ml
    
TensorFlowOnSpark (26 3.8K ) - TensorFlowOnSpark brings TensorFlow programs to.. Apache-2
  • GitHub ( 34 960 360 - 2% open 21.04.2022):

     git clone https://github.com/yahoo/TensorFlowOnSpark
    
  • PyPi ( 170K / month 5 21.04.2022):

     pip install tensorflowonspark
    
  • Conda ( 16K 21.08.2022):

     conda install -c conda-forge tensorflowonspark
    
analytics-zoo (25 2.5K) - Distributed Tensorflow, Keras and PyTorch on Apache.. Apache-2
  • GitHub ( 110 720 3 1.4K - 39% open 20.12.2022):

     git clone https://github.com/intel-analytics/analytics-zoo
    
  • PyPi ( 260 / month 1 07.07.2022):

     pip install analytics-zoo
    
Mesh (25 1.4K) - Mesh TensorFlow: Model Parallelism Made Easier. Apache-2
  • GitHub ( 49 230 770 110 - 87% open 25.01.2023):

     git clone https://github.com/tensorflow/mesh
    
  • PyPi ( 48K / month 32 15.05.2022):

     pip install mesh-tensorflow
    
MMLSpark (24 3.9K) - Simple and Distributed Machine Learning. MIT
  • GitHub ( 100 720 650 - 43% open 02.02.2023):

     git clone https://github.com/microsoft/SynapseML
    
  • PyPi ( 3 / month 18.03.2020):

     pip install mmlspark
    
Apache Singa (21 2.7K ) - a distributed deep learning platform. Apache-2
  • GitHub ( 79 880 2 100 - 36% open 01.06.2022):

     git clone https://github.com/apache/singa
    
  • Conda ( 600 09.08.2021):

     conda install -c nusdbsystem singa
    
  • Docker Hub ( 2.7K 4 31.05.2022):

     docker pull apache/singa
    
BytePS (20 3.3K ) - A high performance and generic framework for distributed DNN.. Apache-2
  • GitHub ( 21 460 260 - 38% open 10.02.2022):

     git clone https://github.com/bytedance/byteps
    
  • PyPi ( 34 / month 02.08.2021):

     pip install byteps
    
  • Docker Hub ( 1.3K 03.03.2020):

     docker pull bytepsimage/tensorflow
    
Submit it (19 770) - Python 3.6+ toolbox for submitting jobs to Slurm. MIT
  • GitHub ( 23 87 87 - 42% open 28.09.2022):

     git clone https://github.com/facebookincubator/submitit
    
  • PyPi ( 36K / month 6 07.04.2022):

     pip install submitit
    
  • Conda ( 15K 10.02.2021):

     conda install -c conda-forge submitit
    
parallelformers (18 580 ) - Parallelformers: An Efficient Model Parallelization.. Apache-2
  • GitHub ( 5 41 12 38 - 55% open 27.07.2022):

     git clone https://github.com/tunib-ai/parallelformers
    
  • PyPi ( 1.3K / month 29.12.2021):

     pip install parallelformers
    
mesh-transformer-jax (16 5.1K) - Model parallel transformers in JAX and Haiku. Apache-2
  • GitHub ( 23 720 190 - 19% open 12.01.2023):

     git clone https://github.com/kingoflolz/mesh-transformer-jax
    
moolib (10 350) - A library for distributed ML training with PyTorch. MIT
  • GitHub ( 6 16 3 18 - 33% open 12.12.2022):

     git clone https://github.com/facebookresearch/moolib
    
Show 10 hidden projects...
  • DEAP (31 5K) - Distributed Evolutionary Algorithms in Python. LGPL-3.0
  • ipyparallel (29 2.3K) - IPython Parallel: Interactive Parallel Computing in.. Unlicensed
  • sk-dist (21 280 ) - Distributed scikit-learn meta-estimators in PySpark. Apache-2
  • TensorFrames (20 760 ) - [DEPRECATED] Tensorflow wrapper for DataFrames on.. Apache-2
  • launchpad (20 280) - Launchpad is a library that simplifies writing distributed.. Apache-2
  • Fiber (19 1K ) - Distributed Computing for AI Made Simple. Apache-2
  • somoclu (19 240) - Massively parallel self-organizing maps: accelerate training on multicore.. MIT
  • bluefog (16 280 ) - Distributed and decentralized training framework for PyTorch.. Apache-2
  • LazyCluster (14 44 ) - Distributed machine learning made simple. Apache-2
  • autodist (10 130 ) - Simple Distributed Deep Learning on TensorFlow. Apache-2

Hyperparameter Optimization & AutoML

Back to top

Libraries for hyperparameter optimization, automl and neural architecture search.

Optuna (40 7.5K) - A hyperparameter optimization framework. MIT
  • GitHub ( 230 800 5.5K 1.4K - 7% open 02.02.2023):

     git clone https://github.com/optuna/optuna
    
  • PyPi ( 1.7M / month 240 13.06.2022):

     pip install optuna
    
  • Conda ( 380K 20.01.2023):

     conda install -c conda-forge optuna
    
NNI (36 12K) - An open source AutoML toolkit for automate machine learning lifecycle,.. MIT
  • GitHub ( 180 1.7K 340 1.8K - 15% open 02.02.2023):

     git clone https://github.com/microsoft/nni
    
  • PyPi ( 12K / month 30 22.06.2022):

     pip install nni
    
featuretools (35 6.5K) - An open source python library for automated feature engineering. BSD-3
  • GitHub ( 69 840 1.2K 910 - 17% open 01.02.2023):

     git clone https://github.com/alteryx/featuretools
    
  • PyPi ( 92K / month 65 05.07.2022):

     pip install featuretools
    
  • Conda ( 130K 01.02.2023):

     conda install -c conda-forge featuretools
    
AutoKeras (34 8.7K) - AutoML library for deep learning. Apache-2
  • GitHub ( 140 1.4K 12K 420 860 - 13% open 01.02.2023):

     git clone https://github.com/keras-team/autokeras
    
  • PyPi ( 9.8K / month 10 30.04.2022):

     pip install autokeras
    
Keras Tuner (34 2.7K) - A Hyperparameter Tuning Library for Keras. Apache-2
  • GitHub ( 53 350 2K 430 - 42% open 28.01.2023):

     git clone https://github.com/keras-team/keras-tuner
    
  • PyPi ( 600K / month 46 25.03.2022):

     pip install keras-tuner
    
  • Conda ( 13K 29.01.2023):

     conda install -c conda-forge keras-tuner
    
Bayesian Optimization (33 6.5K) - A Python implementation of global optimization with.. MIT
  • GitHub ( 37 1.4K 110 1.6K 300 - 5% open 04.12.2022):

     git clone https://github.com/fmfn/BayesianOptimization
    
  • PyPi ( 180K / month 91 16.05.2020):

     pip install bayesian-optimization
    
BoTorch (33 2.5K) - Bayesian optimization in PyTorch. MIT
  • GitHub ( 86 300 420 380 - 20% open 31.01.2023):

     git clone https://github.com/pytorch/botorch
    
  • PyPi ( 200K / month 18 21.04.2022):

     pip install botorch
    
  • Conda ( 55K 06.01.2023):

     conda install -c conda-forge botorch
    
Ax (33 2K) - Adaptive Experimentation Platform. MIT
  • GitHub ( 140 240 400 510 - 8% open 01.02.2023):

     git clone https://github.com/facebook/Ax
    
  • PyPi ( 120K / month 15 26.04.2022):

     pip install ax-platform
    
  • Conda ( 6.1K 18.01.2023):

     conda install -c conda-forge ax-platform
    
auto-sklearn (32 6.7K) - Automated Machine Learning with scikit-learn. BSD-3
  • GitHub ( 88 1.2K 41 390 960 - 15% open 07.12.2022):

     git clone https://github.com/automl/auto-sklearn
    
  • PyPi ( 32K / month 30 20.09.2022):

     pip install auto-sklearn
    
  • Conda ( 14K 21.09.2022):

     conda install -c conda-forge auto-sklearn
    
AutoGluon (30 5.3K) - AutoGluon: AutoML for Image, Text, Time Series, and.. Apache-2
  • GitHub ( 100 690 900 - 20% open 02.02.2023):

     git clone https://github.com/autogluon/autogluon
    
  • PyPi ( 32K / month 5 07.07.2022):

     pip install autogluon
    
  • Docker Hub ( 2.7K 11 17.12.2022):

     docker pull autogluon/autogluon
    
nevergrad (30 3.4K) - A Python toolbox for performing gradient-free optimization. MIT
  • GitHub ( 49 320 440 260 - 39% open 11.01.2023):

     git clone https://github.com/facebookresearch/nevergrad
    
  • PyPi ( 31K / month 20 08.03.2022):

     pip install nevergrad
    
  • Conda ( 36K 14.06.2021):

     conda install -c conda-forge nevergrad
    
Hyperas (27 2.1K) - Keras + Hyperopt: A very simple wrapper for convenient.. MIT
  • GitHub ( 22 320 280 250 - 37% open 05.01.2023):

     git clone https://github.com/maxpumperla/hyperas
    
  • PyPi ( 14K / month 24 28.02.2019):

     pip install hyperas
    
GPyOpt (27 850) - Gaussian Process Optimization using GPy. BSD-3
  • GitHub ( 49 250 360 290 - 35% open 17.01.2023):

     git clone https://github.com/SheffieldML/GPyOpt
    
  • PyPi ( 67K / month 32 19.03.2020):

     pip install gpyopt
    
mljar-supervised (26 2.4K) - Python package for AutoML on Tabular Data with Feature.. MIT
  • GitHub ( 20 320 63 530 - 22% open 30.12.2022):

     git clone https://github.com/mljar/mljar-supervised
    
  • PyPi ( 6.1K / month 2 02.03.2022):

     pip install mljar-supervised
    
  • Conda ( 6.4K 30.12.2022):

     conda install -c conda-forge mljar-supervised
    
lazypredict (25 1.2K) - Lazy Predict help build a lot of basic models without much code.. MIT
  • GitHub ( 18 140 410 90 - 58% open 28.09.2022):

     git clone https://github.com/shankarpandala/lazypredict
    
  • PyPi ( 14K / month 17.02.2021):

     pip install lazypredict
    
  • Conda ( 1.5K 29.09.2022):

     conda install -c conda-forge lazypredict
    
FEDOT (24 490) - Automated modeling and machine learning framework FEDOT. BSD-3
  • GitHub ( 27 67 27 430 - 19% open 01.02.2023):

     git clone https://github.com/nccr-itmo/FEDOT
    
  • PyPi ( 880 / month 28.03.2022):

     pip install fedot
    
Talos (23 1.6K) - Hyperparameter Optimization for TensorFlow, Keras and PyTorch. MIT
  • GitHub ( 22 260 160 400 - 2% open 18.09.2022):

     git clone https://github.com/autonomio/talos
    
  • PyPi ( 840 / month 7 28.05.2022):

     pip install talos
    
Hyperactive (23 430) - An optimization and data collection toolbox for convenient and fast.. MIT
  • GitHub ( 6 39 110 17 53 - 16% open 01.02.2023):

     git clone https://github.com/SimonBlanke/Hyperactive
    
  • PyPi ( 580 / month 4 04.05.2022):

     pip install hyperactive
    
HpBandSter (22 560 ) - a distributed Hyperband implementation on Steroids. BSD-3
  • GitHub ( 11 110 300 92 - 61% open 22.04.2022):

     git clone https://github.com/automl/HpBandSter
    
  • PyPi ( 18K / month 10 06.11.2018):

     pip install hpbandster
    
  • Conda ( 6.1K 11.12.2020):

     conda install -c conda-forge hpbandster
    
featurewiz (22 350) - Use advanced feature engineering strategies and select best.. Apache-2
  • GitHub ( 7 64 34 59 - 1% open 06.01.2023):

     git clone https://github.com/AutoViML/featurewiz
    
  • PyPi ( 16K / month 1 05.07.2022):

     pip install featurewiz
    
igel (20 3K ) - a delightful machine learning tool that allows you to train, test, and use.. MIT
  • GitHub ( 19 160 33 3 50 - 12% open 06.02.2022):

     git clone https://github.com/nidhaloff/igel
    
  • PyPi ( 150 / month 19.11.2021):

     pip install igel
    
Dragonfly (20 750) - An open source python library for scalable Bayesian optimisation. MIT
  • GitHub ( 13 220 58 - 65% open 01.10.2022):

     git clone https://github.com/dragonfly/dragonfly
    
  • PyPi ( 36K / month 2 03.07.2020):

     pip install dragonfly-opt
    
Neuraxle (20 560) - The worlds cleanest AutoML library - Do hyperparameter tuning with.. Apache-2
  • GitHub ( 9 55 41 330 - 18% open 16.08.2022):

     git clone https://github.com/Neuraxio/Neuraxle
    
  • PyPi ( 240 / month 1 15.04.2022):

     pip install neuraxle
    
opytimizer (19 550) - Opytimizer is a Python library consisting of meta-heuristic.. Apache-2
  • GitHub ( 4 35 13 06.10.2022):

     git clone https://github.com/gugarosa/opytimizer
    
  • PyPi ( 720 / month 04.05.2022):

     pip install opytimizer
    
Auto ViML (19 410) - Automatically Build Multiple ML Models with a Single Line of Code... Apache-2
  • GitHub ( 6 84 20 25 - 16% open 16.08.2022):

     git clone https://github.com/AutoViML/Auto_ViML
    
  • PyPi ( 800 / month 2 01.06.2022):

     pip install autoviml
    
AlphaPy (18 850 ) - Automated Machine Learning [AutoML] with Python, scikit-learn,.. Apache-2
  • GitHub ( 3 160 3 41 - 29% open 23.04.2022):

     git clone https://github.com/ScottfreeLLC/AlphaPy
    
  • PyPi ( 130 / month 29.08.2020):

     pip install alphapy
    
shap-hypetune (16 430) - A python package for simultaneous Hyperparameters Tuning and.. MIT
  • GitHub ( 2 53 10 21 - 4% open 24.08.2022):

     git clone https://github.com/cerlymarco/shap-hypetune
    
  • PyPi ( 5K / month 1 16.01.2022):

     pip install shap-hypetune
    
model_search (11 3.2K ) - AutoML algorithms for model architecture search at scale. Apache-2
  • GitHub ( 1 360 62 - 75% open 09.02.2022):

     git clone https://github.com/google/model_search
    
Show 24 hidden projects...
  • scikit-optimize (33 2.5K ) - Sequential model-based optimization with a.. BSD-3
  • TPOT (32 8.9K ) - A Python Automated Machine Learning tool that optimizes.. LGPL-3.0
  • Hyperopt (32 6.6K ) - Distributed Asynchronous Hyperparameter Optimization in Python. BSD-3
  • Orion (27 260) - Asynchronous Distributed Hyperparameter Optimization. BSD-3
  • SMAC3 (24 790) - SMAC3: A Versatile Bayesian Optimization Package for.. BSD-1-Clause
  • AdaNet (23 3.4K ) - Fast and flexible AutoML with learning guarantees. Apache-2
  • auto_ml (22 1.6K ) - [UNMAINTAINED] Automated machine learning for analytics & production. MIT
  • MLBox (22 1.4K ) - MLBox is a powerful Automated Machine Learning python library. BSD-1-Clause
  • Test Tube (22 730 ) - Python library to easily log experiments and parallelize.. MIT
  • optunity (22 400 ) - optimization routines for hyperparameter tuning. BSD-3
  • lightwood (22 320) - Lightwood is Legos for Machine Learning. GPL-3.0
  • sklearn-deap (20 710 ) - Use evolutionary algorithms instead of gridsearch in.. MIT
  • Sherpa (19 320 ) - Hyperparameter optimization that enables researchers to.. GPL-3.0
  • Auto Tune Models (18 520 ) - Auto Tune Models - A multi-tenant, multi-data system for.. MIT
  • Advisor (17 1.5K ) - Open-source implementation of Google Vizier for hyper parameters.. Apache-2
  • Parfit (17 200 ) - A package for parallelizing the fit and flexibly scoring of.. MIT
  • Xcessiv (16 1.3K ) - A web-based application for quick, scalable, and automated.. Apache-2
  • HyperparameterHunter (16 700 ) - Easy hyperparameter optimization and automatic result.. MIT
  • automl-gs (15 1.8K ) - Provide an input CSV and a target field to predict, generate a.. MIT
  • ENAS (13 2.6K ) - PyTorch implementation of Efficient Neural Architecture Search via.. Apache-2
  • Auptimizer (13 200 ) - An automatic ML model optimization tool. GPL-3.0
  • Devol (11 950 ) - Genetic neural architecture search with Keras. MIT
  • Hypermax (11 100 ) - Better, faster hyper-parameter optimization. BSD-3
  • Hypertunity (9 130 ) - A toolset for black-box hyperparameter optimisation. Apache-2

Reinforcement Learning

Back to top

Libraries for building and evaluating reinforcement learning & agent-based systems.

OpenAI Gym (40 30K) - A toolkit for developing and comparing reinforcement learning.. MIT
  • GitHub ( 380 8.2K 37K 1.7K - 2% open 30.01.2023):

     git clone https://github.com/openai/gym
    
  • PyPi ( 1.1M / month 2.5K 07.06.2022):

     pip install gym
    
  • Conda ( 190K 26.01.2023):

     conda install -c conda-forge gym
    
TF-Agents (33 2.4K) - TF-Agents: A reliable, scalable and easy to use TensorFlow.. Apache-2
  • GitHub ( 130 660 1K 600 - 25% open 19.01.2023):

     git clone https://github.com/tensorflow/agents
    
  • PyPi ( 100K / month 23.10.2022):

     pip install tf-agents
    
FinRL (29 6.7K) - FinRL: Financial Reinforcement Learning. MIT
  • GitHub ( 84 1.6K 19 510 - 20% open 01.02.2023):

     git clone https://github.com/AI4Finance-Foundation/FinRL
    
  • PyPi ( 1K / month 08.01.2022):

     pip install finrl
    
ViZDoom (29 1.5K) - Reinforcement Learning environments based on the 1993 game Doom. MIT
  • GitHub ( 51 340 12K 160 450 - 11% open 30.01.2023):

     git clone https://github.com/mwydmuch/ViZDoom
    
  • PyPi ( 2.5K / month 14 18.04.2022):

     pip install vizdoom
    
Dopamine (28 10K) - Dopamine is a research framework for fast prototyping of.. Apache-2
  • GitHub ( 15 1.3K 170 - 50% open 28.11.2022):

     git clone https://github.com/google/dopamine
    
  • PyPi ( 490K / month 37 20.05.2022):

     pip install dopamine-rl
    
Acme (28 3K) - A library of reinforcement learning components and agents. Apache-2
  • GitHub ( 81 370 130 230 - 16% open 01.02.2023):

     git clone https://github.com/deepmind/acme
    
  • PyPi ( 3.1K / month 2 10.02.2022):

     pip install dm-acme
    
  • Conda ( 5.4K 09.12.2021):

     conda install -c conda-forge dm-acme
    
TensorLayer (27 7.1K ) - Deep Learning and Reinforcement Learning Library for.. Apache-2
  • GitHub ( 130 1.6K 1.4K 470 - 6% open 23.04.2022):

     git clone https://github.com/tensorlayer/tensorlayer
    
  • PyPi ( 1.5K / month 40 15.02.2022):

     pip install tensorlayer
    
PARL (27 2.9K) - A high-performance distributed training framework for Reinforcement.. Apache-2
  • GitHub ( 35 780 100 470 - 19% open 18.01.2023):

     git clone https://github.com/PaddlePaddle/PARL
    
  • PyPi ( 590 / month 1 13.05.2022):

     pip install parl
    
garage (25 1.6K) - A toolkit for reproducible reinforcement learning research. MIT
  • GitHub ( 79 280 63 1K - 22% open 04.01.2023):

     git clone https://github.com/rlworkgroup/garage
    
  • PyPi ( 440 / month 2 23.03.2021):

     pip install garage
    
Stable Baselines (24 3.7K) - A fork of OpenAI Baselines, implementations of reinforcement.. MIT
  • GitHub ( 110 700 940 - 12% open 04.09.2022):

     git clone https://github.com/hill-a/stable-baselines
    
  • PyPi ( 7.5K / month 34 06.04.2021):

     pip install stable-baselines
    
ReAgent (24 3.3K) - A platform for Reasoning systems (Reinforcement Learning,.. BSD-3
  • GitHub ( 150 480 140 - 47% open 02.02.2023):

     git clone https://github.com/facebookresearch/ReAgent
    
  • PyPi ( 24 / month 27.05.2020):

     pip install reagent
    
TensorForce (22 3.2K) - Tensorforce: a TensorFlow library for applied.. Apache-2
  • GitHub ( 82 520 660 - 4% open 15.01.2023):

     git clone https://github.com/tensorforce/tensorforce
    
  • PyPi ( 1.1K / month 30.08.2021):

     pip install tensorforce
    
RLax (22 970) - A library of reinforcement learning building blocks in JAX. Apache-2
  • GitHub ( 20 70 100 32 - 46% open 09.01.2023):

     git clone https://github.com/deepmind/rlax
    
  • PyPi ( 3.1K / month 2 24.02.2022):

     pip install rlax
    
Coach (21 2.2K) - Reinforcement Learning Coach by Intel AI Lab enables easy.. Apache-2
  • GitHub ( 36 430 270 - 32% open 11.12.2022):

     git clone https://github.com/IntelLabs/coach
    
  • PyPi ( 220 / month 2 10.10.2019):

     pip install rl_coach
    
PFRL (20 980) - PFRL: a PyTorch-based deep reinforcement learning library. MIT
  • GitHub ( 18 140 72 72 - 44% open 21.09.2022):

     git clone https://github.com/pfnet/pfrl
    
  • PyPi ( 520 / month 1 07.07.2021):

     pip install pfrl
    
rliable (12 540) - [NeurIPS21 Outstanding Paper] Library for reliable evaluation on RL.. Apache-2
  • GitHub ( 5 32 36 12 - 8% open 14.09.2022):

     git clone https://github.com/google-research/rliable
    
  • PyPi:

     pip install rliable`
    
Show 7 hidden projects...
  • baselines (30 14K ) - OpenAI Baselines: high-quality implementations of reinforcement.. MIT
  • keras-rl (29 5.4K ) - Deep Reinforcement Learning for Keras. MIT
  • TRFL (23 3.1K ) - TensorFlow Reinforcement Learning. Apache-2
  • ChainerRL (23 1.1K ) - ChainerRL is a deep reinforcement learning library built on top of.. MIT
  • DeepMind Lab (19 6.8K) - A customisable 3D platform for agent-based AI research. Unlicensed
  • SerpentAI (18 6.4K ) - Game Agent Framework. Helping you create AIs / Bots that learn to.. MIT
  • Maze (15 220) - Maze Applied Reinforcement Learning Framework. Custom

Recommender Systems

Back to top

Libraries for building and evaluating recommendation systems.

Recommenders (33 15K) - Best Practices on Recommendation Systems. MIT
  • GitHub ( 120 2.6K 290 56 740 - 20% open 21.11.2022):

     git clone https://github.com/microsoft/recommenders
    
  • PyPi ( 20K / month 2 01.04.2022):

     pip install recommenders
    
implicit (31 3.1K) - Fast Python Collaborative Filtering for Implicit Feedback Datasets. MIT
  • GitHub ( 34 570 300 790 440 - 16% open 11.12.2022):

     git clone https://github.com/benfred/implicit
    
  • PyPi ( 130K / month 32 29.01.2022):

     pip install implicit
    
  • Conda ( 440K 29.01.2022):

     conda install -c conda-forge implicit
    
TF Recommenders (30 1.5K) - TensorFlow Recommenders is a library for building.. Apache-2
  • GitHub ( 41 220 170 340 - 52% open 02.02.2023):

     git clone https://github.com/tensorflow/recommenders
    
  • PyPi ( 300K / month 1 23.08.2021):

     pip install tensorflow-recommenders
    
TF Ranking (29 2.6K) - Learning to Rank in TensorFlow. Apache-2
  • GitHub ( 31 450 310 - 24% open 01.02.2023):

     git clone https://github.com/tensorflow/ranking
    
  • PyPi ( 100K / month 12 16.11.2021):

     pip install tensorflow_ranking
    
lightfm (28 4.2K ) - A Python implementation of LightFM, a hybrid recommendation.. Apache-2
  • GitHub ( 44 650 940 470 - 25% open 19.07.2022):

     git clone https://github.com/lyst/lightfm
    
  • PyPi ( 380K / month 45 27.11.2020):

     pip install lightfm
    
  • Conda ( 140K 09.03.2022):

     conda install -c conda-forge lightfm
    
scikit-surprise (27 5.7K ) - A Python scikit for building and analyzing recommender.. BSD-3
  • GitHub ( 45 960 370 - 18% open 27.01.2023):

     git clone https://github.com/NicolasHug/Surprise
    
  • PyPi ( 75K / month 49 19.07.2020):

     pip install scikit-surprise
    
  • Conda ( 290K 31.10.2022):

     conda install -c conda-forge scikit-surprise
    
RecBole (27 2.4K) - A unified, comprehensive and efficient recommendation library. MIT
  • GitHub ( 57 440 610 - 11% open 06.01.2023):

     git clone https://github.com/RUCAIBox/RecBole
    
  • PyPi ( 3.3K / month 1 25.02.2022):

     pip install recbole
    
  • Conda ( 2.7K 05.10.2022):

     conda install -c aibox recbole
    
torchrec (24 1.3K) - Pytorch domain library for recommendation systems. BSD-3
  • GitHub ( 140 220 140 - 60% open 02.02.2023):

     git clone https://github.com/pytorch/torchrec
    
  • PyPi ( 190 / month 12.05.2022):

     pip install torchrec-nightly-cpu
    
Cornac (23 680) - A Comparative Framework for Multimodal Recommender Systems. Apache-2
  • GitHub ( 15 110 140 120 - 8% open 18.10.2022):

     git clone https://github.com/PreferredAI/cornac
    
  • PyPi ( 27K / month 15 19.02.2022):

     pip install cornac
    
  • Conda ( 270K 10.11.2022):

     conda install -c conda-forge cornac
    
recmetrics (18 460 ) - A library of metrics for evaluating recommender systems. MIT
  • GitHub ( 16 89 1 34 21 - 42% open 17.04.2022):

     git clone https://github.com/statisticianinstilettos/recmetrics
    
  • PyPi ( 2.6K / month 26.04.2022):

     pip install recmetrics
    
Show 7 hidden projects...
  • tensorrec (22 1.2K ) - A TensorFlow recommendation algorithm and framework in.. Apache-2
  • lkpy (22 220) - Python recommendation toolkit. MIT
  • fastFM (21 1K ) - fastFM: A Library for Factorization Machines. BSD-3
  • Spotlight (19 2.8K ) - Deep recommender models using PyTorch. MIT
  • Case Recommender (17 440 ) - Case Recommender: A Flexible and Extensible Python.. MIT
  • Collie (17 97) - A library for preparing, training, and evaluating scalable deep.. BSD-3
  • OpenRec (15 400 ) - OpenRec is an open-source and modular library for neural network-.. Apache-2

Privacy Machine Learning

Back to top

Libraries for encrypted and privacy-preserving machine learning using methods like federated learning & differential privacy.

PySyft (36 8.5K) - Data science on data without acquiring a copy. Apache-2
  • GitHub ( 490 1.9K 3.4K - 4% open 02.02.2023):

     git clone https://github.com/OpenMined/PySyft
    
  • PyPi ( 4.2K / month 5 29.06.2022):

     pip install syft
    
Opacus (30 1.3K) - Training PyTorch models with differential privacy. Apache-2
  • GitHub ( 65 260 51 200 220 - 23% open 24.01.2023):

     git clone https://github.com/pytorch/opacus
    
  • PyPi ( 15K / month 11 06.05.2022):

     pip install opacus
    
  • Conda ( 7.3K 09.09.2022):

     conda install -c conda-forge opacus
    
FATE (27 4.8K) - An Industrial Grade Federated Learning Framework. Apache-2
  • GitHub ( 88 1.4K 1.5K - 38% open 04.01.2023):

     git clone https://github.com/FederatedAI/FATE
    
  • PyPi ( 06.05.2020):

     pip install ETAF
    
TensorFlow Privacy (26 1.7K) - Library for training machine learning models with.. Apache-2
  • GitHub ( 51 400 94 180 - 51% open 26.01.2023):

     git clone https://github.com/tensorflow/privacy
    
  • PyPi ( 32K / month 7 22.02.2022):

     pip install tensorflow-privacy
    
CrypTen (24 1.2K) - A framework for Privacy Preserving Machine Learning. MIT
  • GitHub ( 31 210 25 220 - 20% open 08.12.2022):

     git clone https://github.com/facebookresearch/CrypTen
    
  • PyPi ( 520 / month 2 09.09.2021):

     pip install crypten
    
TFEncrypted (23 1.1K) - A Framework for Encrypted Machine Learning in TensorFlow. Apache-2
  • GitHub ( 29 200 64 420 - 32% open 16.11.2022):

     git clone https://github.com/tf-encrypted/tf-encrypted
    
  • PyPi ( 890 / month 9 07.03.2022):

     pip install tf-encrypted
    
Show 1 hidden projects...
  • PipelineDP (20 240) - PipelineDP is a Python framework for applying differentially.. Apache-2

Workflow & Experiment Tracking

Back to top

Libraries to organize, track, and visualize machine learning experiments.

Tensorboard (43 6.1K) - TensorFlows Visualization Toolkit. Apache-2
  • GitHub ( 290 1.6K 150K 1.8K - 34% open 01.02.2023):

     git clone https://github.com/tensorflow/tensorboard
    
  • PyPi ( 17M / month 2.4K 08.06.2022):

     pip install tensorboard
    
  • Conda ( 3.7M 14.01.2023):

     conda install -c conda-forge tensorboard
    
mlflow (41 14K) - Open source platform for the machine learning lifecycle. Apache-2
  • GitHub ( 540 3.2K 2.9K - 35% open 02.02.2023):

     git clone https://github.com/mlflow/mlflow
    
  • PyPi ( 10M / month 330 29.06.2022):

     pip install mlflow
    
  • Conda ( 1.2M 19.01.2023):

     conda install -c conda-forge mlflow
    
DVC (41 11K) - Data Version Control | Git for Data & Models | ML Experiments Management. Apache-2
  • GitHub ( 280 1K 120K 6.2K 4.1K - 15% open 02.02.2023):

     git clone https://github.com/iterative/dvc
    
  • PyPi ( 1.5M / month 52 05.07.2022):

     pip install dvc
    
  • Conda ( 1.4M 16.12.2022):

     conda install -c conda-forge dvc
    
PyCaret (37 6.9K) - An open-source, low-code machine learning library in Python. MIT
  • GitHub ( 110 1.5K 620 3.1K 1.9K - 13% open 30.01.2023):

     git clone https://github.com/pycaret/pycaret
    
  • PyPi ( 710K / month 13 06.06.2022):

     pip install pycaret
    
  • Conda ( 27K 18.04.2022):

     conda install -c conda-forge pycaret
    
wandb client (37 5.4K) - A tool for visualizing and tracking your machine learning.. MIT
  • GitHub ( 140 420 16K 2.2K - 27% open 01.02.2023):

     git clone https://github.com/wandb/client
    
  • PyPi ( 1.5M / month 270 05.07.2022):

     pip install wandb
    
  • Conda ( 150K 13.01.2023):

     conda install -c conda-forge wandb
    
SageMaker SDK (37 1.8K) - A library for training and deploying machine learning.. Apache-2
  • GitHub ( 320 920 2K 1.2K - 37% open 02.02.2023):

     git clone https://github.com/aws/sagemaker-python-sdk
    
  • PyPi ( 23M / month 53 05.07.2022):

     pip install sagemaker
    
  • Conda ( 510K 03.11.2022):

     conda install -c conda-forge sagemaker-python-sdk
    
AzureML SDK (35 3.6K) - Python notebooks with ML and deep learning examples with Azure.. MIT
  • GitHub ( 61 2.3K 480 1.4K - 24% open 20.12.2022):

     git clone https://github.com/Azure/MachineLearningNotebooks
    
  • PyPi ( 1.1M / month 45 27.06.2022):

     pip install azureml-sdk
    
tensorboardX (33 7.5K) - tensorboard for pytorch (and chainer, mxnet, numpy, ...). MIT
  • GitHub ( 74 870 350 26K 440 - 16% open 27.01.2023):

     git clone https://github.com/lanpa/tensorboardX
    
  • PyPi ( 1.5M / month 890 05.06.2022):

     pip install tensorboardX
    
  • Conda ( 940K 07.06.2022):

     conda install -c conda-forge tensorboardx
    
ClearML (33 4K) - ClearML - Auto-Magical CI/CD to streamline your ML workflow... Apache-2
  • GitHub ( 67 540 650 400 720 - 46% open 02.02.2023):

     git clone https://github.com/allegroai/clearml
    
  • PyPi ( 220K / month 8 04.07.2022):

     pip install clearml
    
  • Docker Hub ( 30K 05.10.2020):

     docker pull allegroai/trains
    
snakemake (33 1.6K) - This is the development home of the workflow management system.. MIT
  • GitHub ( 280 400 1.3K 1.2K - 59% open 30.01.2023):

     git clone https://github.com/snakemake/snakemake
    
  • PyPi ( 28K / month 210 30.06.2022):

     pip install snakemake
    
  • Conda ( 640K 19.01.2023):

     conda install -c bioconda snakemake
    
Metaflow (32 6.4K) - Build and manage real-life data science projects with ease!. Apache-2
  • GitHub ( 64 600 380 500 - 48% open 01.02.2023):

     git clone https://github.com/Netflix/metaflow
    
  • PyPi ( 86K / month 9 17.06.2022):

     pip install metaflow
    
  • Conda ( 95K 26.01.2023):

     conda install -c conda-forge metaflow
    
VisualDL (32 4.5K) - Deep Learning Visualization Toolkit. Apache-2
  • GitHub ( 32 600 260 1.7K 440 - 22% open 17.01.2023):

     git clone https://github.com/PaddlePaddle/VisualDL
    
  • PyPi ( 75K / month 25 01.07.2022):

     pip install visualdl
    
sacred (32 4K) - Sacred is a tool to help you configure, organize, log and reproduce.. MIT
  • GitHub ( 100 370 1.8K 550 - 16% open 28.01.2023):

     git clone https://github.com/IDSIA/sacred
    
  • PyPi ( 38K / month 100 14.12.2020):

     pip install sacred
    
  • Conda ( 2.3K 14.11.2021):

     conda install -c conda-forge sacred
    
aim (31 3.1K) - Aim easy-to-use and performant open-source ML experiment tracker. Apache-2
  • GitHub ( 50 190 140 790 - 26% open 02.02.2023):

     git clone https://github.com/aimhubio/aim
    
  • PyPi ( 28K / month 5 07.07.2022):

     pip install aim
    
  • Conda ( 21K 16.01.2023):

     conda install -c conda-forge aim
    
Neptune.ai (29 360) - Experiment tracking tool and model registry. Apache-2
  • GitHub ( 38 37 180 - 12% open 02.02.2023):

     git clone https://github.com/neptune-ai/neptune-client
    
  • PyPi ( 600K / month 30 01.07.2022):

     pip install neptune-client
    
  • Conda ( 150K 02.02.2023):

     conda install -c conda-forge neptune-client
    
Catalyst (28 3.1K ) - Accelerated deep learning R&D. Apache-2
  • GitHub ( 100 360 780 350 - 0% open 29.04.2022):

     git clone https://github.com/catalyst-team/catalyst
    
  • PyPi ( 49K / month 29 29.04.2022):

     pip install catalyst
    
Guild AI (28 770) - Experiment tracking, ML developer tools. Apache-2
  • GitHub ( 24 70 7 66 400 - 46% open 02.02.2023):

     git clone https://github.com/guildai/guildai
    
  • PyPi ( 3.2K / month 11.05.2022):

     pip install guildai
    
ml-metadata (28 510) - For recording and retrieving metadata associated with ML.. Apache-2
  • GitHub ( 15 110 1.8K 290 96 - 27% open 21.01.2023):

     git clone https://github.com/google/ml-metadata
    
  • PyPi ( 830K / month 19 21.06.2022):

     pip install ml-metadata
    
TNT (27 1.5K) - A lightweight library for PyTorch training tools and utilities. BSD-3
  • GitHub ( 78 220 65 - 7% open 02.02.2023):

     git clone https://github.com/pytorch/tnt
    
  • PyPi ( 17K / month 37 29.07.2018):

     pip install torchnet
    
livelossplot (25 1.2K ) - Live training loss plot in Jupyter Notebook for Keras,.. MIT
  • GitHub ( 17 140 940 76 - 7% open 04.04.2022):

     git clone https://github.com/stared/livelossplot
    
  • PyPi ( 16K / month 11 04.04.2022):

     pip install livelossplot
    
Labml (24 1.3K) - Monitor deep learning model training and hardware usage from your mobile.. MIT
  • GitHub ( 7 87 66 35 - 51% open 23.11.2022):

     git clone https://github.com/labmlai/labml
    
  • PyPi ( 1.3K / month 6 05.07.2022):

     pip install labml
    
Studio.ml (21 380) - Studio: Simplify and expedite model building process. Apache-2
  • GitHub ( 22 52 5 250 - 22% open 09.01.2023):

     git clone https://github.com/studioml/studio
    
  • PyPi ( 270 / month 14.09.2021):

     pip install studioml
    
lore (20 1.5K) - Lore makes machine learning approachable for Software Engineers and.. MIT
  • GitHub ( 27 130 20 45 - 57% open 27.09.2022):

     git clone https://github.com/instacart/lore
    
  • PyPi ( 1.1K / month 1 02.02.2022):

     pip install lore
    
keepsake (18 1.6K ) - Version control for machine learning. Apache-2
  • GitHub ( 17 66 190 - 65% open 24.05.2022):

     git clone https://github.com/replicate/keepsake
    
  • PyPi ( 390 / month 11.03.2021):

     pip install keepsake
    
Show 15 hidden projects...
  • kaggle (28 5.1K ) - Official Kaggle API. Apache-2
  • SKLL (26 530) - SciKit-Learn Laboratory (SKLL) makes it easy to run machine.. BSD-1-Clause
  • knockknock (24 2.5K ) - Knock Knock: Get notified when your training ends with only two.. MIT
  • gokart (24 270) - Gokart solves reproducibility, task dependencies, constraints of good code,.. MIT
  • quinn (23 400 ) - pyspark methods to enhance developer productivity. Apache-2
  • TensorWatch (21 3.3K ) - Debugging, monitoring and visualization for Python Machine.. MIT
  • hiddenlayer (21 1.7K ) - Neural network graphs and training metrics for.. MIT
  • TensorBoard Logger (21 620 ) - Log TensorBoard events without touching TensorFlow. MIT
  • MXBoard (19 330 ) - Logging MXNet data for visualization in TensorBoard. Apache-2
  • datmo (17 340 ) - Open source production model management tool for data scientists. MIT
  • chitra (16 210 ) - A multi-functional library for full-stack Deep Learning... Apache-2
  • steppy (16 130 ) - Lightweight, Python library for fast and reproducible experimentation. MIT
  • caliban (15 450 ) - Research workflows made easy, locally and in the Cloud. Apache-2
  • ModelChimp (13 120 ) - Experiment tracking for machine and deep learning projects. BSD-2
  • traintool (8 11 ) - Train off-the-shelf machine learning models in one.. Apache-2

Model Serialization & Deployment

Back to top

Libraries to serialize models to files, convert between a variety of model formats, and optimize models for deployment.

onnx (41 14K) - Open standard for machine learning interoperability. Apache-2
  • GitHub ( 270 3.3K 18K 11K 2.3K - 15% open 02.02.2023):

     git clone https://github.com/onnx/onnx
    
  • PyPi ( 2.1M / month 400 12.12.2022):

     pip install onnx
    
  • Conda ( 630K 15.12.2022):

     conda install -c conda-forge onnx
    
BentoML (32 4.5K) - Unified Model Serving Framework. Apache-2
  • GitHub ( 140 520 1.7K 510 830 - 17% open 02.02.2023):

     git clone https://github.com/bentoml/BentoML
    
  • PyPi ( 33K / month 3 01.07.2022):

     pip install bentoml
    
Core ML Tools (32 3.1K) - Core ML tools contain supporting tools for Core ML model.. BSD-3
  • GitHub ( 140 470 4.8K 1.3K 1.1K - 16% open 20.01.2023):

     git clone https://github.com/apple/coremltools
    
  • PyPi ( 480K / month 150 07.06.2022):

     pip install coremltools
    
  • Conda ( 42K 15.10.2021):

     conda install -c conda-forge coremltools
    
huggingface_hub (32 680) - All the open source things related to the Hugging Face Hub. Apache-2
  • GitHub ( 82 170 400 - 20% open 31.01.2023):

     git clone https://github.com/huggingface/huggingface_hub
    
  • PyPi ( 7.3M / month 87 21.06.2022):

     pip install huggingface_hub
    
  • Conda ( 730K 25.01.2023):

     conda install -c conda-forge huggingface_hub
    
triton (31 5.1K) - Development repository for the Triton language and compiler. MIT
  • GitHub ( 71 430 260 400 - 41% open 02.02.2023):

     git clone https://github.com/openai/triton
    
  • PyPi ( 280K / month 3 04.07.2022):

     pip install triton
    
TorchServe (30 3.2K) - Serve, optimize and scale PyTorch models in production. Apache-2
  • GitHub ( 150 670 2.4K 1.1K - 17% open 31.01.2023):

     git clone https://github.com/pytorch/serve
    
  • PyPi ( 20K / month 9 13.05.2022):

     pip install torchserve
    
  • Conda ( 62K 12.12.2022):

     conda install -c pytorch torchserve
    
  • Docker Hub ( 1.1M 16 12.12.2022):

     docker pull pytorch/torchserve
    
Hummingbird (29 3.1K) - Hummingbird compiles trained ML models into tensor computation for.. MIT
  • GitHub ( 37 260 210 51 280 - 19% open 25.01.2023):

     git clone https://github.com/microsoft/hummingbird
    
  • PyPi ( 26K / month 25.04.2022):

     pip install hummingbird-ml
    
  • Conda ( 21K 29.11.2022):

     conda install -c conda-forge hummingbird-ml
    
nebullvm (26 2.2K) - Plug and play modules to optimize the performances of your AI.. Apache-2
  • GitHub ( 23 120 7 86 - 29% open 30.01.2023):

     git clone https://github.com/nebuly-ai/nebullvm
    
  • PyPi ( 1.3K / month 28.06.2022):

     pip install nebullvm
    
cortex (25 7.9K) - Production infrastructure for machine learning at scale. Apache-2
  • GitHub ( 24 600 1.1K - 10% open 23.09.2022):

     git clone https://github.com/cortexlabs/cortex
    
  • PyPi ( 1.2K / month 1 10.01.2022):

     pip install cortex
    
mmdnn (25 5.7K) - MMdnn is a set of tools to help users inter-operate among different deep.. MIT
  • GitHub ( 86 970 3.6K 100 620 - 53% open 22.09.2022):

     git clone https://github.com/Microsoft/MMdnn
    
  • PyPi ( 580 / month 3 24.07.2020):

     pip install mmdnn
    
m2cgen (25 2.4K) - Transform ML models into a native code (Java, C, Python, Go, JavaScript,.. MIT
  • GitHub ( 14 200 42 87 100 - 30% open 05.10.2022):

     git clone https://github.com/BayesWitnesses/m2cgen
    
  • PyPi ( 16K / month 3 26.04.2022):

     pip install m2cgen
    
sklearn-porter (23 1.2K ) - Transpile trained scikit-learn estimators to C, Java,.. BSD-3
  • GitHub ( 12 160 48 76 - 55% open 22.05.2022):

     git clone https://github.com/nok/sklearn-porter
    
  • PyPi ( 320 / month 1 18.12.2019):

     pip install sklearn-porter
    
hls4ml (23 760) - Machine learning on FPGAs using HLS. Apache-2
  • GitHub ( 48 290 340 - 40% open 27.01.2023):

     git clone https://github.com/fastmachinelearning/hls4ml
    
  • PyPi ( 190 / month 12.11.2021):

     pip install hls4ml
    
  • Conda ( 5.1K 12.11.2021):

     conda install -c conda-forge hls4ml
    
Show 7 hidden projects...
  • pytorch2keras (20 830 ) - PyTorch to Keras model convertor. MIT
  • Larq Compute Engine (20 220) - Highly optimized inference engine for Binarized.. Apache-2
  • OMLT (18 180) - Represent trained machine learning models as Pyomo optimization formulations. BSD-3
  • modelkit (16 140) - Toolkit for developing and maintaining ML models. MIT
  • tfdeploy (15 350 ) - Deploy tensorflow graphs for fast evaluation and export to.. BSD-3
  • backprop (13 230 ) - Backprop makes it simple to use, finetune, and deploy state-of-.. Apache-2
  • ml-ane-transformers (11 560) - Reference implementation of the Transformer.. Unlicensed

Model Interpretability

Back to top

Libraries to visualize, explain, debug, evaluate, and interpret machine learning models.

shap (38 18K ) - A game theoretic approach to explain the output of any machine learning.. MIT
  • GitHub ( 200 2.7K 8.5K 2.1K - 70% open 16.06.2022):

     git clone https://github.com/slundberg/shap
    
  • PyPi ( 6.6M / month 280 16.06.2022):

     pip install shap
    
  • Conda ( 1.8M 20.06.2022):

     conda install -c conda-forge shap
    
arviz (34 1.3K) - Exploratory analysis of Bayesian models with Python. Apache-2
  • GitHub ( 140 320 120 3.4K 800 - 21% open 01.02.2023):

     git clone https://github.com/arviz-devs/arviz
    
  • PyPi ( 1M / month 110 16.11.2022):

     pip install arviz
    
  • Conda ( 1.2M 16.11.2022):

     conda install -c conda-forge arviz
    
Netron (33 21K) - Visualizer for neural network, deep learning, and machine.. MIT
  • GitHub ( 2 2.4K 45K 10 900 - 2% open 02.02.2023):

     git clone https://github.com/lutzroeder/netron
    
  • PyPi ( 11K / month 67 04.07.2022):

     pip install netron
    
Captum (33 3.7K) - Model interpretability and understanding for PyTorch. BSD-3
  • GitHub ( 98 390 890 440 - 31% open 01.02.2023):

     git clone https://github.com/pytorch/captum
    
  • PyPi ( 120K / month 23 03.03.2022):

     pip install captum
    
  • Conda ( 5.2K 04.03.2022):

     conda install -c conda-forge captum
    
InterpretML (32 5.2K) - Fit interpretable models. Explain blackbox machine learning. MIT
  • GitHub ( 32 640 350 350 - 28% open 28.01.2023):

     git clone https://github.com/interpretml/interpret
    
  • PyPi ( 71K / month 17 23.09.2021):

     pip install interpret
    
DoWhy (31 5.6K) - DoWhy is a Python library for causal inference that supports explicit.. MIT
  • GitHub ( 69 800 31 180 340 - 31% open 02.02.2023):

     git clone https://github.com/py-why/dowhy
    
  • PyPi ( 78K / month 5 20.03.2022):

     pip install dowhy
    
  • Conda ( 12K 19.07.2022):

     conda install -c conda-forge dowhy
    
dtreeviz (31 2.4K) - A python library for decision tree visualization and model interpretation. MIT
  • GitHub ( 22 300 610 160 - 23% open 29.01.2023):

     git clone https://github.com/parrt/dtreeviz
    
  • PyPi ( 87K / month 14 29.04.2022):

     pip install dtreeviz
    
  • Conda ( 34K 29.01.2023):

     conda install -c conda-forge dtreeviz
    
Model Analysis (31 1.2K) - Model analysis tools for TensorFlow. Apache-2
  • GitHub ( 52 260 81 - 37% open 20.01.2023):

     git clone https://github.com/tensorflow/model-analysis
    
  • PyPi ( 600K / month 21 16.05.2022):

     pip install tensorflow-model-analysis
    
shapash (29 2.1K) - Shapash makes Machine Learning models transparent and.. Apache-2
  • GitHub ( 35 260 96 140 - 15% open 01.02.2023):

     git clone https://github.com/MAIF/shapash
    
  • PyPi ( 18K / month 1 09.05.2022):

     pip install shapash
    
Alibi (29 1.9K) - Algorithms for explaining machine learning models. Apache-2
  • GitHub ( 18 220 220 340 - 39% open 19.01.2023):

     git clone https://github.com/SeldonIO/alibi
    
  • PyPi ( 23K / month 24 18.05.2022):

     pip install alibi
    
fairlearn (29 1.5K) - A Python package to assess and improve fairness of machine.. MIT
  • GitHub ( 72 340 420 - 41% open 30.01.2023):

     git clone https://github.com/fairlearn/fairlearn
    
  • PyPi ( 190K / month 13 07.07.2021):

     pip install fairlearn
    
  • Conda ( 24K 02.12.2022):

     conda install -c conda-forge fairlearn
    
evaluate (29 1.1K) - Evaluate: A library for easily evaluating machine learning models.. Apache-2
  • GitHub ( 110 100 780 170 - 38% open 30.01.2023):

     git clone https://github.com/huggingface/evaluate
    
  • PyPi ( 220K / month 2 16.06.2022):

     pip install evaluate
    
responsible-ai-widgets (29 660) - Responsible AI Toolbox is a suite of tools providing.. MIT
  • GitHub ( 31 170 45 280 - 22% open 02.02.2023):

     git clone https://github.com/microsoft/responsible-ai-toolbox
    
  • PyPi ( 9K / month 3 10.06.2022):

     pip install raiwidgets
    
Fairness 360 (28 1.9K ) - A comprehensive set of fairness metrics for datasets and.. Apache-2
  • GitHub ( 63 630 210 220 - 61% open 04.11.2022):

     git clone https://github.com/Trusted-AI/AIF360
    
  • PyPi ( 8.7K / month 10 04.03.2021):

     pip install aif360
    
  • Conda ( 4.8K 04.09.2022):

     conda install -c conda-forge aif360
    
CausalNex (28 1.8K) - A Python library that helps data scientists to infer.. Apache-2
  • GitHub ( 30 210 68 120 - 13% open 17.01.2023):

     git clone https://github.com/quantumblacklabs/causalnex
    
  • PyPi ( 9.8K / month 2 11.11.2021):

     pip install causalnex
    
explainerdashboard (28 1.6K) - Quickly build Explainable AI dashboards that show the inner.. MIT
  • GitHub ( 19 200 220 190 - 8% open 02.01.2023):

     git clone https://github.com/oegedijk/explainerdashboard
    
  • PyPi ( 32K / month 4 15.06.2022):

     pip install explainerdashboard
    
  • Conda ( 29K 15.02.2022):

     conda install -c conda-forge explainerdashboard
    
yellowbrick (27 3.9K) - Visual analysis and diagnostic tools to facilitate machine.. Apache-2
  • GitHub ( 110 520 680 - 12% open 13.01.2023):

     git clone https://github.com/DistrictDataLabs/yellowbrick
    
  • PyPi ( 860K / month 67 19.02.2022):

     pip install yellowbrick
    
  • Conda ( 51K 22.08.2022):

     conda install -c conda-forge yellowbrick
    
LIT (26 3.1K) - The Learning Interpretability Tool: Interactively analyze ML models to.. Apache-2
  • GitHub ( 27 320 12 130 - 45% open 02.12.2022):

     git clone https://github.com/PAIR-code/lit
    
  • PyPi ( 1K / month 1 21.12.2021):

     pip install lit-nlp
    
  • Conda ( 51K 09.11.2021):

     conda install -c conda-forge lit-nlp
    
checklist (25 1.8K) - Beyond Accuracy: Behavioral Testing of NLP models with CheckList. MIT
  • GitHub ( 13 180 200 100 - 22% open 12.08.2022):

     git clone https://github.com/marcotcr/checklist
    
  • PyPi ( 5.1K / month 6 24.05.2021):

     pip install checklist
    
  • Conda ( 4.9K 15.07.2021):

     conda install -c conda-forge checklist
    
iNNvestigate (25 1.1K) - A toolbox to iNNvestigate neural networks predictions!. BSD-2
  • GitHub ( 19 230 43 90 250 - 20% open 31.01.2023):

     git clone https://github.com/albermax/innvestigate
    
  • PyPi ( 980 / month 2 14.11.2020):

     pip install innvestigate
    
imodels (25 1K) - Interpretable ML package for concise, transparent, and accurate predictive.. MIT
  • GitHub ( 15 97 39 56 - 35% open 26.01.2023):

     git clone https://github.com/csinva/imodels
    
  • PyPi ( 37K / month 2 03.07.2022):

     pip install imodels
    
Explainability 360 (23 1.2K) - Interpretability and explainability of data and.. Apache-2
  • GitHub ( 39 260 69 82 - 64% open 03.11.2022):

     git clone https://github.com/Trusted-AI/AIX360
    
  • PyPi ( 1.1K / month 1 28.10.2020):

     pip install aix360
    
keract (22 1K) - Layers Outputs and Gradients in Keras. Made easy. MIT
  • GitHub ( 16 180 160 89 - 3% open 03.11.2022):

     git clone https://github.com/philipperemy/keract
    
  • PyPi ( 3.9K / month 6 19.06.2021):

     pip install keract
    
sklearn-evaluation (22 360 ) - Machine learning model evaluation made easy: plots,.. MIT
  • GitHub ( 19 41 13.01.2023):

     git clone https://github.com/edublancas/sklearn-evaluation
    
  • PyPi ( 28K / month 3 05.07.2022):

     pip install sklearn-evaluation
    
tf-explain (21 960 ) - Interpretability Methods for tf.keras models with Tensorflow.. MIT
  • GitHub ( 18 110 150 92 - 44% open 30.06.2022):

     git clone https://github.com/sicara/tf-explain
    
  • PyPi ( 1.9K / month 7 18.11.2021):

     pip install tf-explain
    
aequitas (21 520) - Bias and Fairness Audit Toolkit. MIT
  • GitHub ( 17 93 120 68 - 69% open 12.12.2022):

     git clone https://github.com/dssg/aequitas
    
  • PyPi ( 1.1K / month 6 16.12.2020):

     pip install aequitas
    
DiCE (20 1K) - Generate Diverse Counterfactual Explanations for any machine learning.. MIT
  • GitHub ( 16 140 140 - 36% open 12.12.2022):

     git clone https://github.com/interpretml/DiCE
    
  • PyPi ( 58K / month 4 02.06.2022):

     pip install dice-ml
    
model-card-toolkit (19 360) - A toolkit that streamlines and automates the generation.. Apache-2
  • GitHub ( 20 75 12 24 - 41% open 18.01.2023):

     git clone https://github.com/tensorflow/model-card-toolkit
    
  • PyPi ( 690 / month 28.04.2022):

     pip install model-card-toolkit
    
LOFO (17 700) - Leave One Feature Out Importance. MIT
  • GitHub ( 4 72 23 22 - 18% open 08.12.2022):

     git clone https://github.com/aerdem4/lofo-importance
    
  • PyPi ( 1.4K / month 2 27.04.2022):

     pip install lofo-importance
    
Anchor (16 740 ) - Code for High-Precision Model-Agnostic Explanations paper. BSD-2
  • GitHub ( 10 100 74 - 31% open 19.07.2022):

     git clone https://github.com/marcotcr/anchor
    
  • PyPi ( 1.8K / month 2 10.09.2020):

     pip install anchor_exp
    
ExplainX.ai (15 330) - Explainable AI framework for data scientists. Explain & debug any.. MIT
  • GitHub ( 4 42 7 27 - 37% open 15.09.2022):

     git clone https://github.com/explainX/explainx
    
  • PyPi ( 2.4K / month 04.02.2021):

     pip install explainx
    
Show 23 hidden projects...
  • Lime (31 10K ) - Lime: Explaining the predictions of any machine learning classifier. BSD-2
  • pyLDAvis (31 1.7K ) - Python library for interactive topic model visualization... BSD-3
  • Deep Checks (29 2.4K) - Tests for Continuous Validation of ML Models & Data... AGPL-3.0
  • scikit-plot (28 2.3K ) - An intuitive library to add plotting functionality to.. MIT
  • Lucid (26 4.5K ) - A collection of infrastructure and tools for research in.. Apache-2
  • eli5 (26 2.6K ) - A library for debugging/inspecting machine learning classifiers and.. MIT
  • DALEX (26 1.2K) - moDel Agnostic Language for Exploration and eXplanation. GPL-3.0
  • keras-vis (25 2.9K ) - Neural network visualization toolkit for keras. MIT
  • What-If Tool (22 780 ) - Source code/webpage/demos for the What-If Tool. Apache-2
  • TreeInterpreter (22 720 ) - Package for interpreting scikit-learns decision tree.. BSD-3
  • random-forest-importances (22 540 ) - Code to compute permutation and drop-column.. MIT
  • Skater (21 1.1K ) - Python Library for Model Interpretation/Explanations. UPL-1.0
  • deeplift (21 700 ) - Public facing deeplift repo. MIT
  • Quantus (21 280) - Quantus is an eXplainable AI toolkit for responsible evaluation of.. LGPL-3.0
  • ecco (19 1.6K ) - Explain, analyze, and visualize NLP language models. Ecco creates.. BSD-3
  • fairness-indicators (19 290) - Tensorflows Fairness Evaluation and Visualization.. Apache-2
  • tcav (18 560 ) - Code for the TCAV ML interpretability project. Apache-2
  • XAI (17 880 ) - XAI - An eXplainability toolbox for machine learning. MIT
  • FlashTorch (16 700 ) - Visualization toolkit for neural networks in PyTorch! Demo --. MIT
  • interpret-text (14 360 ) - A library that incorporates state-of-the-art explainers.. MIT
  • contextual-ai (12 82 ) - Contextual AI adds explainability to different stages of.. Apache-2
  • Attribution Priors (11 110 ) - Tools for training explainable models using.. MIT
  • bias-detector (11 40) - Bias Detector is a python package for detecting bias in machine.. MIT

Vector Similarity Search (ANN)

Back to top

Libraries for Approximate Nearest Neighbor Search and Vector Indexing/Similarity Search.

 ANN Benchmarks ( 3.2K) - Benchmarks of approximate nearest neighbor libraries in Python.

Milvus (38 15K) - Vector database for scalable similarity search and AI applications. Apache-2
  • GitHub ( 230 1.9K 28K 7K - 4% open 02.02.2023):

     git clone https://github.com/milvus-io/milvus
    
  • PyPi ( 240K / month 17 02.04.2022):

     pip install pymilvus
    
  • Docker Hub ( 2.7M 23 02.02.2023):

     docker pull milvusdb/milvus
    
Faiss (37 19K) - A library for efficient similarity search and clustering of dense vectors. MIT
  • GitHub ( 120 2.8K 890 2K - 13% open 31.01.2023):

     git clone https://github.com/facebookresearch/faiss
    
  • PyPi ( 240K / month 17 02.04.2022):

     pip install pymilvus
    
  • Conda ( 630K 10.01.2023):

     conda install -c conda-forge faiss
    
Annoy (33 11K) - Approximate Nearest Neighbors in C++/Python optimized for memory usage.. Apache-2
  • GitHub ( 82 1.1K 2.5K 370 - 12% open 27.10.2022):

     git clone https://github.com/spotify/annoy
    
  • PyPi ( 1.2M / month 240 18.09.2020):

     pip install annoy
    
  • Conda ( 300K 31.10.2022):

     conda install -c conda-forge python-annoy
    
NMSLIB (29 2.9K ) - Non-Metric Space Library (NMSLIB): An efficient similarity search.. Apache-2
  • GitHub ( 48 410 750 410 - 15% open 31.05.2022):

     git clone https://github.com/nmslib/nmslib
    
  • PyPi ( 200K / month 52 03.02.2021):

     pip install nmslib
    
  • Conda ( 77K 30.10.2022):

     conda install -c conda-forge nmslib
    
hnswlib (28 2.3K ) - Header-only C++/python library for fast approximate nearest.. Apache-2
  • GitHub ( 56 420 350 280 - 54% open 16.04.2022):

     git clone https://github.com/nmslib/hnswlib
    
  • PyPi ( 330K / month 28 14.02.2022):

     pip install hnswlib
    
  • Conda ( 69K 01.11.2022):

     conda install -c conda-forge hnswlib
    
PyNNDescent (28 710) - A Python nearest neighbor descent for approximate nearest neighbors. BSD-2
  • GitHub ( 24 91 2.8K 110 - 48% open 03.01.2023):

     git clone https://github.com/lmcinnes/pynndescent
    
  • PyPi ( 730K / month 26 14.05.2022):

     pip install pynndescent
    
  • Conda ( 1.2M 01.11.2022):

     conda install -c conda-forge pynndescent
    
NGT (22 960) - Nearest Neighbor Search with Neighborhood Graph and Tree for High-.. Apache-2
  • GitHub ( 14 98 110 - 13% open 06.01.2023):

     git clone https://github.com/yahoojapan/NGT
    
  • PyPi ( 15K / month 8 20.06.2022):

     pip install ngt
    
NearPy (21 730) - Python framework for fast (approximated) nearest neighbour search in large,.. MIT
  • GitHub ( 19 140 79 63 - 38% open 22.01.2023):

     git clone https://github.com/pixelogik/NearPy
    
  • PyPi ( 1.8K / month 16 27.09.2016):

     pip install NearPy
    
Show 3 hidden projects...
  • Magnitude (23 1.6K ) - A fast, efficient universal vector embedding utility package. MIT
  • N2 (19 540 ) - TOROS N2 - lightweight approximate Nearest Neighbor library which runs.. Apache-2
  • PySparNN (11 900 ) - Approximate Nearest Neighbor Search for Sparse Data in Python!. BSD-3

Probabilistics & Statistics

Back to top

Libraries providing capabilities for probabilistic programming/reasoning, bayesian inference, gaussian processes, or statistics.

PyMC3 (39 7.3K) - Probabilistic Programming in Python: Bayesian Modeling and.. Apache-2
  • GitHub ( 440 1.7K 1.9K 1.1K 3K - 6% open 01.02.2023):

     git clone https://github.com/pymc-devs/pymc
    
  • PyPi ( 340K / month 240 15.03.2022):

     pip install pymc3
    
  • Conda ( 490K 20.05.2022):

     conda install -c conda-forge pymc3
    
tensorflow-probability (37 3.9K) - Probabilistic reasoning and statistical analysis in.. Apache-2
  • GitHub ( 470 1K 1.3K - 46% open 31.01.2023):

     git clone https://github.com/tensorflow/probability
    
  • PyPi ( 850K / month 340 07.06.2022):

     pip install tensorflow-probability
    
  • Conda ( 90K 07.12.2022):

     conda install -c conda-forge tensorflow-probability
    
Pyro (34 7.8K) - Deep universal probabilistic programming with Python and PyTorch. Apache-2
  • GitHub ( 130 950 1K 1K - 22% open 27.01.2023):

     git clone https://github.com/pyro-ppl/pyro
    
  • PyPi ( 320K / month 60 24.03.2022):

     pip install pyro-ppl
    
  • Conda ( 42K 04.01.2023):

     conda install -c conda-forge pyro-ppl
    
GPyTorch (33 3K) - A highly efficient and modular implementation of Gaussian Processes.. MIT
  • GitHub ( 110 470 960 1.2K - 25% open 01.02.2023):

     git clone https://github.com/cornellius-gp/gpytorch
    
  • PyPi ( 190K / month 41 27.06.2022):

     pip install gpytorch
    
  • Conda ( 74K 06.01.2023):

     conda install -c conda-forge gpytorch
    
hmmlearn (32 2.7K) - Hidden Markov Models in Python, with scikit-learn like API. BSD-3
  • GitHub ( 42 710 1.6K 410 - 13% open 10.01.2023):

     git clone https://github.com/hmmlearn/hmmlearn
    
  • PyPi ( 110K / month 130 10.02.2022):

     pip install hmmlearn
    
  • Conda ( 160K 05.11.2022):

     conda install -c conda-forge hmmlearn
    
GPflow (32 1.7K) - Gaussian processes in TensorFlow. Apache-2
  • GitHub ( 81 440 460 810 - 17% open 27.01.2023):

     git clone https://github.com/GPflow/GPflow
    
  • PyPi ( 60K / month 29 10.05.2022):

     pip install gpflow
    
  • Conda ( 19K 24.05.2022):

     conda install -c conda-forge gpflow
    
pgmpy (31 2.3K) - Python Library for learning (Structure and Parameter), inference.. MIT
  • GitHub ( 110 660 190 510 820 - 28% open 28.01.2023):

     git clone https://github.com/pgmpy/pgmpy
    
  • PyPi ( 63K / month 11 30.06.2022):

     pip install pgmpy
    
emcee (31 1.3K) - The Python ensemble sampling toolkit for affine-invariant MCMC. MIT
  • GitHub ( 67 430 1.7K 280 - 17% open 27.01.2023):

     git clone https://github.com/dfm/emcee
    
  • PyPi ( 110K / month 310 10.05.2022):

     pip install emcee
    
  • Conda ( 270K 27.01.2023):

     conda install -c conda-forge emcee
    
pandas-ta (30 3.3K) - Technical Analysis Indicators - Pandas TA is an easy to use.. MIT
  • GitHub ( 45 710 1.1K 480 - 21% open 24.09.2022):

     git clone https://github.com/twopirllc/pandas-ta
    
  • PyPi ( 75K / month 19 28.07.2021):

     pip install pandas-ta
    
  • Conda ( 6.4K 05.10.2021):

     conda install -c conda-forge pandas-ta
    
filterpy (30 2.6K) - Python Kalman filtering and optimal estimation library. Implements.. MIT
  • GitHub ( 43 540 2.2K 210 - 25% open 22.08.2022):

     git clone https://github.com/rlabbe/filterpy
    
  • PyPi ( 1.3M / month 130 10.10.2018):

     pip install filterpy
    
  • Conda ( 200K 05.05.2020):

     conda install -c conda-forge filterpy
    
patsy (30 870) - Describing statistical models in Python using symbolic formulas. BSD-2
  • GitHub ( 17 100 65K 140 - 48% open 09.10.2022):

     git clone https://github.com/pydata/patsy
    
  • PyPi ( 7.9M / month 2.7K 26.09.2021):

     pip install patsy
    
  • Conda ( 7M 09.10.2022):

     conda install -c conda-forge patsy
    
SALib (30 680) - Sensitivity Analysis Library in Python. Contains Sobol, Morris, FAST, and.. MIT
  • GitHub ( 39 200 300 - 13% open 10.01.2023):

     git clone https://github.com/SALib/SALib
    
  • PyPi ( 140K / month 65 08.01.2023):

     pip install salib
    
  • Conda ( 110K 08.01.2023):

     conda install -c conda-forge salib
    
pomegranate (27 3K) - Fast, flexible and easy to use probabilistic modelling in Python. MIT
  • GitHub ( 66 550 830 690 - 10% open 29.11.2022):

     git clone https://github.com/jmschrei/pomegranate
    
  • PyPi ( 39K / month 46 21.02.2022):

     pip install pomegranate
    
  • Conda ( 110K 19.09.2022):

     conda install -c conda-forge pomegranate
    
Orbit (26 1.6K) - A Python package for Bayesian forecasting with object-oriented design.. Apache-2
  • GitHub ( 18 120 13 380 - 14% open 29.01.2023):

     git clone https://github.com/uber/orbit
    
  • PyPi ( 120K / month 1 28.04.2022):

     pip install orbit-ml
    
bambi (26 880) - BAyesian Model-Building Interface (Bambi) in Python. MIT
  • GitHub ( 27 94 48 320 - 21% open 02.02.2023):

     git clone https://github.com/bambinos/bambi
    
  • PyPi ( 3.4K / month 3 06.06.2022):

     pip install bambi
    
  • Conda ( 17K 21.12.2022):

     conda install -c conda-forge bambi
    
Baal (21 690) - Library to enable Bayesian active learning in your research or labeling.. Apache-2
  • GitHub ( 19 65 25 92 - 23% open 28.01.2023):

     git clone https://github.com/baal-org/baal
    
  • PyPi ( 2.1K / month 1 03.05.2022):

     pip install baal
    
  • Conda ( 4.5K 31.10.2022):

     conda install -c conda-forge baal
    
Show 7 hidden projects...
  • pingouin (31 1.3K) - Statistical package in Python based on Pandas. GPL-3.0
  • Edward (28 4.8K ) - A probabilistic programming language in TensorFlow. Deep.. Apache-2
  • PyStan (25 240 ) - PyStan, a Python interface to Stan, a platform for statistical.. ISC
  • pyhsmm (20 530 ) - Bayesian inference in HSMMs and HMMs. MIT
  • scikit-posthocs (20 270) - Multiple Pairwise Comparisons (Post Hoc) Tests in Python. MIT
  • Funsor (20 210) - Functional tensors for probabilistic programming. Apache-2
  • ZhuSuan (16 2.2K ) - A probabilistic programming library for Bayesian deep learning,.. MIT

Adversarial Robustness

Back to top

Libraries for testing the robustness of machine learning models against attacks with adversarial/malicious examples.

ART (34 3.4K) - Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning.. MIT
  • GitHub ( 120 920 300 760 - 13% open 02.02.2023):

     git clone https://github.com/Trusted-AI/adversarial-robustness-toolbox
    
  • PyPi ( 38K / month 7 01.07.2022):

     pip install adversarial-robustness-toolbox
    
  • Conda ( 21K 18.12.2022):

     conda install -c conda-forge adversarial-robustness-toolbox
    
CleverHans (29 5.7K) - An adversarial example library for constructing attacks,.. MIT
  • GitHub ( 130 1.4K 420 460 - 8% open 31.01.2023):

     git clone https://github.com/cleverhans-lab/cleverhans
    
  • PyPi ( 1.1K / month 11 24.07.2021):

     pip install cleverhans
    
  • Conda ( 5.2K 29.07.2021):

     conda install -c conda-forge cleverhans
    
TextAttack (28 2.2K) - TextAttack is a Python framework for adversarial attacks, data.. MIT
  • GitHub ( 58 290 120 230 - 9% open 21.12.2022):

     git clone https://github.com/QData/TextAttack
    
  • PyPi ( 4.7K / month 7 25.05.2022):

     pip install textattack
    
  • Conda ( 4.9K 29.06.2021):

     conda install -c conda-forge textattack
    
Foolbox (27 2.4K ) - A Python toolbox to create adversarial examples that fool neural.. MIT
  • GitHub ( 32 410 370 360 - 8% open 25.05.2022):

     git clone https://github.com/bethgelab/foolbox
    
  • PyPi ( 2.3K / month 13 02.04.2022):

     pip install foolbox
    
  • Conda ( 9.3K 30.04.2021):

     conda install -c conda-forge foolbox
    
AdvBox (18 1.3K) - Advbox is a toolbox to generate adversarial examples that fool neural.. Apache-2
  • GitHub ( 19 250 38 - 21% open 08.08.2022):

     git clone https://github.com/advboxes/AdvBox
    
  • PyPi ( 14 / month 05.12.2018):

     pip install advbox
    
robustness (18 770 ) - A library for experimenting with, training and evaluating neural.. MIT
  • GitHub ( 13 150 100 77 - 27% open 14.02.2022):

     git clone https://github.com/MadryLab/robustness
    
  • PyPi ( 620 / month 2 01.12.2020):

     pip install robustness
    
  • Conda ( 5.7K 30.04.2021):

     conda install -c conda-forge robustness
    
Show 3 hidden projects...
  • advertorch (22 1.1K ) - A Toolbox for Adversarial Robustness Research. GPL-3.0
  • textflint (16 590 ) - Unified Multilingual Robustness Evaluation Toolkit for.. GPL-3.0
  • Adversary (14 370 ) - Tool to generate adversarial text examples and test machine.. MIT

GPU & Accelerator Utilities

Back to top

Libraries that require and make use of CUDA/GPU or other accelerator hardware capabilities to optimize machine learning tasks.

CuPy (38 6.6K ) - NumPy & SciPy for GPU. MIT
  • GitHub ( 320 650 55K 1.4K 1.9K - 24% open 02.02.2023):

     git clone https://github.com/cupy/cupy
    
  • PyPi ( 16K / month 160 30.06.2022):

     pip install cupy
    
  • Conda ( 2.3M 20.01.2023):

     conda install -c conda-forge cupy
    
  • Docker Hub ( 56K 8 19.01.2023):

     docker pull cupy/cupy
    
PyCUDA (32 1.5K) - CUDA integration for Python, plus shiny features. MIT
  • GitHub ( 79 260 1.8K 250 - 31% open 10.01.2023):

     git clone https://github.com/inducer/pycuda
    
  • PyPi ( 33K / month 200 24.06.2022):

     pip install pycuda
    
  • Conda ( 160K 21.12.2022):

     conda install -c conda-forge pycuda
    
cuDF (31 5.3K) - cuDF - GPU DataFrame Library. Apache-2
  • GitHub ( 260 660 5.2K - 13% open 02.02.2023):

     git clone https://github.com/rapidsai/cudf
    
  • PyPi ( 2.1K / month 6 01.06.2020):

     pip install cudf
    
gpustat (30 3.2K) - A simple command-line utility for querying and monitoring GPU status. MIT
  • GitHub ( 17 250 2.6K 100 - 22% open 15.12.2022):

     git clone https://github.com/wookayin/gpustat
    
  • PyPi ( 820K / month 120 05.07.2022):

     pip install gpustat
    
  • Conda ( 160K 11.10.2022):

     conda install -c conda-forge gpustat
    
cuML (30 3.1K) - cuML - RAPIDS Machine Learning Library. Apache-2
  • GitHub ( 160 440 1 2.2K - 32% open 02.02.2023):

     git clone https://github.com/rapidsai/cuml
    
  • PyPi ( 1.4K / month 4 01.06.2020):

     pip install cuml
    
Apex (29 6.9K) - A PyTorch Extension: Tools for easy mixed precision and distributed.. BSD-3
  • GitHub ( 100 1.1K 1.5K 1.1K - 55% open 31.01.2023):

     git clone https://github.com/NVIDIA/apex
    
  • Conda ( 130K 28.12.2022):

     conda install -c conda-forge nvidia-apex
    
ArrayFire (29 4K) - ArrayFire: a general purpose GPU library. BSD-3
  • GitHub ( 88 500 3.2K 1.6K - 16% open 01.02.2023):

     git clone https://github.com/arrayfire/arrayfire
    
  • PyPi ( 2.6K / month 5 22.02.2022):

     pip install arrayfire
    
optimum (27 900) - Accelerate training and inference of Transformers with easy to use.. Apache-2
  • GitHub ( 49 120 74 250 - 41% open 02.02.2023):

     git clone https://github.com/huggingface/optimum
    
  • PyPi ( 45K / month 3 13.06.2022):

     pip install optimum
    
  • Conda ( 5.6K 12.07.2022):

     conda install -c conda-forge optimum
    
cuGraph (26 1.2K) - cuGraph - RAPIDS Graph Analytics Library. Apache-2
  • GitHub ( 93 220 2 1.3K - 19% open 01.02.2023):

     git clone https://github.com/rapidsai/cugraph
    
  • PyPi ( 500 / month 1 01.06.2020):

     pip install cugraph
    
  • Conda ( 12K 29.04.2021):

     conda install -c conda-forge libcugraph
    
DALI (25 4.2K) - A GPU-accelerated library containing highly optimized building blocks.. Apache-2
  • GitHub ( 81 540 1.3K - 17% open 02.02.2023):

     git clone https://github.com/NVIDIA/DALI
    
Merlin (25 440) - NVIDIA Merlin is an open source library providing end-to-end GPU-.. Apache-2
  • GitHub ( 27 68 320 - 47% open 27.01.2023):

     git clone https://github.com/NVIDIA-Merlin/Merlin
    
  • PyPi ( 19K / month 14.06.2022):

     pip install merlin-core
    
scikit-cuda (24 920 ) - Python interface to GPU-powered libraries. BSD-3
  • GitHub ( 46 170 220 220 - 22% open 31.03.2022):

     git clone https://github.com/lebedov/scikit-cuda
    
  • PyPi ( 600 / month 44 27.05.2019):

     pip install scikit-cuda
    
Vulkan Kompute (22 1.1K) - General purpose GPU compute framework built on Vulkan to.. Apache-2
  • GitHub ( 21 75 230 5 190 - 33% open 03.12.2022):

     git clone https://github.com/KomputeProject/kompute
    
  • PyPi ( 96 / month 13.04.2022):

     pip install kp
    
cuSignal (19 650) - GPU accelerated signal processing. Apache-2
  • GitHub ( 42 100 140 - 11% open 01.02.2023):

     git clone https://github.com/rapidsai/cusignal
    
Show 6 hidden projects...
  • GPUtil (22 940 ) - A Python module for getting the GPU status from NVIDA GPUs using.. MIT
  • BlazingSQL (21 1.8K ) - BlazingSQL is a lightweight, GPU accelerated, SQL engine for.. Apache-2
  • py3nvml (21 210 ) - Python 3 Bindings for NVML library. Get NVIDIA GPU status inside.. BSD-3
  • nvidia-ml-py3 (20 95 ) - Python 3 Bindings for the NVIDIA Management Library. BSD-3
  • SpeedTorch (15 660 ) - Library for faster pinned CPU - GPU transfer in Pytorch. MIT
  • ipyexperiments (15 150 ) - jupyter/ipython experiment containers for GPU and.. Apache-2

Tensorflow Utilities

Back to top

Libraries that extend TensorFlow with additional capabilities.

TensorFlow Datasets (36 3.7K) - TFDS is a collection of datasets ready to use with.. Apache-2
  • GitHub ( 280 1.4K 1.3K - 51% open 31.01.2023):

     git clone https://github.com/tensorflow/datasets
    
  • PyPi ( 1.1M / month 170 02.06.2022):

     pip install tensorflow-datasets
    
  • Conda ( 13K 19.01.2023):

     conda install -c conda-forge tensorflow-datasets
    
tensorflow-hub (36 3.3K) - A library for transfer learning by reusing parts of.. Apache-2
  • GitHub ( 100 1.7K 15K 660 - 0% open 31.01.2023):

     git clone https://github.com/tensorflow/hub
    
  • PyPi ( 4.7M / month 300 14.04.2021):

     pip install tensorflow-hub
    
  • Conda ( 75K 18.04.2021):

     conda install -c conda-forge tensorflow-hub
    
TF Addons (35 1.6K) - Useful extra functionality for TensorFlow 2.x maintained by.. Apache-2
  • GitHub ( 200 590 8.6K 980 - 25% open 14.12.2022):

     git clone https://github.com/tensorflow/addons
    
  • PyPi ( 1.7M / month 180 14.06.2022):

     pip install tensorflow-addons
    
tensor2tensor (33 13K) - Library of deep learning models and datasets designed to.. Apache-2
  • GitHub ( 240 3.2K 1.3K 1.3K - 46% open 19.01.2023):

     git clone https://github.com/tensorflow/tensor2tensor
    
  • PyPi ( 37K / month 93 17.06.2020):

     pip install tensor2tensor
    
TensorFlow Transform (33 940) - Input pipeline framework. Apache-2
  • GitHub ( 27 200 1.2K 200 - 18% open 01.02.2023):

     git clone https://github.com/tensorflow/transform
    
  • PyPi ( 4M / month 56 29.06.2022):

     pip install tensorflow-transform
    
TFX (32 1.9K) - TFX is an end-to-end platform for deploying production ML pipelines. Apache-2
  • GitHub ( 160 630 880 - 23% open 01.02.2023):

     git clone https://github.com/tensorflow/tfx
    
  • PyPi ( 430K / month 13 26.05.2022):

     pip install tfx
    
TF Model Optimization (30 1.4K) - A toolkit to optimize ML models for deployment for.. Apache-2
  • GitHub ( 74 300 2.5K 340 - 54% open 19.12.2022):

     git clone https://github.com/tensorflow/model-optimization
    
  • PyPi ( 140K / month 21 18.03.2022):

     pip install tensorflow-model-optimization
    
Keras-Preprocessing (29 1K ) - Utilities for working with image data, text data, and.. MIT
  • GitHub ( 52 460 200 - 47% open 17.02.2022):

     git clone https://github.com/keras-team/keras-preprocessing
    
  • PyPi ( 7.2M / month 1.5K 14.05.2020):

     pip install keras-preprocessing
    
  • Conda ( 1.7M 15.01.2021):

     conda install -c conda-forge keras-preprocessing
    
TensorFlow I/O (29 610) - Dataset, streaming, and file system extensions.. Apache-2
  • GitHub ( 96 250 580 - 39% open 18.01.2023):

     git clone https://github.com/tensorflow/io
    
  • PyPi ( 640K / month 25 18.05.2022):

     pip install tensorflow-io
    
Neural Structured Learning (27 960) - Training neural models with structured signals. Apache-2
  • GitHub ( 37 180 300 70 - 4% open 11.01.2023):

     git clone https://github.com/tensorflow/neural-structured-learning
    
  • PyPi ( 18K / month 3 18.08.2020):

     pip install neural-structured-learning
    
TensorFlow Cloud (25 340) - The TensorFlow Cloud repository provides APIs that.. Apache-2
  • GitHub ( 27 75 220 87 - 68% open 22.12.2022):

     git clone https://github.com/tensorflow/cloud
    
  • PyPi ( 82K / month 1 17.06.2021):

     pip install tensorflow-cloud
    
Saliency (22 860 ) - Framework-agnostic implementation for state-of-the-art.. Apache-2
  • GitHub ( 16 180 51 31 - 12% open 13.05.2022):

     git clone https://github.com/PAIR-code/saliency
    
  • PyPi ( 3.1K / month 3 14.06.2022):

     pip install saliency
    
TF Compression (21 710) - Data compression in TensorFlow. Apache-2
  • GitHub ( 19 230 94 - 4% open 24.01.2023):

     git clone https://github.com/tensorflow/compression
    
  • PyPi ( 5.8K / month 1 30.05.2022):

     pip install tensorflow-compression
    
Show 3 hidden projects...
  • efficientnet (25 2K ) - Implementation of EfficientNet model. Keras and.. Apache-2
  • TensorNets (20 1K ) - High level network definitions with pre-trained weights in.. MIT
  • tffm (20 780 ) - TensorFlow implementation of an arbitrary order Factorization Machine. MIT

Jax Utilities

Back to top

Libraries that extend Jax with additional capabilities.

equinox (23 950) - Callable PyTrees and filtered transforms = neural networks in.. Apache-2
  • GitHub ( 16 53 83 130 - 21% open 01.12.2022):

     git clone https://github.com/patrick-kidger/equinox
    
  • PyPi ( 12K / month 6 05.07.2022):

     pip install equinox
    
evojax (19 610) - EvoJAX: Hardware-accelerated Neuroevolution. Apache-2
  • GitHub ( 12 48 2 22 - 31% open 21.12.2022):

     git clone https://github.com/google/evojax
    
  • PyPi ( 800 / month 15.06.2022):

     pip install evojax
    
  • Conda ( 13K 05.10.2022):

     conda install -c conda-forge evojax
    
Show 1 hidden projects...
  • jaxdf (13 65) - A JAX-based research framework for writing differentiable.. LGPL-3.0

Sklearn Utilities

Back to top

Libraries that extend scikit-learn with additional capabilities.

imbalanced-learn (35 6.2K) - A Python Package to Tackle the Curse of Imbalanced.. MIT
  • GitHub ( 71 1.2K 15K 540 - 9% open 28.12.2022):

     git clone https://github.com/scikit-learn-contrib/imbalanced-learn
    
  • PyPi ( 3.2M / month 270 16.05.2022):

     pip install imbalanced-learn
    
  • Conda ( 330K 28.12.2022):

     conda install -c conda-forge imbalanced-learn
    
MLxtend (35 4.2K) - A library of extension and helper modules for Pythons data.. BSD-3
  • GitHub ( 92 790 8K 450 - 28% open 31.01.2023):

     git clone https://github.com/rasbt/mlxtend
    
  • PyPi ( 1.3M / month 160 27.05.2022):

     pip install mlxtend
    
  • Conda ( 250K 17.09.2022):

     conda install -c conda-forge mlxtend
    
category_encoders (34 2.2K) - A library of sklearn compatible categorical variable.. BSD-3
  • GitHub ( 60 380 4.5K 260 - 15% open 24.01.2023):

     git clone https://github.com/scikit-learn-contrib/category_encoders
    
  • PyPi ( 1.1M / month 120 14.01.2023):

     pip install category_encoders
    
  • Conda ( 200K 17.01.2023):

     conda install -c conda-forge category_encoders
    
scikit-learn-intelex (32 880) - Intel(R) Extension for Scikit-learn is a seamless way.. Apache-2
  • GitHub ( 65 140 4.5K 200 - 38% open 31.01.2023):

     git clone https://github.com/intel/scikit-learn-intelex
    
  • PyPi ( 120K / month 6 16.06.2022):

     pip install scikit-learn-intelex
    
  • Conda ( 120K 31.01.2023):

     conda install -c conda-forge scikit-learn-intelex
    
scikit-multilearn (27 810 ) - A scikit-learn based module for multi-label et. al... BSD-2
  • GitHub ( 17 150 980 200 - 53% open 09.07.2022):

     git clone https://github.com/scikit-multilearn/scikit-multilearn
    
  • PyPi ( 83K / month 18 10.12.2018):

     pip install scikit-multilearn
    
scikit-lego (24 960) - Extra blocks for scikit-learn pipelines. MIT
  • GitHub ( 56 99 80 260 - 9% open 21.12.2022):

     git clone https://github.com/koaning/scikit-lego
    
  • PyPi ( 20K / month 7 05.06.2022):

     pip install scikit-lego
    
  • Conda ( 31K 03.11.2022):

     conda install -c conda-forge scikit-lego
    
scikit-opt (23 3.8K ) - Genetic Algorithm, Particle Swarm Optimization, Simulated.. MIT
  • GitHub ( 16 840 100 160 - 32% open 15.07.2022):

     git clone https://github.com/guofei9987/scikit-opt
    
  • PyPi ( 1.9K / month 6 14.01.2022):

     pip install scikit-opt
    
combo (21 610) - (AAAI 20) A Python Toolbox for Machine Learning Model Combination. BSD-2 xgboost
  • GitHub ( 2 100 520 14 - 78% open 14.01.2023):

     git clone https://github.com/yzhao062/combo
    
  • PyPi ( 34K / month 4 02.04.2022):

     pip install combo
    
iterative-stratification (20 750 ) - scikit-learn cross validators for iterative.. BSD-3
  • GitHub ( 7 68 270 24 - 16% open 06.06.2022):

     git clone https://github.com/trent-b/iterative-stratification
    
  • PyPi ( 50K / month 10 03.10.2021):

     pip install iterative-stratification
    
DESlib (19 430 ) - A Python library for dynamic classifier and ensemble selection. BSD-3
  • GitHub ( 14 73 36 150 - 10% open 07.06.2022):

     git clone https://github.com/scikit-learn-contrib/DESlib
    
  • PyPi ( 3.4K / month 2 08.02.2021):

     pip install deslib
    
scikit-tda (17 380 ) - Topological Data Analysis for Python. MIT
  • GitHub ( 4 46 41 19 - 78% open 13.03.2022):

     git clone https://github.com/scikit-tda/scikit-tda
    
  • PyPi ( 2.6K / month 03.08.2021):

     pip install scikit-tda
    
Show 8 hidden projects...

Pytorch Utilities

Back to top

Libraries that extend Pytorch with additional capabilities.

accelerate (34 3.6K) - A simple way to train and use PyTorch models with multi-.. Apache-2
  • GitHub ( 85 320 2.6K 500 - 13% open 01.02.2023):

     git clone https://github.com/huggingface/accelerate
    
  • PyPi ( 1.5M / month 38 15.06.2022):

     pip install accelerate
    
  • Conda ( 17K 01.02.2023):

     conda install -c conda-forge accelerate
    
PML (32 5K) - The easiest way to use deep metric learning in your application. Modular,.. MIT
  • GitHub ( 30 600 460 420 - 9% open 30.01.2023):

     git clone https://github.com/KevinMusgrave/pytorch-metric-learning
    
  • PyPi ( 130K / month 12 29.06.2022):

     pip install pytorch-metric-learning
    
  • Conda ( 9.5K 01.11.2022):

     conda install -c metric-learning pytorch-metric-learning
    
torchdiffeq (29 4.4K) - Differentiable ODE solvers with full GPU support and.. MIT
  • GitHub ( 21 800 770 190 - 25% open 29.01.2023):

     git clone https://github.com/rtqichen/torchdiffeq
    
  • PyPi ( 660K / month 24 22.04.2022):

     pip install torchdiffeq
    
  • Conda ( 9.5K 03.06.2021):

     conda install -c conda-forge torchdiffeq
    
lightning-flash (27 1.6K) - Your PyTorch AI Factory - Flash enables you to easily.. Apache-2
  • GitHub ( 82 190 170 510 - 5% open 26.01.2023):

     git clone https://github.com/Lightning-AI/lightning-flash
    
  • PyPi ( 2.6K / month 3 11.05.2022):

     pip install lightning-flash
    
  • Conda ( 9.8K 08.11.2022):

     conda install -c conda-forge lightning-flash
    
torch-scatter (26 1.2K) - PyTorch Extension Library of Optimized Scatter Operations. MIT
  • GitHub ( 25 140 300 - 7% open 11.01.2023):

     git clone https://github.com/rusty1s/pytorch_scatter
    
  • PyPi ( 40K / month 53 22.10.2021):

     pip install torch-scatter
    
  • Conda ( 150K 02.01.2023):

     conda install -c conda-forge pytorch_scatter
    
PyTorch Sparse (26 780) - PyTorch Extension Library of Optimized Autograd Sparse.. MIT
  • GitHub ( 37 120 230 - 14% open 01.02.2023):

     git clone https://github.com/rusty1s/pytorch_sparse
    
  • PyPi ( 26K / month 47 30.06.2022):

     pip install torch-sparse
    
  • Conda ( 160K 23.01.2023):

     conda install -c conda-forge pytorch_sparse
    
tinygrad (25 10K) - You like pytorch? You like micrograd? You love tinygrad!. MIT
  • GitHub ( 77 880 8 150 - 14% open 02.02.2023):

     git clone https://github.com/geohot/tinygrad
    
Pytorch Toolbelt (24 1.3K) - PyTorch extensions for fast R&D prototyping and Kaggle.. MIT
  • GitHub ( 7 110 25 - 4% open 25.01.2023):

     git clone https://github.com/BloodAxe/pytorch-toolbelt
    
  • PyPi ( 13K / month 6 27.06.2022):

     pip install pytorch_toolbelt
    
TabNet (23 2K) - PyTorch implementation of TabNet paper :.. MIT
  • GitHub ( 19 400 280 - 15% open 12.12.2022):

     git clone https://github.com/dreamquark-ai/tabnet
    
  • PyPi ( 27K / month 9 02.02.2021):

     pip install pytorch-tabnet
    
  • Conda ( 3K 30.12.2021):

     conda install -c conda-forge pytorch-tabnet
    
reformer-pytorch (19 1.8K ) - Reformer, the efficient Transformer, in Pytorch. MIT
  • GitHub ( 11 240 120 - 11% open 24.06.2022):

     git clone https://github.com/lucidrains/reformer-pytorch
    
  • PyPi ( 1K / month 06.11.2021):

     pip install reformer-pytorch
    
Performer Pytorch (19 910 ) - An implementation of Performer, a linear attention-.. MIT
  • GitHub ( 6 130 76 82 - 47% open 02.02.2022):

     git clone https://github.com/lucidrains/performer-pytorch
    
  • PyPi ( 3.7K / month 4 02.02.2022):

     pip install performer-pytorch
    
Tez (17 1.1K) - Tez is a super-simple and lightweight Trainer for PyTorch. It also.. Apache-2
  • GitHub ( 2 140 37 41 - 56% open 16.09.2022):

     git clone https://github.com/abhishekkrthakur/tez
    
  • PyPi ( 510 / month 2 05.06.2022):

     pip install tez
    
Tensor Sensor (17 710 ) - The goal of this library is to generate more helpful.. MIT
  • GitHub ( 4 37 13 24 - 33% open 07.04.2022):

     git clone https://github.com/parrt/tensor-sensor
    
  • PyPi ( 1.8K / month 11.12.2021):

     pip install tensor-sensor
    
  • Conda ( 1.8K 11.12.2021):

     conda install -c conda-forge tensor-sensor
    
madgrad (16 780 ) - MADGRAD Optimization Method. MIT
  • GitHub ( 2 55 43 9 - 22% open 10.03.2022):

     git clone https://github.com/facebookresearch/madgrad
    
  • PyPi ( 5.8K / month 08.03.2022):

     pip install madgrad
    
Show 18 hidden projects...

Database Clients

Back to top

Libraries for connecting to, operating, and querying databases.

 best-of-python - DB Clients ( 2.8K) - Collection of database clients for python.


Others

Back to top

scipy (49 11K) - Ecosystem of open-source software for mathematics, science, and engineering. BSD-3
  • GitHub ( 1.4K 4.6K 360K 640K 9.1K - 18% open 31.01.2023):

     git clone https://github.com/scipy/scipy
    
  • PyPi ( 48M / month 58K 03.01.2023):

     pip install scipy
    
  • Conda ( 32M 04.01.2023):

     conda install -c conda-forge scipy
    
SymPy (46 10K) - A computer algebra system written in pure Python. BSD-3
  • GitHub ( 1.2K 3.8K 490K 52K 13K - 35% open 02.02.2023):

     git clone https://github.com/sympy/sympy
    
  • PyPi ( 3.4M / month 4.2K 20.03.2022):

     pip install sympy
    
  • Conda ( 2.8M 27.10.2022):

     conda install -c conda-forge sympy
    
Streamlit (40 22K) - Streamlit The fastest way to build data apps in Python. Apache-2
  • GitHub ( 160 2K 580 3K - 19% open 02.02.2023):

     git clone https://github.com/streamlit/streamlit
    
  • PyPi ( 1M / month 440 27.07.2022):

     pip install streamlit
    
carla (37 8.8K) - Open-source simulator for autonomous driving research. MIT
  • GitHub ( 150 2.7K 350 4.3K - 19% open 24.01.2023):

     git clone https://github.com/carla-simulator/carla
    
  • PyPi ( 22K / month 3 17.11.2021):

     pip install carla
    
Gradio (35 12K) - Wrap UIs around any model, share with anyone. Apache-2
  • GitHub ( 130 800 1.6K - 15% open 02.02.2023):

     git clone https://github.com/gradio-app/gradio
    
  • PyPi ( 1.1M / month 27 04.07.2022):

     pip install gradio
    
PaddleHub (35 11K) - Awesome pre-trained models toolkit based on PaddlePaddle... Apache-2
  • GitHub ( 68 1.9K 580 1.1K 1.2K - 42% open 29.12.2022):

     git clone https://github.com/PaddlePaddle/PaddleHub
    
  • PyPi ( 6K / month 6 28.12.2021):

     pip install paddlehub
    
PyOD (35 6.7K) - A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly.. BSD-2
  • GitHub ( 46 1.2K 1.9K 300 - 52% open 16.12.2022):

     git clone https://github.com/yzhao062/pyod
    
  • PyPi ( 590K / month 34 05.07.2022):

     pip install pyod
    
  • Conda ( 54K 17.12.2022):

     conda install -c conda-forge pyod
    
Datasette (34 7K) - An open source multi-tool for exploring and publishing data. Apache-2
  • GitHub ( 73 480 41 860 1.5K - 29% open 28.01.2023):

     git clone https://github.com/simonw/datasette
    
  • PyPi ( 36K / month 170 11.01.2023):

     pip install datasette
    
  • Conda ( 16K 12.01.2023):

     conda install -c conda-forge datasette
    
DeepChem (34 4.1K) - Democratizing Deep-Learning for Drug Discovery, Quantum Chemistry,.. MIT
  • GitHub ( 210 1.4K 150 1.6K - 34% open 02.02.2023):

     git clone https://github.com/deepchem/deepchem
    
  • PyPi ( 10K / month 6 06.07.2022):

     pip install deepchem
    
  • Conda ( 79K 02.12.2022):

     conda install -c conda-forge deepchem
    
Autograd (33 6.2K) - Efficiently computes derivatives of numpy code. MIT
  • GitHub ( 52 840 4.6K 400 - 42% open 29.09.2022):

     git clone https://github.com/HIPS/autograd
    
  • PyPi ( 990K / month 290 08.04.2022):

     pip install autograd
    
  • Conda ( 270K 03.10.2022):

     conda install -c conda-forge autograd
    
agate (33 1.1K) - A Python data analysis library that is optimized for humans instead of.. MIT
  • GitHub ( 51 150 1.4K 640 - 1% open 12.01.2023):

     git clone https://github.com/wireservice/agate
    
  • PyPi ( 2.5M / month 140 15.07.2021):

     pip install agate
    
  • Conda ( 110K 05.01.2023):

     conda install -c conda-forge agate
    
River (32 4.1K) - Online machine learning in Python. BSD-3
  • GitHub ( 94 460 220 400 - 1% open 29.01.2023):

     git clone https://github.com/online-ml/river
    
  • PyPi ( 15K / month 14 06.06.2022):

     pip install river
    
  • Conda ( 26K 31.10.2022):

     conda install -c conda-forge river
    
Pythran (32 1.9K) - Ahead of Time compiler for numeric kernels. BSD-3
  • GitHub ( 68 180 500 800 - 15% open 30.01.2023):

     git clone https://github.com/serge-sans-paille/pythran
    
  • PyPi ( 360K / month 17 14.12.2021):

     pip install pythran
    
  • Conda ( 320K 15.01.2023):

     conda install -c conda-forge pythran
    
tensorly (32 1.4K) - TensorLy: Tensor Learning in Python. BSD-2
  • GitHub ( 59 260 340 230 - 22% open 24.01.2023):

     git clone https://github.com/tensorly/tensorly
    
  • PyPi ( 13K / month 35 14.01.2023):

     pip install tensorly
    
  • Conda ( 360K 15.01.2023):

     conda install -c conda-forge tensorly
    
adapter-transformers (32 1.2K) - Huggingface Transformers + Adapters =. Apache-2 huggingface
  • GitHub ( 1.6K 210 120 260 - 17% open 02.02.2023):

     git clone https://github.com/Adapter-Hub/adapter-transformers
    
  • PyPi ( 27K / month 5 18.05.2022):

     pip install adapter-transformers
    
datalad (32 380) - Keep code, data, containers under control with git and git-annex. MIT
  • GitHub ( 54 100 3.8K - 12% open 30.01.2023):

     git clone https://github.com/datalad/datalad
    
  • PyPi ( 8.4K / month 56 06.07.2022):

     pip install datalad
    
  • Conda ( 300K 17.01.2023):

     conda install -c conda-forge datalad
    
hdbscan (31 2.3K) - A high performance implementation of HDBSCAN clustering. BSD-3
  • GitHub ( 81 430 1.9K 460 - 65% open 10.11.2022):

     git clone https://github.com/scikit-learn-contrib/hdbscan
    
  • PyPi ( 490K / month 160 08.02.2022):

     pip install hdbscan
    
  • Conda ( 1.4M 02.11.2022):

     conda install -c conda-forge hdbscan
    
PennyLane (31 1.7K) - PennyLane is a cross-platform Python library for differentiable.. Apache-2
  • GitHub ( 130 430 63 950 - 28% open 02.02.2023):

     git clone https://github.com/PennyLaneAI/PennyLane
    
  • PyPi ( 18K / month 34 20.06.2022):

     pip install pennylane
    
  • Conda ( 6.9K 01.05.2022):

     conda install -c conda-forge pennylane
    
pyjanitor (30 1.1K) - Clean APIs for data cleaning. Python implementation of R package.. MIT
  • GitHub ( 110 160 280 520 - 20% open 31.01.2023):

     git clone https://github.com/pyjanitor-devs/pyjanitor
    
  • PyPi ( 41K / month 12 03.05.2022):

     pip install pyjanitor
    
  • Conda ( 150K 17.10.2022):

     conda install -c conda-forge pyjanitor
    
pyopencl (30 950) - OpenCL integration for Python, plus shiny features. MIT
  • GitHub ( 94 230 910 320 - 20% open 15.01.2023):

     git clone https://github.com/inducer/pyopencl
    
  • PyPi ( 32K / month 190 22.06.2022):

     pip install pyopencl
    
  • Conda ( 790K 05.01.2023):

     conda install -c conda-forge pyopencl
    
kmodes (29 1.1K) - Python implementations of the k-modes and k-prototypes clustering.. MIT
  • GitHub ( 21 390 1.7K 150 - 11% open 10.12.2022):

     git clone https://github.com/nicodv/kmodes
    
  • PyPi ( 400K / month 26 14.04.2022):

     pip install kmodes
    
  • Conda ( 24K 06.09.2022):

     conda install -c conda-forge kmodes
    
pysc2 (28 7.7K) - StarCraft II Learning Environment. Apache-2
  • GitHub ( 36 1.2K 30K 480 280 - 17% open 07.08.2022):

     git clone https://github.com/deepmind/pysc2
    
  • PyPi ( 2.2K / month 26 27.09.2019):

     pip install pysc2
    
alibi-detect (28 1.7K) - Algorithms for outlier, adversarial and drift detection. Apache-2
  • GitHub ( 17 170 150 320 - 35% open 01.02.2023):

     git clone https://github.com/SeldonIO/alibi-detect
    
  • PyPi ( 22K / month 6 01.06.2022):

     pip install alibi-detect
    
TabPy (28 1.4K) - Execute Python code on the fly and display results in Tableau visualizations:. MIT
  • GitHub ( 47 530 110 310 - 3% open 31.01.2023):

     git clone https://github.com/tableau/TabPy
    
  • PyPi ( 17K / month 2 20.01.2022):

     pip install tabpy
    
  • Conda ( 3.5K 02.05.2022):

     conda install -c anaconda tabpy-client
    
pycm (28 1.3K) - Multi-class confusion matrix library in Python. MIT
  • GitHub ( 17 120 190 190 - 5% open 01.02.2023):

     git clone https://github.com/sepandhaghighi/pycm
    
  • PyPi ( 35K / month 13 27.04.2022):

     pip install pycm
    
avalanche (28 1.2K) - Avalanche: an End-to-End Library for Continual Learning based on.. MIT
  • GitHub ( 60 210 4 18 630 - 12% open 02.02.2023):

     git clone https://github.com/ContinualAI/avalanche
    
  • PyPi ( 990 / month 14.06.2022):

     pip install avalanche-lib
    
causalml (27 3.8K) - Uplift modeling and causal inference with machine learning.. Apache-2
  • GitHub ( 48 590 61 310 - 25% open 12.12.2022):

     git clone https://github.com/uber/causalml
    
  • PyPi ( 45K / month 1 14.03.2022):

     pip install causalml
    
Mars (27 2.5K) - Mars is a tensor-based unified framework for large-scale data.. Apache-2
  • GitHub ( 45 310 1.2K - 18% open 17.01.2023):

     git clone https://github.com/mars-project/mars
    
  • PyPi ( 41K / month 1 12.06.2022):

     pip install pymars
    
anomalib (27 1.6K) - An anomaly detection library comprising state-of-the-art algorithms.. Apache-2
  • GitHub ( 30 270 29 6 390 - 9% open 01.02.2023):

     git clone https://github.com/openvinotoolkit/anomalib
    
  • PyPi ( 2.3K / month 05.07.2022):

     pip install anomalib
    
metric-learn (26 1.3K ) - Metric learning algorithms in Python. MIT
  • GitHub ( 22 220 250 170 - 30% open 21.06.2022):

     git clone https://github.com/scikit-learn-contrib/metric-learn
    
  • PyPi ( 33K / month 11 02.07.2020):

     pip install metric-learn
    
  • Conda ( 8.3K 02.07.2020):

     conda install -c conda-forge metric-learn
    
PySwarms (26 1K ) - A research toolkit for particle swarm optimization in Python. MIT
  • GitHub ( 44 310 220 210 - 7% open 03.07.2022):

     git clone https://github.com/ljvmiranda921/pyswarms
    
  • PyPi ( 6.9K / month 11 03.01.2021):

     pip install pyswarms
    
Trax (25 7.3K) - Trax Deep Learning with Clear Code and Speed. Apache-2
  • GitHub ( 78 760 96 230 - 45% open 19.12.2022):

     git clone https://github.com/google/trax
    
  • PyPi ( 6.3K / month 26.10.2021):

     pip install trax
    
gplearn (25 1.3K) - Genetic Programming in Python, with a scikit-learn inspired API. BSD-3
  • GitHub ( 11 220 330 200 - 8% open 04.08.2022):

     git clone https://github.com/trevorstephens/gplearn
    
  • PyPi ( 6.4K / month 11 03.05.2022):

     pip install gplearn
    
  • Conda ( 4.3K 04.05.2022):

     conda install -c conda-forge gplearn
    
Prince (25 930) - Python factor analysis library (PCA, CA, MCA, MFA, FAMD). MIT
  • GitHub ( 14 160 290 110 - 37% open 07.09.2022):

     git clone https://github.com/MaxHalford/prince
    
  • PyPi ( 28K / month 5 06.10.2020):

     pip install prince
    
  • Conda ( 14K 30.04.2021):

     conda install -c conda-forge prince-factor-analysis
    
AugLy (24 4.7K) - A data augmentations library for audio, image, text, and video. MIT
  • GitHub ( 26 270 51 71 - 23% open 05.12.2022):

     git clone https://github.com/facebookresearch/AugLy
    
  • PyPi ( 1.7K / month 3 28.03.2022):

     pip install augly
    
MONAILabel (24 360) - MONAI Label is an intelligent open source image labeling and.. Apache-2
  • GitHub ( 38 120 37K 370 - 10% open 27.01.2023):

     git clone https://github.com/Project-MONAI/MONAILabel
    
  • PyPi ( 780 / month 03.07.2022):

     pip install monailabel-weekly
    
Feature Engine (23 1.2K) - Feature engineering package with sklearn like functionality. BSD-3
  • GitHub ( 38 250 07.12.2022):

     git clone https://github.com/solegalli/feature_engine
    
  • PyPi ( 130K / month 24.10.2022):

     pip install feature_engine
    
  • Conda ( 25K 23.11.2022):

     conda install -c conda-forge feature_engine
    
AstroML (23 880) - Machine learning, statistics, and data mining for astronomy and.. BSD-2
  • GitHub ( 30 280 160 - 39% open 12.12.2022):

     git clone https://github.com/astroML/astroML
    
  • PyPi ( 1.8K / month 33 01.03.2022):

     pip install astroML
    
  • Conda ( 35K 02.03.2022):

     conda install -c conda-forge astroml
    
findspark (23 460 ) - Find pyspark to make it importable. BSD-3
  • GitHub ( 15 72 3.1K 22 - 50% open 11.02.2022):

     git clone https://github.com/minrk/findspark
    
  • PyPi ( 2.1M / month 140 11.02.2022):

     pip install findspark
    
  • Conda ( 770K 11.02.2022):

     conda install -c conda-forge findspark
    
BioPandas (22 570) - Working with molecular structures in pandas DataFrames. BSD-3
  • GitHub ( 9 100 150 50 - 40% open 29.01.2023):

     git clone https://github.com/rasbt/biopandas
    
  • PyPi ( 5K / month 16 13.05.2022):

     pip install biopandas
    
  • Conda ( 130K 13.05.2022):

     conda install -c conda-forge biopandas
    
dstack (22 480) - An open-source ML workflow orchestration system designed for.. MPL-2.0
  • GitHub ( 7 26 10 140 - 25% open 02.02.2023):

     git clone https://github.com/dstackai/dstack
    
  • PyPi ( 190 / month 29.06.2022):

     pip install dstack
    
StreamAlert (21 2.8K ) - StreamAlert is a serverless, realtime data analysis.. Apache-2
  • GitHub ( 33 320 340 - 24% open 20.07.2022):

     git clone https://github.com/airbnb/streamalert
    
benchmark_VAE (20 1.2K) - Unifying Variational Autoencoder (VAE) implementations.. Apache-2
  • GitHub ( 7 98 3 31 - 19% open 05.01.2023):

     git clone https://github.com/clementchadebec/benchmark_VAE
    
  • PyPi ( 220 / month 05.07.2022):

     pip install pythae
    
SUOD (19 340 ) - (MLSys 21) An Acceleration System for Large-scare Unsupervised.. BSD-2
  • GitHub ( 2 43 460 10 - 70% open 07.07.2022):

     git clone https://github.com/yzhao062/SUOD
    
  • PyPi ( 26K / month 01.10.2021):

     pip install suod
    
pykale (17 370) - Knowledge-Aware machine LEarning (KALE): accessible machine learning.. MIT
  • GitHub ( 18 51 91 - 4% open 19.01.2023):

     git clone https://github.com/pykale/pykale
    
  • PyPi ( 26 / month 12.04.2022):

     pip install pykale
    
KD-Lib (15 460) - A Pytorch Knowledge Distillation library for benchmarking and.. MIT
  • GitHub ( 6 43 64 - 23% open 27.12.2022):

     git clone https://github.com/SforAiDl/KD_Lib
    
  • PyPi ( 31 / month 18.05.2022):

     pip install KD-Lib
    
NeuralCompression (15 300) - A collection of tools for neural compression enthusiasts. MIT
  • GitHub ( 5 26 77 - 23% open 20.12.2022):

     git clone https://github.com/facebookresearch/NeuralCompression
    
  • PyPi ( 130 / month 12.01.2022):

     pip install neuralcompression
    
traingenerator (13 1.2K ) - A web app to generate template code for machine learning. MIT
  • GitHub ( 3 170 16 - 81% open 30.06.2022):

     git clone https://github.com/jrieke/traingenerator
    
Show 17 hidden projects...
  • cleanlab (30 5K) - The standard data-centric AI package for data quality and machine.. AGPL-3.0
  • Cython BLIS (29 200) - Fast matrix-multiplication as a self-contained Python library no.. BSD-3
  • pyclustering (26 1K ) - pyclustering is a Python, C++ data mining library. BSD-3
  • modAL (25 1.9K ) - A modular active learning framework for Python. MIT
  • minisom (25 1.2K) - MiniSom is a minimalistic implementation of the Self Organizing.. CC-BY-3.0
  • mlens (22 770 ) - ML-Ensemble high performance ensemble learning. MIT
  • metricflow (22 740) - MetricFlow allows you to define, build, and maintain metrics in.. AGPL-3.0
  • vecstack (21 670 ) - Python package for stacking (machine learning technique). MIT
  • rrcf (21 420 ) - Implementation of the Robust Random Cut Forest algorithm for anomaly.. MIT
  • opyrator (20 2.8K ) - Turns your machine learning code into microservices with web API,.. MIT
  • impyute (20 330 ) - Data imputations library to preprocess datasets with missing data. MIT
  • scikit-rebate (19 380 ) - A scikit-learn-compatible Python implementation of.. MIT
  • pandas-ml (19 300 ) - pandas, scikit-learn, xgboost and seaborn integration. BSD-3
  • pymdp (19 240) - A Python implementation of active inference for Markov Decision Processes. MIT
  • baikal (18 590 ) - A graph-based functional API for building complex scikit-learn.. BSD-3
  • apricot (17 460 ) - apricot implements submodular optimization for the purpose of.. MIT
  • nylon (12 80 ) - An intelligent, flexible grammar of machine learning. MIT

Related Resources

Contribution

Contributions are encouraged and always welcome! If you like to add or update projects, choose one of the following ways:

  • Open an issue by selecting one of the provided categories from the issue page and fill in the requested information.
  • Modify the projects.yaml with your additions or changes, and submit a pull request. This can also be done directly via the Github UI.

If you like to contribute to or share suggestions regarding the project metadata collection or markdown generation, please refer to the best-of-generator repository. If you like to create your own best-of list, we recommend to follow this guide.

For more information on how to add or update projects, please read the contribution guidelines. By participating in this project, you agree to abide by its Code of Conduct.

License

CC0

Alternatives To Best Of Ml Python
Select To Compare


Alternative Project Comparisons
Related Awesome Lists
Top Programming Languages

Get A Weekly Email With Trending Projects For These Topics
No Spam. Unsubscribe easily at any time.
Python (892,779
Learning (75,662
Github (45,477
Machine Learning (40,996
Deep Learning (39,386
Tensorflow (22,906
Pytorch (22,682
Natural Language Processing (15,920
Awesome List (13,901
Data Science (11,466
Keras (10,613
Data Visualization (6,299
Conda (5,942
Data Analysis (5,408
Python Library (4,520
Scikit Learn (2,546
Automl (834
Jax (346
Best Of (15