Awesome Open Source
Awesome Open Source

Best-of Machine Learning with Python

  A ranked list of awesome machine learning Python libraries. Updated weekly.

This curated list contains 890 awesome open-source projects with a total of 3.2M stars grouped into 33 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome!


  Discover other best-of lists or create your own.
  Subscribe to our newsletter for updates and trending projects.


Contents

Explanation

  •   Combined project-quality score
  •   Star count from GitHub
  •   New project (less than 6 months old)
  •   Inactive project (6 months no activity)
  •   Dead project (12 months no activity)
  •   Project is trending up or down
  •   Project was recently added
  •   Warning (e.g. missing/risky license)
  •   Contributors count from GitHub
  •   Fork count from GitHub
  •   Issue count from GitHub
  •   Last update timestamp on package manager
  •   Download count from package manager
  •   Number of dependent projects
  •   Tensorflow related project
  •   Sklearn related project
  •   PyTorch related project
  •   MxNet related project
  •   Apache Spark related project
  •   Jupyter related project
  •   PaddlePaddle related project
  •   Pandas related project

Machine Learning Frameworks

Back to top

General-purpose machine learning and deep learning frameworks.

Tensorflow (55 160K) - An Open Source Machine Learning Framework for Everyone. Apache-2
  • GitHub ( 3.9K 86K 180K 34K - 7% open 13.01.2022):

     git clone https://github.com/tensorflow/tensorflow
    
  • PyPi ( 13M / month 14K 22.12.2021):

     pip install tensorflow
    
  • Conda ( 3M 08.12.2021):

     conda install -c conda-forge tensorflow
    
  • Docker Hub ( 64M 2K 13.01.2022):

     docker pull tensorflow/tensorflow
    
PyTorch (49 53K) - Tensors and Dynamic neural networks in Python with strong GPU.. BSD-3
  • GitHub ( 3.1K 15K 1.3K 28K - 40% open 13.01.2022):

     git clone https://github.com/pytorch/pytorch
    
  • PyPi ( 5.7M / month 6.6K 15.12.2021):

     pip install torch
    
  • Conda ( 15M 15.12.2021):

     conda install -c pytorch pytorch
    
scikit-learn (49 49K) - scikit-learn: machine learning in Python. BSD-3
  • GitHub ( 2.4K 22K 760 300K 9.9K - 25% open 13.01.2022):

     git clone https://github.com/scikit-learn/scikit-learn
    
  • PyPi ( 23M / month 24K 25.12.2021):

     pip install scikit-learn
    
  • Conda ( 11M 26.12.2021):

     conda install -c conda-forge scikit-learn
    
Keras (44 54K) - Deep Learning for humans. Apache-2
  • GitHub ( 1K 19K 11K - 2% open 12.01.2022):

     git clone https://github.com/keras-team/keras
    
  • PyPi ( 6.9M / month 210 21.12.2021):

     pip install keras
    
  • Conda ( 2M 25.11.2021):

     conda install -c conda-forge keras
    
XGBoost (44 22K) - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or.. Apache-2
  • GitHub ( 540 8.2K 3.6K 26K 4.3K - 7% open 13.01.2022):

     git clone https://github.com/dmlc/xgboost
    
  • PyPi ( 8M / month 1.3K 24.11.2021):

     pip install xgboost
    
  • Conda ( 2.2M 20.11.2021):

     conda install -c conda-forge xgboost
    
StatsModels (44 7K) - Statsmodels: statistical modeling and econometrics in Python. BSD-3
  • GitHub ( 350 2.4K 26 56K 4.7K - 47% open 13.01.2022):

     git clone https://github.com/statsmodels/statsmodels
    
  • PyPi ( 6.8M / month 4.4K 15.12.2021):

     pip install statsmodels
    
  • Conda ( 5.5M 13.11.2021):

     conda install -c conda-forge statsmodels
    
PySpark (42 32K) - Apache Spark Python API. Apache-2
  • GitHub ( 2.6K 25K 13.01.2022):

     git clone https://github.com/apache/spark
    
  • PyPi ( 15M / month 750 18.10.2021):

     pip install pyspark
    
  • Conda ( 1.4M 18.10.2021):

     conda install -c conda-forge pyspark
    
pytorch-lightning (42 17K) - The lightweight PyTorch wrapper for high-performance.. Apache-2
  • GitHub ( 600 2.1K 5K 6.4K 4.5K - 10% open 13.01.2022):

     git clone https://github.com/PyTorchLightning/pytorch-lightning
    
  • PyPi ( 850K / month 290 05.01.2022):

     pip install pytorch-lightning
    
  • Conda ( 390K 05.01.2022):

     conda install -c conda-forge pytorch-lightning
    
LightGBM (42 13K) - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT,.. MIT
  • GitHub ( 250 3.5K 130K 11K 2.5K - 6% open 09.01.2022):

     git clone https://github.com/microsoft/LightGBM
    
  • PyPi ( 10M / month 570 07.01.2022):

     pip install lightgbm
    
  • Conda ( 850K 08.01.2022):

     conda install -c conda-forge lightgbm
    
PaddlePaddle (41 17K) - PArallel Distributed Deep LEarning: Machine Learning.. Apache-2
  • GitHub ( 690 4.2K 15K 88 15K - 18% open 13.01.2022):

     git clone https://github.com/PaddlePaddle/Paddle
    
  • PyPi ( 97K / month 41 02.12.2021):

     pip install paddlepaddle
    
jax (41 16K) - Composable transformations of Python+NumPy programs: differentiate,.. Apache-2
  • GitHub ( 350 1.4K 3.2K 2.9K - 31% open 13.01.2022):

     git clone https://github.com/google/jax
    
  • PyPi ( 300K / month 240 08.12.2021):

     pip install jax
    
  • Conda ( 260K 10.12.2021):

     conda install -c conda-forge jaxlib
    
MXNet (40 20K) - Lightweight, Portable, Flexible Distributed/Mobile Deep Learning.. Apache-2
  • GitHub ( 970 6.9K 24K 9.6K - 20% open 13.01.2022):

     git clone https://github.com/apache/incubator-mxnet
    
  • PyPi ( 250K / month 280 18.12.2021):

     pip install mxnet
    
  • Conda ( 6.9K 5 29.02.2020):

     conda install -c anaconda mxnet
    
Catboost (40 6.3K) - A fast, scalable, high performance Gradient Boosting on Decision.. Apache-2
  • GitHub ( 940 950 71K 1.7K - 20% open 13.01.2022):

     git clone https://github.com/catboost/catboost
    
  • PyPi ( 3.1M / month 190 04.11.2021):

     pip install catboost
    
  • Conda ( 900K 09.11.2021):

     conda install -c conda-forge catboost
    
Fastai (37 22K) - The fastai deep learning library. Apache-2
  • GitHub ( 600 7.1K 1.5K - 6% open 29.11.2021):

     git clone https://github.com/fastai/fastai
    
  • PyPi ( 190K / month 290 23.10.2021):

     pip install fastai
    
Jina (37 13K) - Cloud-native neural search framework for kind of data. Apache-2
  • GitHub ( 140 1.8K 210 1.3K - 6% open 13.01.2022):

     git clone https://github.com/jina-ai/jina
    
  • PyPi ( 13K / month 11.01.2022):

     pip install jina
    
  • Docker Hub ( 1M 6 11.01.2022):

     docker pull jinaai/jina
    
Theano (37 9.5K) - Theano was a Python library that allows you to define, optimize, and.. BSD-3
  • GitHub ( 380 2.5K 12K 2.8K - 24% open 23.11.2021):

     git clone https://github.com/Theano/Theano
    
  • PyPi ( 220K / month 2.8K 27.07.2020):

     pip install theano
    
  • Conda ( 1.8M 10.11.2021):

     conda install -c conda-forge theano
    
Chainer (36 5.7K) - A flexible framework of neural networks for deep learning. MIT
  • GitHub ( 320 1.4K 2.5K 2K - 0% open 05.01.2022):

     git clone https://github.com/chainer/chainer
    
  • PyPi ( 18K / month 400 05.01.2022):

     pip install chainer
    
Thinc (36 2.4K) - A refreshing functional take on deep learning, compatible with your favorite.. MIT
  • GitHub ( 43 230 18K 120 - 17% open 21.12.2021):

     git clone https://github.com/explosion/thinc
    
  • PyPi ( 3.7M / month 610 17.12.2021):

     pip install thinc
    
  • Conda ( 1.8M 08.12.2021):

     conda install -c conda-forge thinc
    
PyFlink (34 18K) - Apache Flink Python API. Apache-2
  • GitHub ( 1.4K 9.9K 13.01.2022):

     git clone https://github.com/apache/flink
    
  • PyPi ( 7.4K / month 9 16.12.2021):

     pip install apache-flink
    
Vowpal Wabbit (33 7.8K) - Vowpal Wabbit is a machine learning system which pushes the.. BSD-3
  • GitHub ( 310 1.8K 1.2K - 15% open 12.01.2022):

     git clone https://github.com/VowpalWabbit/vowpal_wabbit
    
  • PyPi ( 48K / month 25 14.07.2021):

     pip install vowpalwabbit
    
Ignite (33 3.8K) - High-level library to help with training and evaluating neural.. BSD-3
  • GitHub ( 160 510 1K - 13% open 10.01.2022):

     git clone https://github.com/pytorch/ignite
    
  • PyPi ( 69K / month 76 13.01.2022):

     pip install pytorch-ignite
    
  • Conda ( 77K 19.10.2021):

     conda install -c pytorch ignite
    
Flax (33 2.5K) - Flax is a neural network library for JAX that is designed for.. Apache-2 jax
  • GitHub ( 120 270 31 610 450 - 36% open 13.01.2022):

     git clone https://github.com/google/flax
    
  • PyPi ( 160K / month 50 27.10.2021):

     pip install flax
    
tensorflow-upstream (33 580) - TensorFlow ROCm port. Apache-2
  • GitHub ( 3.9K 66 17 320 - 17% open 13.01.2022):

     git clone https://github.com/ROCmSoftwarePlatform/tensorflow-upstream
    
  • PyPi ( 1.6K / month 5 17.12.2021):

     pip install tensorflow-rocm
    
Turi Create (32 11K) - Turi Create simplifies the development of custom machine learning.. BSD-3
  • GitHub ( 83 1.1K 5K 280 1.8K - 27% open 29.11.2021):

     git clone https://github.com/apple/turicreate
    
  • PyPi ( 22K / month 19 30.09.2020):

     pip install turicreate
    
tensorpack (32 6.1K) - A Neural Net Training Interface on TensorFlow, with focus.. Apache-2
  • GitHub ( 58 1.8K 130 920 1.3K - 0% open 27.11.2021):

     git clone https://github.com/tensorpack/tensorpack
    
  • PyPi ( 21K / month 46 22.01.2021):

     pip install tensorpack
    
mlpack (32 3.9K) - mlpack: a scalable C++ machine learning library --. BSD-3
  • GitHub ( 280 1.4K 1.4K - 7% open 12.01.2022):

     git clone https://github.com/mlpack/mlpack
    
  • PyPi ( 210 / month 1 28.10.2020):

     pip install mlpack
    
  • Conda ( 97K 09.11.2021):

     conda install -c conda-forge mlpack
    
Sonnet (31 9.2K) - TensorFlow-based neural network library. Apache-2
  • GitHub ( 53 1.3K 750 170 - 13% open 15.12.2021):

     git clone https://github.com/deepmind/sonnet
    
  • PyPi ( 120K / month 52 27.03.2020):

     pip install dm-sonnet
    
  • Conda ( 13K 14.11.2020):

     conda install -c conda-forge sonnet
    
dyNET (31 3.3K ) - DyNet: The Dynamic Neural Network Toolkit. Apache-2
  • GitHub ( 160 700 4.4K 200 920 - 28% open 27.01.2021):

     git clone https://github.com/clab/dynet
    
  • PyPi ( 15K / month 28 21.10.2020):

     pip install dyNET
    
einops (30 4.1K) - Deep learning operations reinvented (for pytorch, tensorflow, jax and.. MIT
  • GitHub ( 14 160 1.8K 91 - 32% open 10.01.2022):

     git clone https://github.com/arogozhnikov/einops
    
  • PyPi ( 580K / month 160 31.08.2021):

     pip install einops
    
  • Conda ( 9.7K 31.08.2021):

     conda install -c conda-forge einops
    
Neural Network Libraries (30 2.5K) - Neural Network Libraries. Apache-2
  • GitHub ( 64 310 530 76 - 42% open 11.01.2022):

     git clone https://github.com/sony/nnabla
    
  • PyPi ( 3.9K / month 51 23.12.2021):

     pip install nnabla
    
ktrain (30 940) - ktrain is a Python library that makes deep learning and AI more.. Apache-2
  • GitHub ( 12 220 250 380 - 1% open 23.11.2021):

     git clone https://github.com/amaiya/ktrain
    
  • PyPi ( 31K / month 2 05.11.2021):

     pip install ktrain
    
Ludwig (29 8K) - Data-centric declarative deep learning framework. Apache-2
  • GitHub ( 110 950 99 640 - 24% open 12.01.2022):

     git clone https://github.com/ludwig-ai/ludwig
    
  • PyPi ( 2.9K / month 8 15.06.2021):

     pip install ludwig
    
skorch (29 4.3K) - A scikit-learn compatible neural network library that wraps.. BSD-3
  • GitHub ( 47 300 420 420 - 12% open 30.12.2021):

     git clone https://github.com/skorch-dev/skorch
    
  • PyPi ( 23K / month 33 31.10.2021):

     pip install skorch
    
  • Conda ( 500K 30.11.2021):

     conda install -c conda-forge skorch
    
mace (25 4.6K) - MACE is a deep learning inference framework optimized for mobile.. Apache-2
  • GitHub ( 63 780 1.4K 650 - 6% open 13.01.2022):

     git clone https://github.com/XiaoMi/mace
    
Neural Tangents (25 1.7K) - Fast and Easy Infinite Neural Networks in Python. Apache-2
  • GitHub ( 21 190 190 27 100 - 30% open 14.12.2021):

     git clone https://github.com/google/neural-tangents
    
  • PyPi ( 430 / month 1 17.11.2021):

     pip install neural-tangents
    
Haiku (25 1.7K) - JAX-based neural network library. Apache-2
  • GitHub ( 56 120 290 130 - 25% open 12.01.2022):

     git clone https://github.com/deepmind/dm-haiku
    
fklearn (25 1.4K) - fklearn: Functional Machine Learning. Apache-2
  • GitHub ( 41 150 11 41 - 48% open 30.12.2021):

     git clone https://github.com/nubank/fklearn
    
  • PyPi ( 3K / month 30.12.2021):

     pip install fklearn
    
Objax (22 670 ) - Objax is a machine learning framework that provides an Object.. Apache-2 jax
  • GitHub ( 22 59 17 100 - 44% open 12.01.2022):

     git clone https://github.com/google/objax
    
  • PyPi ( 260 / month 2 10.01.2022):

     pip install objax
    
ThunderSVM (21 1.4K ) - ThunderSVM: A Fast SVM Library on GPUs and CPUs. Apache-2
  • GitHub ( 33 180 2.3K 200 - 28% open 10.02.2021):

     git clone https://github.com/Xtra-Computing/thundersvm
    
  • PyPi ( 540 / month 13.03.2020):

     pip install thundersvm
    
Torchbearer (21 620 ) - torchbearer: A model fitting library for PyTorch. MIT
  • GitHub ( 13 68 56 240 - 3% open 26.03.2021):

     git clone https://github.com/pytorchbearer/torchbearer
    
  • PyPi ( 660 / month 4 31.01.2020):

     pip install torchbearer
    
elegy (19 310) - A High Level API for Deep Learning in JAX. MIT jax
  • GitHub ( 14 21 85 - 25% open 14.12.2021):

     git clone https://github.com/poets-ai/elegy
    
  • PyPi ( 380 / month 14.12.2021):

     pip install elegy
    
NeoML (18 660) - Machine learning framework for both deep learning and traditional.. Apache-2
  • GitHub ( 28 100 70 - 40% open 12.01.2022):

     git clone https://github.com/neoml-lib/neoml
    
ThunderGBM (17 620 ) - ThunderGBM: Fast GBDTs and Random Forests on GPUs. Apache-2
  • GitHub ( 10 81 68 - 47% open 05.01.2021):

     git clone https://github.com/Xtra-Computing/thundergbm
    
  • PyPi ( 110 / month 01.05.2020):

     pip install thundergbm
    
Show 12 hidden projects...
  • dlib (38 11K) - A toolkit for making real world machine learning and data analysis.. BSL-1.0
  • TFlearn (32 9.6K ) - Deep learning library featuring a higher-level API for TensorFlow. MIT
  • CNTK (31 17K ) - Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit. MIT
  • MindsDB (29 4.3K) - Predictive AI layer for existing databases. GPL-3.0
  • Lasagne (29 3.8K ) - Lightweight library to build and train neural networks in Theano. MIT
  • NuPIC (28 6.3K ) - Numenta Platform for Intelligent Computing is an implementation.. AGPL-3.0
  • SHOGUN (26 2.9K ) - Unified and efficient Machine Learning. BSD-3
  • xLearn (25 3K ) - High performance, easy-to-use, and scalable machine learning (ML).. Apache-2
  • NeuPy (25 700 ) - NeuPy is a Tensorflow based python library for prototyping and building.. MIT
  • neon (22 3.9K ) - Intel Nervana reference deep learning framework committed to best.. Apache-2
  • chefboost (20 300) - A Lightweight Decision Tree Framework supporting regular algorithms:.. MIT
  • StarSpace (16 3.7K ) - Learning embeddings for classification, retrieval and ranking. MIT

Data Visualization

Back to top

General-purpose and task-specific data visualization libraries.

Matplotlib (49 15K) - matplotlib: plotting with Python. Python-2.0
  • GitHub ( 1.3K 6.2K 490K 8.5K - 20% open 13.01.2022):

     git clone https://github.com/matplotlib/matplotlib
    
  • PyPi ( 24M / month 52K 11.12.2021):

     pip install matplotlib
    
  • Conda ( 11M 13.12.2021):

     conda install -c conda-forge matplotlib
    
Bokeh (43 16K) - Interactive Data Visualization in the browser, from Python. BSD-3
  • GitHub ( 590 3.9K 43K 6.8K - 10% open 12.01.2022):

     git clone https://github.com/bokeh/bokeh
    
  • PyPi ( 1.9M / month 3.5K 06.01.2022):

     pip install bokeh
    
  • Conda ( 6.3M 22.11.2021):

     conda install -c conda-forge bokeh
    
Seaborn (42 9.1K) - Statistical data visualization in Python. BSD-3
  • GitHub ( 160 1.5K 210 130K 2K - 4% open 02.01.2022):

     git clone https://github.com/mwaskom/seaborn
    
  • PyPi ( 5.2M / month 8.8K 16.08.2021):

     pip install seaborn
    
  • Conda ( 3.2M 16.08.2021):

     conda install -c conda-forge seaborn
    
Plotly (41 11K) - The interactive graphing library for Python (includes Plotly Express). MIT
  • GitHub ( 190 2.1K 9 2.2K - 47% open 21.12.2021):

     git clone https://github.com/plotly/plotly.py
    
  • PyPi ( 6.3M / month 3.8K 21.12.2021):

     pip install plotly
    
  • Conda ( 2.2M 21.12.2021):

     conda install -c conda-forge plotly
    
  • npm ( 44K / month 4 12.01.2021):

     npm install plotlywidget
    
Altair (39 7.2K) - Declarative statistical visualization library for Python. BSD-3
  • GitHub ( 130 630 21K 1.6K - 15% open 29.12.2021):

     git clone https://github.com/altair-viz/altair
    
  • PyPi ( 3.7M / month 340 29.12.2021):

     pip install altair
    
  • Conda ( 1.1M 29.12.2021):

     conda install -c conda-forge altair
    
dash (38 16K) - Analytical Web Apps for Python, R, Julia, and Jupyter. No JavaScript Required. MIT
  • GitHub ( 100 1.6K 170 1.2K - 47% open 12.01.2022):

     git clone https://github.com/plotly/dash
    
  • PyPi ( 1.1M / month 1.1K 04.09.2021):

     pip install dash
    
  • Conda ( 340K 21.09.2021):

     conda install -c conda-forge dash
    
Graphviz (37 1.1K) - Simple Python interface for Graphviz. MIT
  • GitHub ( 17 170 27K 120 - 3% open 01.01.2022):

     git clone https://github.com/xflr6/graphviz
    
  • PyPi ( 7.8M / month 2.9K 12.12.2021):

     pip install graphviz
    
pyecharts (36 12K) - Python Echarts Plotting Library. MIT
  • GitHub ( 30 2.6K 2K 1.5K - 1% open 16.11.2021):

     git clone https://github.com/pyecharts/pyecharts
    
  • PyPi ( 66K / month 210 16.11.2021):

     pip install pyecharts
    
pandas-profiling (36 8.4K) - Create HTML profiling reports from pandas DataFrame.. MIT
  • GitHub ( 83 1.2K 6.4K 530 - 18% open 08.01.2022):

     git clone https://github.com/pandas-profiling/pandas-profiling
    
  • PyPi ( 960K / month 140 27.09.2021):

     pip install pandas-profiling
    
  • Conda ( 180K 28.09.2021):

     conda install -c conda-forge pandas-profiling
    
UMAP (36 5.3K) - Uniform Manifold Approximation and Projection. BSD-3
  • GitHub ( 97 600 4.4K 590 - 51% open 04.01.2022):

     git clone https://github.com/lmcinnes/umap
    
  • PyPi ( 1.4M / month 290 29.10.2021):

     pip install umap-learn
    
PyQtGraph (34 2.7K) - Fast data visualization and GUI tools for scientific / engineering.. MIT
  • GitHub ( 220 900 990 - 31% open 12.01.2022):

     git clone https://github.com/pyqtgraph/pyqtgraph
    
  • PyPi ( 56K / month 760 11.10.2021):

     pip install pyqtgraph
    
  • Conda ( 210K 11.10.2021):

     conda install -c conda-forge pyqtgraph
    
HoloViews (34 2.1K) - With Holoviews, your data visualizes itself. BSD-3
  • GitHub ( 120 340 2.7K - 29% open 18.12.2021):

     git clone https://github.com/holoviz/holoviews
    
  • PyPi ( 220K / month 190 16.12.2021):

     pip install holoviews
    
  • Conda ( 630K 13.01.2022):

     conda install -c conda-forge holoviews
    
  • npm ( 1.7K / month 24.05.2020):

     npm install @pyviz/jupyterlab_pyviz
    
wordcloud (33 8.6K) - A little word cloud generator in Python. MIT
  • GitHub ( 64 2.1K 470 - 23% open 13.11.2021):

     git clone https://github.com/amueller/word_cloud
    
  • PyPi ( 500K / month 710 11.11.2020):

     pip install wordcloud
    
  • Conda ( 250K 15.11.2021):

     conda install -c conda-forge wordcloud
    
Perspective (32 4.1K) - A data visualization and analytics component, especially.. Apache-2
  • GitHub ( 65 430 220 490 - 14% open 06.01.2022):

     git clone https://github.com/finos/perspective
    
  • PyPi ( 3K / month 9 06.01.2022):

     pip install perspective-python
    
  • npm ( 1.7K / month 05.01.2022):

     npm install @finos/perspective-jupyterlab
    
VisPy (32 2.8K) - High-performance interactive 2D/3D data visualization library. BSD-3
  • GitHub ( 170 580 660 1.3K - 22% open 10.12.2021):

     git clone https://github.com/vispy/vispy
    
  • PyPi ( 35K / month 93 24.11.2021):

     pip install vispy
    
  • Conda ( 200K 24.11.2021):

     conda install -c conda-forge vispy
    
  • npm ( 13 / month 15.03.2020):

     npm install vispy
    
bqplot (31 3.2K) - Plotting library for IPython/Jupyter notebooks. Apache-2
  • GitHub ( 55 460 29 580 - 38% open 07.01.2022):

     git clone https://github.com/bqplot/bqplot
    
  • PyPi ( 47K / month 90 07.01.2022):

     pip install bqplot
    
  • Conda ( 910K 07.01.2022):

     conda install -c conda-forge bqplot
    
  • npm ( 17K / month 10 07.01.2022):

     npm install bqplot
    
PyVista (31 1.1K) - 3D plotting and mesh analysis through a streamlined interface for.. MIT
  • GitHub ( 68 210 390 560 660 - 26% open 11.01.2022):

     git clone https://github.com/pyvista/pyvista
    
  • PyPi ( 32K / month 85 11.01.2022):

     pip install pyvista
    
  • Conda ( 140K 11.01.2022):

     conda install -c conda-forge pyvista
    
FiftyOne (31 930) - Visualize, create, and debug image and video datasets.. Apache-2
  • GitHub ( 22 110 71 650 - 30% open 13.01.2022):

     git clone https://github.com/voxel51/fiftyone
    
  • PyPi ( 18K / month 1 13.01.2022):

     pip install fiftyone
    
datashader (30 2.7K) - Quickly and accurately render even the largest data. BSD-3
  • GitHub ( 45 330 930 480 - 26% open 25.12.2021):

     git clone https://github.com/holoviz/datashader
    
  • PyPi ( 46K / month 74 09.06.2021):

     pip install datashader
    
  • Conda ( 250K 10.06.2021):

     conda install -c conda-forge datashader
    
Cufflinks (30 2.5K ) - Productivity Tools for Plotly + Pandas. MIT
  • GitHub ( 38 580 5K 220 - 42% open 25.02.2021):

     git clone https://github.com/santosjorge/cufflinks
    
  • PyPi ( 230K / month 160 15.12.2021):

     pip install cufflinks
    
data-validation (30 600) - Library for exploring and validating machine learning.. Apache-2
  • GitHub ( 23 110 290 370 150 - 25% open 07.01.2022):

     git clone https://github.com/tensorflow/data-validation
    
  • PyPi ( 6.7M / month 24 01.12.2021):

     pip install tensorflow-data-validation
    
missingno (29 3K) - Missing data visualization module for Python. MIT
  • GitHub ( 17 380 5.9K 110 - 9% open 04.07.2021):

     git clone https://github.com/ResidentMario/missingno
    
  • PyPi ( 650K / month 110 04.07.2021):

     pip install missingno
    
  • Conda ( 140K 15.02.2020):

     conda install -c conda-forge missingno
    
Facets Overview (28 6.8K ) - Visualizations for machine learning datasets. Apache-2
  • GitHub ( 28 830 95 150 - 50% open 06.05.2021):

     git clone https://github.com/pair-code/facets
    
  • PyPi ( 100K / month 4 24.07.2019):

     pip install facets-overview
    
D-Tale (28 2.9K) - Visualizer for pandas data structures. LGPL-2.1
  • GitHub ( 19 220 280 440 - 9% open 13.01.2022):

     git clone https://github.com/man-group/dtale
    
  • PyPi ( 47K / month 11 18.11.2021):

     pip install dtale
    
  • Conda ( 100K 18.11.2021):

     conda install -c conda-forge dtale
    
hvPlot (27 500) - A high-level plotting API for pandas, dask, xarray, and networkx built on.. BSD-3
  • GitHub ( 33 62 970 400 - 35% open 16.12.2021):

     git clone https://github.com/holoviz/hvplot
    
  • PyPi ( 87K / month 55 09.12.2021):

     pip install hvplot
    
  • Conda ( 140K 23.07.2021):

     conda install -c conda-forge hvplot
    
openTSNE (26 940) - Extensible, parallel implementations of t-SNE. BSD-3
  • GitHub ( 10 99 270 99 - 4% open 25.10.2021):

     git clone https://github.com/pavlin-policar/openTSNE
    
  • PyPi ( 15K / month 8 25.10.2021):

     pip install opentsne
    
  • Conda ( 130K 13.11.2021):

     conda install -c conda-forge opentsne
    
pythreejs (26 780) - A Jupyter - Three.js bridge. BSD-3
  • GitHub ( 29 160 19 210 - 31% open 06.12.2021):

     git clone https://github.com/jupyter-widgets/pythreejs
    
  • PyPi ( 28K / month 34 26.02.2021):

     pip install pythreejs
    
  • Conda ( 360K 02.03.2021):

     conda install -c conda-forge pythreejs
    
  • npm ( 4.5K / month 7 26.02.2021):

     npm install jupyter-threejs
    
lets-plot (26 710) - An open-source plotting library for statistical data. MIT
  • GitHub ( 16 32 170 12 230 - 32% open 12.01.2022):

     git clone https://github.com/JetBrains/lets-plot
    
  • PyPi ( 1.5K / month 1 10.12.2021):

     pip install lets-plot
    
HyperTools (25 1.7K) - A Python toolbox for gaining geometric insights into high-dimensional.. MIT
  • GitHub ( 21 160 8 160 190 - 35% open 19.07.2021):

     git clone https://github.com/ContextLab/hypertools
    
  • PyPi ( 1.3K / month 9 15.06.2021):

     pip install hypertools
    
Chartify (24 3.1K ) - Python library that makes it easy for data scientists to create.. Apache-2
  • GitHub ( 21 270 61 72 - 56% open 05.02.2021):

     git clone https://github.com/spotify/chartify
    
  • PyPi ( 1.3K / month 5 02.11.2020):

     pip install chartify
    
  • Conda ( 18K 07.11.2020):

     conda install -c conda-forge chartify
    
HiPlot (24 2.2K) - HiPlot makes understanding high dimensional data easy. MIT
  • GitHub ( 7 110 3 71 - 14% open 05.11.2021):

     git clone https://github.com/facebookresearch/hiplot
    
  • PyPi ( 9.3K / month 9 05.11.2021):

     pip install hiplot
    
  • Conda ( 75K 05.11.2021):

     conda install -c conda-forge hiplot
    
AutoViz (24 600) - Automatically Visualize any dataset, any size with a single line of.. Apache-2
  • GitHub ( 11 91 130 46 - 13% open 25.12.2021):

     git clone https://github.com/AutoViML/AutoViz
    
  • PyPi ( 22K / month 3 25.12.2021):

     pip install autoviz
    
vega (24 320) - IPython/Jupyter notebook module for Vega and Vega-Lite. BSD-3
  • GitHub ( 10 55 95 - 14% open 02.01.2022):

     git clone https://github.com/vega/ipyvega
    
  • PyPi ( 13K / month 82 03.06.2021):

     pip install vega
    
  • Conda ( 470K 18.11.2021):

     conda install -c conda-forge vega
    
Pandas-Bokeh (23 740) - Bokeh Plotting Backend for Pandas and GeoPandas. MIT
  • GitHub ( 13 94 270 95 - 30% open 11.01.2022):

     git clone https://github.com/PatrikHlobil/Pandas-Bokeh
    
  • PyPi ( 10K / month 11 11.04.2021):

     pip install pandas-bokeh
    
joypy (23 390) - Joyplots in Python with matplotlib & pandas. MIT
  • GitHub ( 6 44 110 48 - 22% open 19.12.2021):

     git clone https://github.com/leotac/joypy
    
  • PyPi ( 20K / month 5 19.12.2021):

     pip install joypy
    
  • Conda ( 12K 28.12.2020):

     conda install -c conda-forge joypy
    
PDPbox (22 650 ) - python partial dependence plot toolbox. MIT
  • GitHub ( 7 100 470 58 - 34% open 14.03.2021):

     git clone https://github.com/SauceCat/PDPbox
    
  • PyPi ( 52K / month 25 14.03.2021):

     pip install pdpbox
    
  • Conda ( 10K 14.03.2021):

     conda install -c conda-forge pdpbox
    
python-ternary (22 510) - Ternary plotting library for python with matplotlib. MIT
  • GitHub ( 27 130 17 81 120 - 21% open 21.10.2021):

     git clone https://github.com/marcharper/python-ternary
    
  • PyPi ( 15K / month 21 17.02.2021):

     pip install python-ternary
    
  • Conda ( 61K 17.02.2021):

     conda install -c conda-forge python-ternary
    
Sweetviz (21 1.9K) - Visualize and compare datasets, target values and associations, with one.. MIT
  • GitHub ( 6 190 97 - 27% open 08.07.2021):

     git clone https://github.com/fbdesignpro/sweetviz
    
  • PyPi ( 52K / month 5 08.07.2021):

     pip install sweetviz
    
PyWaffle (21 460) - Make Waffle Charts in Python. MIT
  • GitHub ( 6 80 110 15 - 20% open 21.12.2021):

     git clone https://github.com/gyli/PyWaffle
    
  • PyPi ( 2.4K / month 1 21.12.2021):

     pip install pywaffle
    
Show 11 hidden projects...
  • plotnine (31 2.9K) - A grammar of graphics for Python. GPL-2.0
  • cartopy (31 980) - Cartopy - a cartographic python library with matplotlib support. LGPL-3.0
  • PandasGUI (25 2.5K) - A GUI for Pandas DataFrames. MIT-0
  • Multicore-TSNE (24 1.7K ) - Parallel t-SNE implementation with Python and Torch.. BSD-3
  • pivottablejs (22 450 ) - Dragndrop Pivot Tables and Charts for Jupyter/IPython.. MIT
  • ivis (19 250) - Dimensionality reduction in very large datasets using Siamese.. Apache-2
  • animatplot (17 390 ) - A python package for animating plots build on matplotlib. MIT
  • nx-altair (17 190 ) - Draw interactive NetworkX graphs with Altair. MIT
  • pdvega (16 340 ) - Interactive plotting for Pandas using Vega-Lite. MIT
  • data-describe (16 290) - datadescribe: Pythonic EDA Accelerator for Data Science. Apache-2
  • nptsne (15 27 ) - nptsne is a numpy compatible python binary package that offers a.. Apache-2

Text Data & NLP

Back to top

Libraries for processing, cleaning, manipulating, and analyzing text data as well as libraries for NLP tasks such as language detection, fuzzy matching, classification, seq2seq learning, conversational AI, keyword extraction, and translation.

transformers (47 57K) - Transformers: State-of-the-art Machine Learning for.. Apache-2
  • GitHub ( 1.1K 13K 1.4K 21K 8.6K - 4% open 13.01.2022):

     git clone https://github.com/huggingface/transformers
    
  • PyPi ( 2.8M / month 720 22.12.2021):

     pip install transformers
    
  • Conda ( 90K 23.12.2021):

     conda install -c conda-forge transformers
    
nltk (45 10K) - Suite of libraries and programs for symbolic and statistical natural.. Apache-2
  • GitHub ( 410 2.5K 130K 1.6K - 13% open 11.01.2022):

     git clone https://github.com/nltk/nltk
    
  • PyPi ( 9.3M / month 12K 28.12.2021):

     pip install nltk
    
  • Conda ( 1.1M 29.12.2021):

     conda install -c conda-forge nltk
    
spaCy (44 22K) - Industrial-strength Natural Language Processing (NLP) in Python. MIT
  • GitHub ( 640 3.7K 3.1K 34K 5K - 2% open 13.01.2022):

     git clone https://github.com/explosion/spaCy
    
  • PyPi ( 4M / month 2.2K 15.12.2021):

     pip install spacy
    
  • Conda ( 2.5M 14.12.2021):

     conda install -c conda-forge spacy
    
gensim (41 13K) - Topic Modelling for Humans. LGPL-2.1
  • GitHub ( 420 4.1K 3.5K 30K 1.7K - 21% open 24.12.2021):

     git clone https://github.com/RaRe-Technologies/gensim
    
  • PyPi ( 10M / month 2.8K 17.09.2021):

     pip install gensim
    
  • Conda ( 760K 09.11.2021):

     conda install -c conda-forge gensim
    
Rasa (39 13K ) - Open source machine learning framework to automate text- and.. Apache-2
  • GitHub ( 520 3.9K 6.5K - 15% open 12.01.2022):

     git clone https://github.com/RasaHQ/rasa
    
  • PyPi ( 210K / month 56 23.12.2021):

     pip install rasa
    
flair (38 11K) - A very simple framework for state-of-the-art Natural Language Processing.. MIT
  • GitHub ( 220 1.8K 1.1K 1.7K - 5% open 05.01.2022):

     git clone https://github.com/flairNLP/flair
    
  • PyPi ( 58K / month 66 18.11.2021):

     pip install flair
    
AllenNLP (37 11K) - An open-source NLP research library, built on PyTorch. Apache-2
  • GitHub ( 260 2.2K 43 2.2K 2.5K - 4% open 13.01.2022):

     git clone https://github.com/allenai/allennlp
    
  • PyPi ( 30K / month 180 13.01.2022):

     pip install allennlp
    
fairseq (36 15K) - Facebook AI Research Sequence-to-Sequence Toolkit written in Python. MIT
  • GitHub ( 370 3.8K 160 630 3.1K - 35% open 13.01.2022):

     git clone https://github.com/pytorch/fairseq
    
  • PyPi ( 27K / month 28 05.01.2021):

     pip install fairseq
    
ChatterBot (35 12K ) - ChatterBot is a machine learning, conversational dialog engine.. BSD-3
  • GitHub ( 100 3.9K 4.1K 1.5K - 19% open 01.06.2021):

     git clone https://github.com/gunthercox/ChatterBot
    
  • PyPi ( 28K / month 350 22.08.2020):

     pip install chatterbot
    
ParlAI (35 8.6K) - A framework for training and evaluating AI models on a variety of.. MIT
  • GitHub ( 170 1.7K 57 1.2K - 7% open 12.01.2022):

     git clone https://github.com/facebookresearch/ParlAI
    
  • PyPi ( 2.4K / month 3 12.10.2021):

     pip install parlai
    
TextBlob (34 8K) - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech.. MIT
  • GitHub ( 35 1.1K 97 17K 260 - 39% open 22.10.2021):

     git clone https://github.com/sloria/TextBlob
    
  • PyPi ( 750K / month 1.4K 15.12.2021):

     pip install textblob
    
  • Conda ( 150K 24.02.2019):

     conda install -c conda-forge textblob
    
sentence-transformers (34 6.8K) - Multilingual Sentence & Image Embeddings with BERT. Apache-2
  • GitHub ( 70 1.3K 2.2K 1.2K - 49% open 06.01.2022):

     git clone https://github.com/UKPLab/sentence-transformers
    
  • PyPi ( 530K / month 80 01.10.2021):

     pip install sentence-transformers
    
spark-nlp (34 2.6K) - State of the Art Natural Language Processing. Apache-2
  • GitHub ( 110 530 610 - 14% open 12.01.2022):

     git clone https://github.com/JohnSnowLabs/spark-nlp
    
  • PyPi ( 1.4M / month 8 05.01.2022):

     pip install spark-nlp
    
sentencepiece (33 5.6K ) - Unsupervised text tokenizer for Neural Network-based.. Apache-2
  • GitHub ( 57 740 19K 12K 490 - 9% open 02.07.2021):

     git clone https://github.com/google/sentencepiece
    
  • PyPi ( 3.4M / month 280 18.06.2021):

     pip install sentencepiece
    
  • Conda ( 140K 05.11.2021):

     conda install -c conda-forge sentencepiece
    
OpenNMT (33 5.4K ) - Open Source Neural Machine Translation in PyTorch. MIT
  • GitHub ( 170 2K 120 1.3K - 9% open 11.01.2022):

     git clone https://github.com/OpenNMT/OpenNMT-py
    
  • PyPi ( 15K / month 8 14.09.2021):

     pip install OpenNMT-py
    
Tokenizers (33 5.1K) - Fast State-of-the-Art Tokenizers optimized for Research and.. Apache-2
  • GitHub ( 50 420 40 560 - 28% open 12.01.2022):

     git clone https://github.com/huggingface/tokenizers
    
  • PyPi ( 3.6M / month 93 04.01.2022):

     pip install tokenizers
    
  • Conda ( 110K 05.01.2022):

     conda install -c conda-forge tokenizers
    
torchtext (33 2.9K) - Data loaders and abstractions for text and NLP. BSD-3
  • GitHub ( 120 680 680 - 50% open 12.01.2022):

     git clone https://github.com/pytorch/text
    
  • PyPi ( 110K / month 430 15.12.2021):

     pip install torchtext
    
TensorFlow Text (32 880) - Making text a first-class citizen in TensorFlow. Apache-2
  • GitHub ( 67 170 1.3K 180 - 31% open 06.01.2022):

     git clone https://github.com/tensorflow/text
    
  • PyPi ( 1.5M / month 66 19.11.2021):

     pip install tensorflow-text
    
snowballstemmer (32 540 ) - Snowball compiler and stemming algorithms. BSD-3
  • GitHub ( 28 150 4 69 - 36% open 17.12.2021):

     git clone https://github.com/snowballstem/snowball
    
  • PyPi ( 4.9M / month 6.7K 16.11.2021):

     pip install snowballstemmer
    
  • Conda ( 3.6M 17.11.2021):

     conda install -c conda-forge snowballstemmer
    
stanza (31 5.9K) - Official Stanford NLP Python Library for Many Human Languages. Apache-2
  • GitHub ( 41 750 810 620 - 11% open 18.11.2021):

     git clone https://github.com/stanfordnlp/stanza
    
  • PyPi ( 310K / month 49 05.10.2021):

     pip install stanza
    
  • Conda ( 4.6K 05.10.2021):

     conda install -c stanfordnlp stanza
    
DeepPavlov (31 5.6K) - An open source library for deep learning end-to-end dialog.. Apache-2
  • GitHub ( 67 990 240 610 - 18% open 16.12.2021):

     git clone https://github.com/deepmipt/DeepPavlov
    
  • PyPi ( 9.6K / month 6 16.12.2021):

     pip install deeppavlov
    
Dedupe (31 3.3K) - A python library for accurate and scalable fuzzy matching, record.. MIT
  • GitHub ( 62 450 210 670 - 9% open 09.01.2022):

     git clone https://github.com/dedupeio/dedupe
    
  • PyPi ( 250K / month 47 17.04.2021):

     pip install dedupe
    
textacy (31 1.9K) - NLP, before and after spaCy. Apache-2
  • GitHub ( 31 230 240 - 10% open 06.12.2021):

     git clone https://github.com/chartbeat-labs/textacy
    
  • PyPi ( 25K / month 100 06.12.2021):

     pip install textacy
    
  • Conda ( 100K 13.04.2021):

     conda install -c conda-forge textacy
    
jellyfish (31 1.6K) - a python library for doing approximate and phonetic matching of.. BSD-2
  • GitHub ( 25 150 3.1K 110 - 7% open 07.01.2022):

     git clone https://github.com/jamesturk/jellyfish
    
  • PyPi ( 1.6M / month 400 07.01.2022):

     pip install jellyfish
    
  • Conda ( 170K 09.01.2022):

     conda install -c conda-forge jellyfish
    
nlpaug (30 2.8K) - Data augmentation for NLP. MIT
  • GitHub ( 26 320 230 160 - 14% open 04.01.2022):

     git clone https://github.com/makcedward/nlpaug
    
  • PyPi ( 30K / month 14 23.12.2021):

     pip install nlpaug
    
NeMo (29 3.8K) - NeMo: a toolkit for conversational AI. Apache-2
  • GitHub ( 120 850 18K 920 - 7% open 13.01.2022):

     git clone https://github.com/NVIDIA/NeMo
    
  • PyPi ( 7.8K / month 7 04.12.2021):

     pip install nemo-toolkit
    
haystack (29 3.6K) - Haystack is an open source NLP framework that leverages Transformer.. Apache-2
  • GitHub ( 94 610 99 1.1K - 14% open 12.01.2022):

     git clone https://github.com/deepset-ai/haystack
    
  • PyPi ( 1.2K / month 85 15.12.2021):

     pip install haystack
    
ftfy (29 3.2K ) - Fixes mojibake and other glitches in Unicode text, after the fact. MIT
  • GitHub ( 18 110 4.6K 120 - 9% open 17.05.2021):

     git clone https://github.com/LuminosoInsight/python-ftfy
    
  • PyPi ( 1.1M / month 480 24.05.2021):

     pip install ftfy
    
  • Conda ( 140K 25.05.2021):

     conda install -c conda-forge ftfy
    
Sumy (29 2.7K) - Module for automatic summarization of text documents and HTML pages. Apache-2
  • GitHub ( 21 460 1.1K 96 - 15% open 23.11.2021):

     git clone https://github.com/miso-belica/sumy
    
  • PyPi ( 20K / month 100 21.10.2021):

     pip install sumy
    
TextDistance (29 2.6K) - Compute distance between sequences. 30+ algorithms, pure python.. MIT
  • GitHub ( 11 200 440 1.5K 29.11.2021):

     git clone https://github.com/life4/textdistance
    
  • PyPi ( 260K / month 39 27.10.2021):

     pip install textdistance
    
  • Conda ( 69K 27.10.2021):

     conda install -c conda-forge textdistance
    
GluonNLP (29 2.4K) - Toolkit that enables easy text preprocessing, datasets loading.. Apache-2
  • GitHub ( 82 520 690 560 - 46% open 24.08.2021):

     git clone https://github.com/dmlc/gluon-nlp
    
  • PyPi ( 100K / month 22 13.08.2020):

     pip install gluonnlp
    
PyText (28 6.3K) - A natural language modeling framework based on PyTorch. BSD-3
  • GitHub ( 220 800 280 100 220 - 66% open 09.01.2022):

     git clone https://github.com/facebookresearch/pytext
    
  • PyPi ( 210 / month 1 08.06.2020):

     pip install pytext-nlp
    
T5 (28 3.9K) - Code for the paper Exploring the Limits of Transfer Learning with a.. Apache-2
  • GitHub ( 44 530 78 400 - 16% open 11.01.2022):

     git clone https://github.com/google-research/text-to-text-transfer-transformer
    
  • PyPi ( 5.7K / month 2 18.10.2021):

     pip install t5
    
CLTK (28 700) - The Classical Language Toolkit. MIT
  • GitHub ( 110 300 22 190 520 - 5% open 05.01.2022):

     git clone https://github.com/cltk/cltk
    
  • PyPi ( 1.6K / month 42 05.01.2022):

     pip install cltk
    
vaderSentiment (27 3.3K ) - VADER Sentiment Analysis. VADER (Valence Aware Dictionary.. MIT
  • GitHub ( 10 830 3.4K 110 - 30% open 15.03.2021):

     git clone https://github.com/cjhutto/vaderSentiment
    
  • PyPi ( 210K / month 170 22.05.2020):

     pip install vadersentiment
    
neuralcoref (27 2.5K ) - Fast Coreference Resolution in spaCy with Neural Networks. MIT
  • GitHub ( 21 430 300 440 290 - 16% open 22.06.2021):

     git clone https://github.com/huggingface/neuralcoref
    
  • PyPi ( 28K / month 14 08.04.2019):

     pip install neuralcoref
    
  • Conda ( 10K 21.02.2020):

     conda install -c conda-forge neuralcoref
    
fastNLP (27 2.5K) - fastNLP: A Modularized and Extensible NLP Framework. Currently still.. Apache-2
  • GitHub ( 54 400 65 55 180 - 19% open 06.12.2021):

     git clone https://github.com/fastnlp/fastNLP
    
  • PyPi ( 1.3K / month 3 04.02.2019):

     pip install fastnlp
    
PyTextRank (27 1.7K) - Python implementation of TextRank for phrase extraction and.. MIT
  • GitHub ( 18 320 220 79 - 25% open 01.01.2022):

     git clone https://github.com/DerwenAI/pytextrank
    
  • PyPi ( 16K / month 11 10.10.2021):

     pip install pytextrank
    
spacy-transformers (27 1.1K) - Use pretrained transformers like BERT, XLNet and GPT-2.. MIT spacy
  • GitHub ( 18 130 360 16.12.2021):

     git clone https://github.com/explosion/spacy-transformers
    
  • PyPi ( 63K / month 13 07.12.2021):

     pip install spacy-transformers
    
Ciphey (26 9.3K) - Automatically decrypt encryptions without knowing the key or cipher,.. MIT
  • GitHub ( 46 580 280 - 17% open 03.11.2021):

     git clone https://github.com/Ciphey/Ciphey
    
  • PyPi ( 10K / month 06.06.2021):

     pip install ciphey
    
  • Docker Hub ( 14K 5 16.11.2021):

     docker pull remnux/ciphey
    
Snips NLU (26 3.6K ) - Snips Python library to extract meaning from text. Apache-2
  • GitHub ( 22 500 250 - 22% open 03.05.2021):

     git clone https://github.com/snipsco/snips-nlu
    
  • PyPi ( 3.1K / month 11 15.01.2020):

     pip install snips-nlu
    
scattertext (26 1.7K) - Beautiful visualizations of how language differs among document.. Apache-2
  • GitHub ( 12 240 250 85 - 20% open 15.11.2021):

     git clone https://github.com/JasonKessler/scattertext
    
  • PyPi ( 3.3K / month 10 15.11.2021):

     pip install scattertext
    
  • Conda ( 59K 15.11.2021):

     conda install -c conda-forge scattertext
    
SciSpacy (26 1.1K) - A full spaCy pipeline and models for scientific/biomedical documents. Apache-2
  • GitHub ( 21 140 380 230 - 15% open 15.07.2021):

     git clone https://github.com/allenai/scispacy
    
  • PyPi ( 19K / month 11 12.02.2021):

     pip install scispacy
    
english-words (24 6K) - A text file containing 479k English words for all your.. Unlicense
  • GitHub ( 26 1.2K 73 - 61% open 20.10.2021):

     git clone https://github.com/dwyl/english-words
    
  • PyPi ( 7.3K / month 4 02.08.2021):

     pip install english-words
    
MatchZoo (24 3.6K ) - Facilitating the design, comparison and sharing of deep.. Apache-2
  • GitHub ( 36 910 10 460 - 6% open 02.06.2021):

     git clone https://github.com/NTMC-Community/MatchZoo
    
  • PyPi ( 120 / month 24.10.2019):

     pip install matchzoo
    
pytorch-nlp (24 2K) - Basic Utilities for PyTorch Natural Language Processing (NLP). BSD-3
  • GitHub ( 18 240 320 67 - 26% open 10.07.2021):

     git clone https://github.com/PetrochukM/PyTorch-NLP
    
  • PyPi ( 6.7K / month 17 04.11.2019):

     pip install pytorch-nlp
    
Sockeye (24 1K) - Sequence-to-sequence framework with a focus on Neural Machine.. Apache-2
  • GitHub ( 54 300 14 270 - 3% open 11.01.2022):

     git clone https://github.com/awslabs/sockeye
    
  • PyPi ( 1.1K / month 2 20.12.2021):

     pip install sockeye
    
Kashgari (23 2.2K) - Kashgari is a production-level NLP Transfer learning framework.. Apache-2
  • GitHub ( 21 420 49 360 - 9% open 09.07.2021):

     git clone https://github.com/BrikerMan/Kashgari
    
  • PyPi ( 78 / month 2 18.10.2019):

     pip install kashgari-tf
    
lightseq (23 1.9K) - LightSeq: A High Performance Library for Sequence Processing and.. Apache-2
  • GitHub ( 7 200 550 140 - 51% open 10.01.2022):

     git clone https://github.com/bytedance/lightseq
    
  • PyPi ( 1.3K / month 30.11.2021):

     pip install lightseq
    
FARM (23 1.4K) - Fast & easy transfer learning for NLP. Harvesting language models.. Apache-2
  • GitHub ( 37 210 420 - 8% open 23.11.2021):

     git clone https://github.com/deepset-ai/FARM
    
  • PyPi ( 2.9K / month 2 10.06.2021):

     pip install farm
    
sense2vec (23 1.3K) - Contextually-keyed word vectors. MIT
  • GitHub ( 17 220 24K 110 110 - 17% open 16.08.2021):

     git clone https://github.com/explosion/sense2vec
    
  • PyPi ( 3.4K / month 8 19.04.2021):

     pip install sense2vec
    
  • Conda ( 24K 14.07.2021):

     conda install -c conda-forge sense2vec
    
gpt-2-simple (22 2.8K) - Python package to easily retrain OpenAIs GPT-2 text-.. MIT
  • GitHub ( 18 590 280 240 - 61% open 18.10.2021):

     git clone https://github.com/minimaxir/gpt-2-simple
    
  • PyPi ( 4.7K / month 5 18.10.2021):

     pip install gpt-2-simple
    
NLP Architect (22 2.8K) - A model library for exploring state-of-the-art deep learning.. Apache-2
  • GitHub ( 37 440 8 130 - 15% open 12.09.2021):

     git clone https://github.com/IntelLabs/nlp-architect
    
  • PyPi ( 320 / month 12.04.2020):

     pip install nlp-architect
    
Texthero (22 2.4K) - Text preprocessing, representation and visualization from zero to hero. MIT
  • GitHub ( 18 210 87 140 - 56% open 19.07.2021):

     git clone https://github.com/jbesomi/texthero
    
  • PyPi ( 9.2K / month 4 01.07.2021):

     pip install texthero
    
fast-bert (22 1.7K) - Super easy library for BERT based NLP models. Apache-2
  • GitHub ( 35 330 250 - 61% open 10.01.2022):

     git clone https://github.com/utterworks/fast-bert
    
  • PyPi ( 1.9K / month 2 10.01.2022):

     pip install fast-bert
    
YouTokenToMe (22 780 ) - Unsupervised text tokenizer focused on computational efficiency. MIT
  • GitHub ( 6 57 200 51 - 54% open 28.01.2021):

     git clone https://github.com/vkcom/youtokentome
    
  • PyPi ( 32K / month 15 12.02.2020):

     pip install youtokentome
    
pySBD (22 400 ) - pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence.. MIT
  • GitHub ( 6 46 260 60 - 20% open 11.02.2021):

     git clone https://github.com/nipunsadvilkar/pySBD
    
  • PyPi ( 32K / month 3 11.02.2021):

     pip install pysbd
    
DeepMatcher (21 4K ) - Python package for performing Entity and Text Matching using.. BSD-3
  • GitHub ( 7 1.5K 14 76 - 71% open 13.06.2021):

     git clone https://github.com/anhaidgroup/deepmatcher
    
  • PyPi ( 500 / month 13.06.2021):

     pip install deepmatcher
    
anaGo (21 1.4K ) - Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition,.. MIT
  • GitHub ( 11 360 27 110 - 33% open 01.04.2021):

     git clone https://github.com/Hironsan/anago
    
  • PyPi ( 400 / month 5 17.07.2018):

     pip install anago
    
jiant (21 1.4K) - jiant is an nlp toolkit. MIT
  • GitHub ( 56 260 2 550 - 11% open 31.12.2021):

     git clone https://github.com/nyu-mll/jiant
    
  • PyPi ( 120 / month 10.05.2021):

     pip install jiant
    
finetune (21 660) - Scikit-learn style model finetuning for NLP. MPL-2.0
  • GitHub ( 19 71 9 140 - 15% open 20.12.2021):

     git clone https://github.com/IndicoDataSolutions/finetune
    
  • PyPi ( 210 / month 2 20.12.2021):

     pip install finetune
    
TextBox (18 320) - TextBox is an open-source library for building text generation system. MIT
  • GitHub ( 12 57 5 18 - 16% open 26.12.2021):

     git clone https://github.com/RUCAIBox/TextBox
    
  • PyPi ( 51 / month 15.04.2021):

     pip install textbox
    
OpenNRE (17 3.5K) - An Open-Source Package for Neural Relation Extraction (NRE). MIT
  • GitHub ( 10 910 340 - 6% open 09.12.2021):

     git clone https://github.com/thunlp/OpenNRE
    
Camphr (16 340) - Camphr - NLP libary for creating pipeline components. Apache-2 spacy
  • GitHub ( 7 17 27 - 7% open 18.08.2021):

     git clone https://github.com/PKSHATechnology-Research/camphr
    
  • PyPi ( 230 / month 2 28.07.2021):

     pip install camphr
    
Translate (15 720) - Translate - a PyTorch Language Library. BSD-3
  • GitHub ( 87 170 93 - 70% open 06.10.2021):

     git clone https://github.com/pytorch/translate
    
  • PyPi ( 4 / month 01.05.2018):

     pip install pytorch-translate
    
VizSeq (14 380) - An Analysis Toolkit for Natural Language Generation (Translation,.. MIT
  • GitHub ( 3 44 3 16 - 43% open 02.09.2021):

     git clone https://github.com/facebookresearch/vizseq
    
  • PyPi ( 73 / month 07.08.2020):

     pip install vizseq
    
BLINK (13 780 ) - Entity Linker solution. MIT
  • GitHub ( 16 140 77 - 62% open 02.04.2021):

     git clone https://github.com/facebookresearch/BLINK
    
Show 23 hidden projects...
  • fastText (33 23K ) - Library for fast text representation and classification. MIT
  • fuzzywuzzy (33 8.6K) - Fuzzy String Matching in Python. GPL-2.0
  • langid (27 1.9K ) - Stand-alone language identification system. BSD-3
  • polyglot (26 1.9K ) - Multilingual text (NLP) processing toolkit. GPL-3.0
  • flashtext (25 5K ) - Extract Keywords from sentence or Replace keywords in sentences. MIT
  • textgenrnn (25 4.6K ) - Easily train your own text-generating neural network of any.. MIT
  • Texar (23 2.2K ) - Toolkit for Machine Learning, Natural Language Processing, and.. Apache-2
  • stop-words (23 140 ) - Get list of common stop words in various languages in Python. BSD-3
  • DELTA (21 1.5K ) - DELTA is a deep learning based natural language and speech.. Apache-2
  • textpipe (20 290 ) - Textpipe: clean and extract metadata from text. MIT
  • pyfasttext (20 230 ) - Yet another Python binding for fastText. GPL-3.0
  • fastT5 (19 240) - boost inference speed of T5 models by 5x & reduce the model size by 3x. Apache-2
  • skift (18 220) - scikit-learn wrappers for Python fastText. MIT
  • NeuroNER (17 1.6K ) - Named-entity recognition using neural networks. Easy-to-use and.. MIT
  • nboost (17 600 ) - NBoost is a scalable, search-api-boosting platform for deploying.. Apache-2
  • textaugment (17 200) - TextAugment: Text Augmentation Library. MIT
  • NeuralQA (15 220 ) - NeuralQA: A Usable Library for Question Answering on Large Datasets.. MIT
  • Headliner (14 230 ) - Easy training and deployment of seq2seq models. MIT
  • numerizer (14 130) - A Python module to convert natural language numerics into ints and.. MIT
  • TransferNLP (13 290 ) - NLP library designed for reproducible experimentation.. MIT
  • ONNX-T5 (13 190 ) - Summarization, translation, sentiment-analysis, text-generation.. Apache-2
  • textvec (13 180 ) - Text vectorization tool to outperform TFIDF for classification.. MIT
  • spacy-dbpedia-spotlight (9 48) - A spaCy wrapper for DBpedia Spotlight. MIT spacy

Image Data

Back to top

Libraries for image & video processing, manipulation, and augmentation as well as libraries for computer vision tasks such as facial recognition, object detection, and classification.

Pillow (45 9.3K) - The friendly PIL fork (Python Imaging Library). PIL
  • GitHub ( 380 1.8K 2.4K - 6% open 11.01.2022):

     git clone https://github.com/python-pillow/Pillow
    
  • PyPi ( 31M / month 62K 02.01.2022):

     pip install Pillow
    
  • Conda ( 12M 10.11.2021):

     conda install -c conda-forge pillow
    
scikit-image (44 4.7K) - Image processing in Python. BSD-2
  • GitHub ( 540 1.9K 90K 2.3K - 13% open 11.01.2022):

     git clone https://github.com/scikit-image/scikit-image
    
  • PyPi ( 4.9M / month 9K 15.12.2021):

     pip install scikit-image
    
  • Conda ( 3.1M 17.12.2021):

     conda install -c conda-forge scikit-image
    
torchvision (41 11K) - Datasets, Transforms and Models specific to Computer Vision. BSD-3
  • GitHub ( 450 5.4K 2.1K - 27% open 13.01.2022):

     git clone https://github.com/pytorch/vision
    
  • PyPi ( 2.9M / month 3.4K 15.12.2021):

     pip install torchvision
    
  • Conda ( 160K 27.09.2021):

     conda install -c conda-forge torchvision
    
imageio (37 980 ) - Python library for reading and writing image data. BSD-2
  • GitHub ( 83 190 100 54K 410 - 17% open 11.01.2022):

     git clone https://github.com/imageio/imageio
    
  • PyPi ( 16M / month 2.5K 23.12.2021):

     pip install imageio
    
  • Conda ( 2.5M 29.12.2021):

     conda install -c conda-forge imageio
    
detectron2 (35 20K) - Detectron2 is a platform for object detection, segmentation.. Apache-2
  • GitHub ( 200 5K 470 2.8K - 5% open 13.01.2022):

     git clone https://github.com/facebookresearch/detectron2
    
  • Conda ( 37K 11.01.2022):

     conda install -c conda-forge detectron2
    
PyTorch Image Models (35 16K) - PyTorch image models, scripts, pretrained weights --.. Apache-2
  • GitHub ( 64 2.5K 830K 1.7K 440 - 12% open 08.01.2022):

     git clone https://github.com/rwightman/pytorch-image-models
    
Albumentations (35 9.5K) - Fast image augmentation library and an easy-to-use wrapper.. MIT
  • GitHub ( 100 1.2K 6.2K 570 - 41% open 24.12.2021):

     git clone https://github.com/albumentations-team/albumentations
    
  • PyPi ( 190K / month 180 04.10.2021):

     pip install albumentations
    
  • Conda ( 30K 15.07.2021):

     conda install -c conda-forge albumentations
    
MoviePy (35 8.9K) - Video editing with Python. MIT
  • GitHub ( 140 1.2K 13K 1.1K - 30% open 12.11.2021):

     git clone https://github.com/Zulko/moviepy
    
  • PyPi ( 1.5M / month 720 15.12.2021):

     pip install moviepy
    
  • Conda ( 100K 23.02.2020):

     conda install -c conda-forge moviepy
    
InsightFace (34 11K) - State-of-the-art 2D and 3D Face Analysis Project. MIT
  • GitHub ( 35 3.5K 130 1.8K - 54% open 12.01.2022):

     git clone https://github.com/deepinsight/insightface
    
  • PyPi ( 24K / month 5 21.09.2021):

     pip install insightface
    
Kornia (34 5.7K) - Open Source Differentiable Computer Vision Library. Apache-2
  • GitHub ( 140 560 180 900 530 - 26% open 12.01.2022):

     git clone https://github.com/kornia/kornia
    
  • PyPi ( 260K / month 44 03.12.2021):

     pip install kornia
    
opencv-python (34 2.5K) - Automated CI toolchain to produce precompiled opencv-python,.. MIT
  • GitHub ( 36 480 500 - 5% open 27.12.2021):

     git clone https://github.com/opencv/opencv-python
    
  • PyPi ( 4.1M / month 8.7K 29.12.2021):

     pip install opencv-python
    
Face Recognition (33 43K ) - The worlds simplest facial recognition api for Python.. MIT
  • GitHub ( 47 12K 450 1.2K - 54% open 14.06.2021):

     git clone https://github.com/ageitgey/face_recognition
    
  • PyPi ( 46K / month 210 21.08.2018):

     pip install face_recognition
    
MMDetection (33 18K) - OpenMMLab Detection Toolbox and Benchmark. Apache-2
  • GitHub ( 290 6.4K 230 5K - 8% open 30.12.2021):

     git clone https://github.com/open-mmlab/mmdetection
    
Wand (33 1.1K) - The ctypes-based simple ImageMagick binding for Python. MIT
  • GitHub ( 97 190 5.5K 8.4K 360 - 4% open 20.11.2021):

     git clone https://github.com/emcconville/wand
    
  • PyPi ( 290K / month 680 17.08.2021):

     pip install wand
    
GluonCV (32 5K) - Gluon CV Toolkit. Apache-2
  • GitHub ( 120 1.1K 650 810 - 7% open 09.01.2022):

     git clone https://github.com/dmlc/gluon-cv
    
  • PyPi ( 510K / month 59 13.01.2022):

     pip install gluoncv
    
imageai (30 6.8K ) - A python library built to empower developers to build applications.. MIT
  • GitHub ( 15 1.9K 690K 1K 670 - 37% open 08.05.2021):

     git clone https://github.com/OlafenwaMoses/ImageAI
    
  • PyPi ( 7.8K / month 16 05.01.2021):

     pip install imageai
    
imutils (30 3.9K ) - A series of convenience functions to make basic image processing.. MIT
  • GitHub ( 20 940 22K 160 - 52% open 15.01.2021):

     git clone https://github.com/jrosebr1/imutils
    
  • PyPi ( 340K / month 760 15.01.2021):

     pip install imutils
    
  • Conda ( 75K 09.12.2021):

     conda install -c conda-forge imutils
    
PaddleSeg (30 3.5K) - Easy-to-use image segmentation library with awesome pre-.. Apache-2
  • GitHub ( 60 780 440 840 - 49% open 05.01.2022):

     git clone https://github.com/PaddlePaddle/PaddleSeg
    
  • PyPi ( 1.1K / month 2 13.10.2021):

     pip install paddleseg
    
deepface (30 3K) - A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender,.. MIT
  • GitHub ( 20 690 370 380 - 1% open 13.01.2022):

     git clone https://github.com/serengil/deepface
    
  • PyPi ( 38K / month 3 12.01.2022):

     pip install deepface
    
ImageHash (30 2.2K) - A Python Perceptual Image Hashing Module. BSD-2
  • GitHub ( 20 280 4.1K 100 - 12% open 07.09.2021):

     git clone https://github.com/JohannesBuchner/imagehash
    
  • PyPi ( 1.1M / month 320 15.07.2021):

     pip install ImageHash
    
  • Conda ( 160K 15.07.2021):

     conda install -c conda-forge imagehash
    
vit-pytorch (28 8.2K) - Implementation of Vision Transformer, a simple way to achieve.. MIT
  • GitHub ( 14 1.3K 63 160 - 49% open 06.01.2022):

     git clone https://github.com/lucidrains/vit-pytorch
    
  • PyPi ( 8K / month 1 06.01.2022):

     pip install vit-pytorch
    
PaddleDetection (28 6.1K) - Object Detection toolkit based on PaddlePaddle. It.. Apache-2
  • GitHub ( 78 1.5K 7 2.9K - 30% open 12.01.2022):

     git clone https://github.com/PaddlePaddle/PaddleDetection
    
Face Alignment (27 5.5K) - 2D and 3D Face alignment library build using pytorch. BSD-3
  • GitHub ( 23 1.1K 270 - 19% open 04.08.2021):

     git clone https://github.com/1adrianb/face-alignment
    
  • PyPi ( 6.8K / month 8 14.09.2021):

     pip install face-alignment
    
vidgear (27 2.1K) - A High-performance cross-platform Video Processing Python framework.. Apache-2
  • GitHub ( 9 160 510 160 190 - 1% open 05.12.2021):

     git clone https://github.com/abhiTronix/vidgear
    
  • PyPi ( 3.1K / month 3 05.12.2021):

     pip install vidgear
    
mahotas (27 730) - Computer Vision in Python. MIT
  • GitHub ( 32 140 730 77 - 20% open 07.12.2021):

     git clone https://github.com/luispedro/mahotas
    
  • PyPi ( 8.8K / month 110 14.10.2021):

     pip install mahotas
    
  • Conda ( 310K 17.11.2021):

     conda install -c conda-forge mahotas
    
MMF (26 4.8K) - A modular framework for vision & language multimodal research from.. BSD-3
  • GitHub ( 89 780 10 610 - 31% open 11.01.2022):

     git clone https://github.com/facebookresearch/mmf
    
  • PyPi ( 450 / month 1 12.06.2020):

     pip install mmf
    
Augmentor (26 4.6K) - Image augmentation library in Python for machine learning. MIT
  • GitHub ( 22 830 390 190 - 63% open 15.10.2021):

     git clone https://github.com/mdbloice/Augmentor
    
  • PyPi ( 7.6K / month 29 14.10.2021):

     pip install Augmentor
    
mtcnn (26 1.7K) - MTCNN face detection implementation for TensorFlow, as a PIP package. MIT
  • GitHub ( 15 430 1.8K 99 - 62% open 09.07.2021):

     git clone https://github.com/ipazc/mtcnn
    
  • PyPi ( 33K / month 43 09.07.2021):

     pip install mtcnn
    
lightly (26 1.4K) - A python library for self-supervised learning on images. MIT
  • GitHub ( 14 93 26 300 - 21% open 06.01.2022):

     git clone https://github.com/lightly-ai/lightly
    
  • PyPi ( 1.8K / month 1 04.01.2022):

     pip install lightly
    
pyvips (26 380) - python binding for libvips using cffi. MIT
  • GitHub ( 12 33 260 260 - 38% open 15.12.2021):

     git clone https://github.com/libvips/pyvips
    
  • PyPi ( 13K / month 28 20.11.2021):

     pip install pyvips
    
  • Conda ( 14K 30.12.2021):

     conda install -c conda-forge pyvips
    
facenet-pytorch (25 2.6K) - Pretrained Pytorch face detection (MTCNN) and facial.. MIT
  • GitHub ( 14 560 180K 600 140 - 38% open 13.12.2021):

     git clone https://github.com/timesler/facenet-pytorch
    
  • PyPi ( 12K / month 7 10.03.2021):

     pip install facenet-pytorch
    
tensorflow-graphics (25 2.6K) - TensorFlow Graphics: Differentiable Graphics Layers.. Apache-2
  • GitHub ( 34 340 220 - 59% open 06.01.2022):

     git clone https://github.com/tensorflow/graphics
    
  • PyPi ( 2.6K / month 4 03.12.2021):

     pip install tensorflow-graphics
    
pytorchvideo (25 2.2K) - A deep learning library for video understanding research. Apache-2
  • GitHub ( 24 210 130 - 43% open 11.01.2022):

     git clone https://github.com/facebookresearch/pytorchvideo
    
  • PyPi ( 11K / month 3 14.09.2021):

     pip install pytorchvideo
    
CellProfiler (25 640) - An open-source application for biological image analysis. BSD-3
  • GitHub ( 120 290 2.1K 6 3K - 6% open 05.11.2021):

     git clone https://github.com/CellProfiler/CellProfiler
    
  • PyPi ( 590 / month 04.09.2017):

     pip install cellprofiler
    
Image Super-Resolution (24 3.4K ) - Super-scale your images and run experiments with.. Apache-2
  • GitHub ( 10 580 69 190 - 42% open 02.06.2021):

     git clone https://github.com/idealo/image-super-resolution
    
  • PyPi ( 5.7K / month 5 08.01.2020):

     pip install ISR
    
  • Docker Hub ( 200 01.04.2019):

     docker pull idealo/image-super-resolution-gpu
    
vissl (24 2.3K) - VISSL is FAIRs library of extensible, modular and scalable components.. MIT
  • GitHub ( 30 220 4 130 - 33% open 10.01.2022):

     git clone https://github.com/facebookresearch/vissl
    
  • PyPi ( 200 / month 1 02.11.2021):

     pip install vissl
    
deep-daze (23 4K) - Simple command line tool for text to image generation using OpenAIs CLIP.. MIT
  • GitHub ( 13 280 32 160 - 53% open 19.10.2021):

     git clone https://github.com/lucidrains/deep-daze
    
  • PyPi ( 18K / month 19.10.2021):

     pip install deep-daze
    
layout-parser (23 2.8K) - A Unified Toolkit for Deep Learning Based Document Image.. Apache-2
  • GitHub ( 7 260 39 75 - 52% open 12.01.2022):

     git clone https://github.com/Layout-Parser/layout-parser
    
  • PyPi ( 3.7K / month 1 23.09.2021):

     pip install layoutparser
    
Classy Vision (22 1.4K) - An end-to-end PyTorch framework for image and video.. MIT
  • GitHub ( 67 240 110 - 46% open 28.12.2021):

     git clone https://github.com/facebookresearch/ClassyVision
    
  • PyPi ( 770 / month 2 09.07.2021):

     pip install classy_vision
    
  • Conda ( 11K 11.12.2020):

     conda install -c conda-forge classy_vision
    
image-match (20 2.7K) - Quickly search over billions of images. Apache-2
  • GitHub ( 19 380 100 - 53% open 21.09.2021):

     git clone https://github.com/ProvenanceLabs/image-match
    
  • PyPi ( 780 / month 4 13.02.2017):

     pip install image_match
    
Norfair (20 1.2K) - Lightweight Python library for adding real-time object tracking to any.. BSD-3
  • GitHub ( 9 97 40 - 22% open 01.10.2021):

     git clone https://github.com/tryolabs/norfair
    
  • PyPi ( 3K / month 1 29.07.2021):

     pip install norfair
    
DETR (18 8.2K) - End-to-End Object Detection with Transformers. Apache-2
  • GitHub ( 24 1.4K 400 - 33% open 18.10.2021):

     git clone https://github.com/facebookresearch/detr
    
PySlowFast (18 4.5K) - PySlowFast: video understanding codebase from FAIR for.. Apache-2
  • GitHub ( 25 850 6 480 - 50% open 28.10.2021):

     git clone https://github.com/facebookresearch/SlowFast
    
Caer (18 580) - A lightweight Computer Vision library. Scale your models, not boilerplate. MIT
  • GitHub ( 8 100 19 15 - 13% open 13.10.2021):

     git clone https://github.com/jasmcaus/caer
    
  • PyPi ( 3.8K / month 1 13.10.2021):

     pip install caer
    
pycls (16 1.8K) - Codebase for Image Classification Research, written in PyTorch. MIT
  • GitHub ( 13 210 4 77 - 27% open 19.08.2021):

     git clone https://github.com/facebookresearch/pycls
    
Show 11 hidden projects...
  • imgaug (35 12K ) - Image augmentation for machine learning experiments. MIT
  • glfw (35 8.5K) - A multi-platform library for OpenGL, OpenGL ES, Vulkan, window and input. Zlib
  • Pillow-SIMD (31 1.7K) - The friendly PIL fork. PIL
  • PyTorch3D (28 5.5K) - PyTorch3D is FAIRs library of reusable components for.. Unlicensed
  • chainercv (27 1.5K ) - ChainerCV: a Library for Deep Learning in Computer Vision. MIT
  • segmentation_models (25 3.6K ) - Segmentation models with pretrained backbones. Keras.. MIT
  • Image Deduplicator (23 3.9K ) - Finding duplicate images made easy!. Apache-2
  • nude.py (22 830 ) - Nudity detection with Python. MIT
  • Luminoth (21 2.4K ) - Deep Learning toolkit for Computer Vision. BSD-3
  • solt (16 250 ) - Streaming over lightweight data transformations. MIT
  • Torch Points 3D (13 1 ) - Pytorch framework for doing deep learning on point.. BSD-3

Graph Data

Back to top

Libraries for graph processing, clustering, embedding, and machine learning tasks.

networkx (44 10K) - Network Analysis in Python. BSD-3
  • GitHub ( 560 2.5K 57 95K 2.8K - 11% open 12.01.2022):

     git clone https://github.com/networkx/networkx
    
  • PyPi ( 14M / month 13K 15.12.2021):

     pip install networkx
    
  • Conda ( 5.3M 26.10.2021):

     conda install -c conda-forge networkx
    
PyTorch Geometric (35 14K) - Graph Neural Network Library for PyTorch. MIT
  • GitHub ( 230 2.3K 2.4K - 36% open 12.01.2022):

     git clone https://github.com/rusty1s/pytorch_geometric
    
  • PyPi ( 40K / month 31 22.12.2021):

     pip install torch-geometric
    
dgl (35 8.8K) - Python package built to ease deep learning on graph, on top of existing.. Apache-2
  • GitHub ( 180 1.9K 1.3K - 22% open 13.01.2022):

     git clone https://github.com/dmlc/dgl
    
  • PyPi ( 71K / month 36 27.05.2021):

     pip install dgl
    
StellarGraph (27 2.3K) - StellarGraph - Machine Learning on Graphs. Apache-2
  • GitHub ( 36 330 110 980 - 25% open 29.10.2021):

     git clone https://github.com/stellargraph/stellargraph
    
  • PyPi ( 12K / month 3 30.06.2020):

     pip install stellargraph
    
ogb (27 1.2K) - Benchmark datasets, data loaders, and evaluators for graph machine learning. MIT
  • GitHub ( 18 250 200 07.01.2022):

     git clone https://github.com/snap-stanford/ogb
    
  • PyPi ( 7.1K / month 13 29.09.2021):

     pip install ogb
    
Paddle Graph Learning (27 1.2K) - Paddle Graph Learning (PGL) is an efficient and.. Apache-2
  • GitHub ( 21 190 23 100 - 36% open 29.12.2021):

     git clone https://github.com/PaddlePaddle/PGL
    
  • PyPi ( 1.3K / month 2 20.12.2021):

     pip install pgl
    
Spektral (25 2K) - Graph Neural Networks with Keras and Tensorflow 2. MIT
  • GitHub ( 19 260 91 200 - 18% open 26.10.2021):

     git clone https://github.com/danielegrattarola/spektral
    
  • PyPi ( 3.6K / month 2 23.08.2021):

     pip install spektral
    
pygraphistry (25 1.5K) - PyGraphistry is a Python library to quickly load, shape,.. BSD-3
  • GitHub ( 19 150 54 180 - 40% open 22.12.2021):

     git clone https://github.com/graphistry/pygraphistry
    
  • PyPi ( 2.2K / month 4 07.12.2021):

     pip install graphistry
    
PyKEEN (25 660) - A Python library for learning and evaluating knowledge graph embeddings. MIT
  • GitHub ( 24 96 92 340 - 33% open 11.01.2022):

     git clone https://github.com/pykeen/pykeen
    
  • PyPi ( 670 / month 3 11.01.2022):

     pip install pykeen
    
Node2Vec (24 830) - Implementation of the node2vec algorithm. MIT
  • GitHub ( 9 190 09.10.2021):

     git clone https://github.com/eliorc/node2vec
    
  • PyPi ( 520K / month 14 09.10.2021):

     pip install node2vec
    
  • Conda ( 20K 25.04.2020):

     conda install -c conda-forge node2vec
    
PyTorch-BigGraph (22 3K) - Generate embeddings from large-scale graph-structured.. BSD-3
  • GitHub ( 24 400 120 170 - 31% open 27.10.2021):

     git clone https://github.com/facebookresearch/PyTorch-BigGraph
    
  • PyPi ( 1.2K / month 3 01.05.2019):

     pip install torchbiggraph
    
AmpliGraph (22 1.7K ) - Python library for Representation Learning on Knowledge.. Apache-2
  • GitHub ( 19 190 17 200 - 9% open 25.05.2021):

     git clone https://github.com/Accenture/AmpliGraph
    
  • PyPi ( 720 / month 25.05.2021):

     pip install ampligraph
    
pytorch_geometric_temporal (22 1.3K) - PyTorch Geometric Temporal: Spatiotemporal Signal.. MIT
  • GitHub ( 15 180 74 - 6% open 03.01.2022):

     git clone https://github.com/benedekrozemberczki/pytorch_geometric_temporal
    
  • PyPi ( 830 / month 1 31.12.2021):

     pip install torch-geometric-temporal
    
graph4nlp (21 1.2K) - Graph4nlp is the library for the easy use of Graph Neural.. Apache-2
  • GitHub ( 22 140 97 - 5% open 27.09.2021):

     git clone https://github.com/graph4ai/graph4nlp
    
  • PyPi ( 85 / month 29.09.2021):

     pip install graph4nlp
    
torch-cluster (20 460) - PyTorch Extension Library of Optimized Graph Cluster.. MIT
  • GitHub ( 19 88 93 - 9% open 28.12.2021):

     git clone https://github.com/rusty1s/pytorch_cluster
    
  • PyPi ( 7.7K / month 26 01.03.2021):

     pip install torch-cluster
    
AutoGL (19 750) - An autoML framework & toolkit for machine learning on graphs. Apache-2
  • GitHub ( 13 77 18 - 27% open 31.12.2021):

     git clone https://github.com/THUMNLab/AutoGL
    
  • PyPi ( 35 / month 23.12.2020):

     pip install auto-graph-learning
    
OpenKE (15 2.9K ) - An Open-Source Package for Knowledge Embedding (KE). MIT
  • GitHub ( 10 840 380 - 17% open 06.04.2021):

     git clone https://github.com/thunlp/OpenKE
    
kglib (15 480) - Grakn Knowledge Graph Library (ML R&D). Apache-2
  • GitHub ( 7 86 210 61 - 19% open 22.10.2021):

     git clone https://github.com/vaticle/kglib
    
  • PyPi ( 89 / month 19.08.2020):

     pip install grakn-kglib
    
GraphVite (12 990 ) - GraphVite: A General and High-performance Graph Embedding.. Apache-2
  • GitHub ( 1 130 92 - 39% open 14.01.2021):

     git clone https://github.com/DeepGraphLearning/graphvite
    
  • Conda ( 4.2K 19.03.2020):

     conda install -c milagraph graphvite
    
Show 13 hidden projects...
  • igraph (31 920) - Python interface for igraph. GPL-2.0
  • pygal (29 2.4K) - PYthon svg GrAph plotting Library. LGPL-3.0
  • Karate Club (24 1.5K) - Karate Club: An API Oriented Open-source Python Framework for.. GPL-3.0
  • DeepWalk (21 2.4K ) - DeepWalk - Deep Learning for Graphs. GPL-3.0
  • graph-nets (20 5.1K ) - Build Graph Nets in Tensorflow. Apache-2
  • DIG (20 1K) - A library for graph deep learning research. GPL-3.0
  • Sematch (17 370 ) - semantic similarity framework for knowledge graph. Apache-2
  • DeepGraph (17 250 ) - Analyze Data with Pandas-based Networks. Documentation:. BSD-3
  • Euler (16 2.7K ) - A distributed graph deep learning framework. Apache-2
  • GraphEmbedding (16 2.5K ) - Implementation and experiments of graph embedding.. MIT
  • pyRDF2Vec (16 140) - Python Implementation and Extension of RDF2Vec. MIT
  • GraphSAGE (14 2.6K ) - Representation learning on large graphs using stochastic.. MIT
  • OpenNE (14 1.5K ) - An Open-Source Package for Network Embedding (NE). MIT

Audio Data

Back to top

Libraries for audio analysis, manipulation, transformation, and extraction, as well as speech recognition and music generation tasks.

espnet (35 4.6K) - End-to-End Speech Processing Toolkit. Apache-2
  • GitHub ( 220 1.4K 74 30 1.6K - 16% open 13.01.2022):

     git clone https://github.com/espnet/espnet
    
  • PyPi ( 2.9K / month 1 31.12.2021):

     pip install espnet
    
librosa (33 4.9K) - Python library for audio and music analysis. ISC
  • GitHub ( 93 780 930 - 3% open 12.01.2022):

     git clone https://github.com/librosa/librosa
    
  • PyPi ( 500K / month 1.2K 26.05.2021):

     pip install librosa
    
  • Conda ( 420K 26.05.2021):

     conda install -c conda-forge librosa
    
Magenta (32 17K ) - Magenta: Music and Art Generation with Machine Intelligence. Apache-2
  • GitHub ( 150 3.5K 330 890 - 36% open 30.06.2021):

     git clone https://github.com/magenta/magenta
    
  • PyPi ( 6.6K / month 36 12.11.2020):

     pip install magenta
    
Pydub (32 5.8K ) - Manipulate audio with a simple and easy high level interface. MIT
  • GitHub ( 90 770 10K 460 - 44% open 08.06.2021):

     git clone https://github.com/jiaaro/pydub
    
  • PyPi ( 940K / month 900 10.03.2021):

     pip install pydub
    
  • Conda ( 20K 13.03.2021):

     conda install -c conda-forge pydub
    
torchaudio (32 1.5K ) - Data manipulation and transformation for audio signal.. BSD-2
  • GitHub ( 150 370 590 - 25% open 08.01.2022):

     git clone https://github.com/pytorch/audio
    
  • PyPi ( 240K / month 100 15.12.2021):

     pip install torchaudio
    
speechbrain (31 3.6K) - A PyTorch-based Speech Toolkit. Apache-2
  • GitHub ( 140 640 110 560 - 19% open 11.01.2022):

     git clone https://github.com/speechbrain/speechbrain
    
  • PyPi ( 4.8K / month 2 20.12.2021):

     pip install speechbrain
    
SpeechRecognition (30 6K) - Speech recognition module for Python, supporting several.. BSD-3
  • GitHub ( 41 2K 510 - 45% open 14.12.2021):

     git clone https://github.com/Uberi/speech_recognition
    
  • PyPi ( 250K / month 670 05.12.2017):

     pip install SpeechRecognition
    
  • Conda ( 130K 13.12.2021):

     conda install -c conda-forge speechrecognition
    
pyAudioAnalysis (30 4.5K) - Python Audio Analysis Library: Feature Extraction,.. Apache-2
  • GitHub ( 26 1.1K 250 290 - 60% open 09.01.2022):

     git clone https://github.com/tyiannak/pyAudioAnalysis
    
  • PyPi ( 14K / month 19 10.01.2022):

     pip install pyAudioAnalysis
    
Coqui TTS (29 3.5K) - - a deep learning toolkit for Text-to-Speech, battle-.. MPL-2.0
  • GitHub ( 82 290 63K 200 - 10% open 03.01.2022):

     git clone https://github.com/coqui-ai/TTS
    
  • PyPi ( 3.8K / month 14.07.2017):

     pip install tts
    
audioread (29 390) - cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding.. MIT
  • GitHub ( 21 89 7.1K 77 - 40% open 03.12.2021):

     git clone https://github.com/beetbox/audioread
    
  • PyPi ( 470K / month 320 20.10.2020):

     pip install audioread
    
  • Conda ( 380K 08.01.2022):

     conda install -c conda-forge audioread
    
spleeter (28 18K) - Deezer source separation library including pretrained models. MIT
  • GitHub ( 18 2K 1.4M 620 - 16% open 08.12.2021):

     git clone https://github.com/deezer/spleeter
    
  • PyPi ( 10K / month 3 03.09.2021):

     pip install spleeter
    
  • Conda ( 62K 30.06.2020):

     conda install -c conda-forge spleeter
    
Porcupine (28 2.6K) - On-device wake word detection powered by deep learning. Apache-2
  • GitHub ( 30 380 7 360 - 1% open 11.01.2022):

     git clone https://github.com/Picovoice/Porcupine
    
  • PyPi ( 1.7K / month 8 21.12.2021):

     pip install pvporcupine
    
Madmom (28 850) - Python audio and music signal processing library. BSD-3
  • GitHub ( 20 150 180 250 - 21% open 06.01.2022):

     git clone https://github.com/CPJKU/madmom
    
  • PyPi ( 8.8K / month 27 14.11.2018):

     pip install madmom
    
tinytag (27 500) - Read audio and music meta data and duration of MP3, OGG, OPUS, MP4, M4A,.. MIT
  • GitHub ( 21 87 460 87 - 13% open 17.12.2021):

     git clone https://github.com/devsnd/tinytag
    
  • PyPi ( 14K / month 67 14.12.2021):

     pip install tinytag
    
DDSP (26 2K) - DDSP: Differentiable Digital Signal Processing. Apache-2
  • GitHub ( 29 210 19 130 - 17% open 21.12.2021):

     git clone https://github.com/magenta/ddsp
    
  • PyPi ( 2.6K / month 1 24.12.2021):

     pip install ddsp
    
kapre (25 800) - kapre: Keras Audio Preprocessors. MIT
  • GitHub ( 13 140 19 1.3K 93 - 11% open 14.11.2021):

     git clone https://github.com/keunwoochoi/kapre
    
  • PyPi ( 1.5K / month 14 14.11.2021):

     pip install kapre
    
python-soundfile (25 430) - SoundFile is an audio library based on libsndfile, CFFI, and.. BSD-3
  • GitHub ( 23 59 2.8K 160 - 38% open 07.12.2021):

     git clone https://github.com/bastibe/python-soundfile
    
  • PyPi ( 730K / month 540 27.11.2019):

     pip install soundfile
    
audiomentations (24 830) - A Python library for audio data augmentation. Inspired by.. MIT
  • GitHub ( 20 100 100 91 - 27% open 13.01.2022):

     git clone https://github.com/iver56/audiomentations
    
  • PyPi ( 2.5K / month 18.11.2021):

     pip install audiomentations
    
DeepSpeech (22 18K) - DeepSpeech is an open source embedded (offline, on-device).. MPL-2.0
  • GitHub ( 140 3.3K):

     git clone https://github.com/mozilla/DeepSpeech
    
  • PyPi ( 8.6K / month 35 19.12.2020):

     pip install deepspeech
    
TTS (22 5.5K ) - Deep learning for Text to Speech (Discussion forum:.. MPL-2.0
  • GitHub ( 56 860 1.6K 520 - 2% open 12.02.2021):

     git clone https://github.com/mozilla/TTS
    
nnAudio (21 640) - Audio processing by using pytorch 1D convolution network. MIT
  • GitHub ( 13 61 39 48 - 22% open 24.12.2021):

     git clone https://github.com/KinWaiCheuk/nnAudio
    
  • PyPi ( 850 / month 1 24.12.2021):

     pip install nnAudio
    
Show 6 hidden projects...
  • aubio (29 2.6K) - a library for audio and music analysis. GPL-3.0
  • Essentia (28 2K) - C++ library for audio and music analysis, description and.. AGPL-3.0
  • python_speech_features (24 2K ) - This library provides common speech features for ASR.. MIT
  • Dejavu (22 5.6K ) - Audio fingerprinting and recognition in Python. MIT
  • Muda (19 200 ) - A library for augmenting annotated audio data. ISC
  • Julius (16 250) - Fast PyTorch based DSP for audio and 1D signals. MIT

Geospatial Data

Back to top

Libraries to load, process, analyze, and write geographic data as well as libraries for spatial analysis, map visualization, and geocoding.

pydeck (42 9.4K) - WebGL2 powered visualization framework. MIT
  • GitHub ( 190 1.7K 3.4K 2.3K - 5% open 13.01.2022):

     git clone https://github.com/visgl/deck.gl
    
  • PyPi ( 620K / month 15 25.10.2021):

     pip install pydeck
    
  • Conda ( 65K 26.10.2021):

     conda install -c conda-forge pydeck
    
  • npm ( 200K / month 380 30.12.2021):

     npm install deck.gl
    
Shapely (38 2.6K) - Manipulation and analysis of geometric objects. BSD-3
  • GitHub ( 130 440 25K 820 - 17% open 13.01.2022):

     git clone https://github.com/Toblerity/Shapely
    
  • PyPi ( 5.5M / month 3.6K 25.10.2021):

     pip install shapely
    
  • Conda ( 3.1M 20.11.2021):

     conda install -c conda-forge shapely
    
Rasterio (37 1.7K) - Rasterio reads and writes geospatial raster datasets. BSD-3
  • GitHub ( 120 460 740 4.2K 1.5K - 9% open 06.01.2022):

     git clone https://github.com/mapbox/rasterio
    
  • PyPi ( 620K / month 730 15.10.2021):

     pip install rasterio
    
  • Conda ( 1.4M 04.01.2022):

     conda install -c conda-forge rasterio
    
folium (36 5.6K) - Python Data. Leaflet.js Maps. MIT
  • GitHub ( 130 2K 14K 920 - 22% open 08.01.2022):

     git clone https://github.com/python-visualization/folium
    
  • PyPi ( 680K / month 630 19.11.2021):

     pip install folium
    
  • Conda ( 500K 03.12.2021):

     conda install -c conda-forge folium
    
GeoPandas (36 2.9K) - Python tools for geographic data. BSD-3
  • GitHub ( 160 660 1.4K 12K 1.3K - 32% open 12.01.2022):

     git clone https://github.com/geopandas/geopandas
    
  • PyPi ( 1.8M / month 1.1K 16.10.2021):

     pip install geopandas
    
  • Conda ( 1.3M 01.12.2021):

     conda install -c conda-forge geopandas
    
pyproj (35 700) - Python interface to PROJ (cartographic projections and coordinate.. MIT
  • GitHub ( 45 170 12K 460 - 1% open 12.01.2022):

     git clone https://github.com/pyproj4/pyproj
    
  • PyPi ( 3.5M / month 1.6K 18.11.2021):

     pip install pyproj
    
  • Conda ( 2.8M 03.01.2022):

     conda install -c conda-forge pyproj
    
geopy (33 3.5K) - Geocoding library for Python. MIT
  • GitHub ( 120 560 32K 250 - 9% open 26.09.2021):

     git clone https://github.com/geopy/geopy
    
  • PyPi ( 2.8M / month 3.9K 11.07.2021):

     pip install geopy
    
  • Conda ( 640K 12.07.2021):

     conda install -c conda-forge geopy
    
ipyleaflet (32 1.2K) - A Jupyter - Leaflet.js bridge. MIT
  • GitHub ( 73 320 1.4K 470 - 38% open 22.12.2021):

     git clone https://github.com/jupyter-widgets/ipyleaflet
    
  • PyPi ( 43K / month 100 06.12.2021):

     pip install ipyleaflet
    
  • Conda ( 790K 09.12.2021):

     conda install -c conda-forge ipyleaflet
    
  • npm ( 36K / month 2 06.12.2021):

     npm install jupyter-leaflet
    
Fiona (32 880) - Fiona reads and writes geographic data files. BSD-3
  • GitHub ( 65 180 7.5K 650 - 12% open 28.12.2021):

     git clone https://github.com/Toblerity/Fiona
    
  • PyPi ( 2M / month 770 31.05.2021):

     pip install fiona
    
  • Conda ( 2.5M 01.12.2021):

     conda install -c conda-forge fiona
    
ArcGIS API (29 1.2K) - Documentation and samples for ArcGIS API for Python. Apache-2
  • GitHub ( 74 830 1.4K 400 - 27% open 12.01.2022):

     git clone https://github.com/Esri/arcgis-python-api
    
  • PyPi ( 48K / month 18 04.10.2021):

     pip install arcgis
    
  • Docker Hub ( 5.5K 33 05.10.2021):

     docker pull esridocker/arcgis-api-python-notebook
    
geojson (29 680) - Python bindings and utilities for GeoJSON. BSD-3
  • GitHub ( 46 89 8.4K 79 - 26% open 03.01.2022):

     git clone https://github.com/jazzband/geojson
    
  • PyPi ( 520K / month 1.1K 09.08.2019):

     pip install geojson
    
  • Conda ( 470K 11.08.2019):

     conda install -c conda-forge geojson
    
GeoViews (26 390) - Simple, concise geographical visualization in Python. BSD-3
  • GitHub ( 25 69 300 - 35% open 25.12.2021):

     git clone https://github.com/holoviz/geoviews
    
  • PyPi ( 8.8K / month 25 25.12.2021):

     pip install geoviews
    
  • Conda ( 90K 13.01.2022):

     conda install -c conda-forge geoviews
    
EarthPy (26 320) - A package built to support working with spatial data using open source.. BSD-3
  • GitHub ( 40 120 120 220 - 7% open 20.12.2021):

     git clone https://github.com/earthlab/earthpy
    
  • PyPi ( 5.7K / month 7 01.10.2021):

     pip install earthpy
    
  • Conda ( 40K 04.10.2021):

     conda install -c conda-forge earthpy
    
PySAL (25 950) - PySAL: Python Spatial Analysis Library Meta-Package. BSD-3
  • GitHub ( 74 270 600 - 1% open 18.10.2021):

     git clone https://github.com/pysal/pysal
    
  • PyPi ( 23K / month 30 01.08.2021):

     pip install pysal
    
  • Conda ( 430K 02.08.2021):

     conda install -c conda-forge pysal
    
Mapbox GL (24 590 ) - Use Mapbox GL JS to visualize data in a Python Jupyter notebook. MIT
  • GitHub ( 21 130 120 100 - 34% open 19.04.2021):

     git clone https://github.com/mapbox/mapboxgl-jupyter
    
  • PyPi ( 13K / month 14 02.06.2019):

     pip install mapboxgl
    
Show 6 hidden projects...
  • Geocoder (33 1.4K ) - Python Geocoder. MIT
  • Satpy (30 790) - Python package for earth-observing satellite data processing. GPL-3.0
  • Sentinelsat (29 700) - Search and download Copernicus Sentinel satellite images. GPL-3.0
  • gmaps (24 730 ) - Google maps for Jupyter notebooks. BSD-3
  • pymap3d (23 220) - pure-Python (Numpy optional) 3D coordinate conversions for geospace ecef.. BSD-2
  • geoplotlib (21 950 ) - python toolbox for visualizing geographical data and making maps. MIT

Financial Data

Back to top

Libraries for algorithmic stock/crypto trading, risk analytics, backtesting, technical analysis, and other tasks on financial data.

yfinance (36 6.3K) - Download market data from Yahoo! Finances API. Apache-2
  • GitHub ( 51 1.5K 8.9K 720 - 55% open 12.01.2022):

     git clone https://github.com/ranaroussi/yfinance
    
  • PyPi ( 280K / month 120 12.01.2022):

     pip install yfinance
    
  • Conda ( 24K 10.07.2021):

     conda install -c ranaroussi yfinance
    
Qlib (30 7.8K) - Qlib is an AI-oriented quantitative investment platform, which aims to.. MIT
  • GitHub ( 78 1.3K 270 9 420 - 32% open 13.01.2022):

     git clone https://github.com/microsoft/qlib
    
  • PyPi ( 3K / month 07.12.2021):

     pip install pyqlib
    
ta (30 2.7K) - Technical Analysis Library using Pandas and Numpy. MIT
  • GitHub ( 24 650 940 200 - 53% open 09.01.2022):

     git clone https://github.com/bukosabino/ta
    
  • PyPi ( 52K / month 27 09.01.2022):

     pip install ta
    
IB-insync (29 1.7K) - Python sync/async framework for Interactive Brokers API. BSD-2
  • GitHub ( 29 460 360 - 1% open 06.01.2022):

     git clone https://github.com/erdewit/ib_insync
    
  • PyPi ( 8.9K / month 19 28.11.2021):

     pip install ib_insync
    
  • Conda ( 14K 29.11.2021):

     conda install -c conda-forge ib-insync
    
Alpha Vantage (28 3.6K ) - A python wrapper for Alpha Vantage API for financial data. MIT
  • GitHub ( 39 630 260 - 5% open 14.06.2021):

     git clone https://github.com/RomelTorres/alpha_vantage
    
  • PyPi ( 31K / month 100 26.08.2018):

     pip install alpha_vantage
    
TensorTrade (26 3.7K) - An open source reinforcement learning framework for training,.. Apache-2
  • GitHub ( 58 830 29 200 - 12% open 04.01.2022):

     git clone https://github.com/tensortrade-org/tensortrade
    
  • PyPi ( 950 / month 1 10.05.2021):

     pip install tensortrade
    
Enigma Catalyst (26 2.3K) - An Algorithmic Trading Library for Crypto-Assets in Python. Apache-2
  • GitHub ( 150 680 23 480 - 25% open 22.09.2021):

     git clone https://github.com/enigmampc/catalyst
    
  • PyPi ( 1.3K / month 2 11.11.2018):

     pip install enigma-catalyst
    
stockstats (26 930) - Supply a wrapper ``StockDataFrame`` based on the.. BSD-3
  • GitHub ( 8 250 380 79 - 3% open 07.01.2022):

     git clone https://github.com/jealous/stockstats
    
  • PyPi ( 11K / month 29 07.01.2022):

     pip install stockstats
    
bt (25 1.3K ) - bt - flexible backtesting for Python. MIT
  • GitHub ( 24 290 91 270 - 17% open 15.05.2021):

     git clone https://github.com/pmorissette/bt
    
  • PyPi ( 6.2K / month 21 21.04.2021):

     pip install bt
    
ffn (24 1.1K ) - ffn - a financial function library for Python. MIT
  • GitHub ( 26 200 160 96 - 16% open 24.04.2021):

     git clone https://github.com/pmorissette/ffn
    
  • PyPi ( 27K / month 25 21.04.2021):

     pip install ffn
    
Crypto Signals (21 3.7K ) - Github.com/CryptoSignal - #1 Quant Trading & Technical.. MIT
  • GitHub ( 28 990 250 - 21% open 28.06.2021):

     git clone https://github.com/CryptoSignal/crypto-signal
    
  • Docker Hub ( 140K 7 03.09.2020):

     docker pull shadowreaver/crypto-signal
    
tf-quant-finance (21 2.9K) - High-performance TensorFlow library for quantitative.. Apache-2
  • GitHub ( 36 400 38 - 50% open 12.01.2022):

     git clone https://github.com/google/tf-quant-finance
    
  • PyPi ( 590 / month 2 07.01.2022):

     pip install tf-quant-finance
    
finmarketpy (20 2.8K) - Python library for backtesting trading strategies & analyzing.. Apache-2
  • GitHub ( 14 450 40 4 26 - 88% open 07.10.2021):

     git clone https://github.com/cuemacro/finmarketpy
    
  • PyPi ( 110 / month 07.10.2021):

     pip install finmarketpy
    
Show 10 hidden projects...

Time Series Data

Back to top

Libraries for forecasting, anomaly detection, feature extraction, and machine learning on time-series and sequential data.

sktime (33 4.8K) - A unified framework for machine learning with time series. BSD-3
  • GitHub ( 140 720 64 320 870 - 34% open 12.01.2022):

     git clone https://github.com/alan-turing-institute/sktime
    
  • PyPi ( 120K / month 19 08.12.2021):

     pip install sktime
    
Prophet (32 14K) - Tool for producing high quality forecasts for time series data that has.. MIT
  • GitHub ( 140 3.9K 640 1.7K - 9% open 07.01.2022):

     git clone https://github.com/facebook/prophet
    
  • PyPi ( 1.1M / month 120 05.09.2020):

     pip install fbprophet
    
tsfresh (31 6.1K) - Automatic extraction of relevant features from time series:. MIT
  • GitHub ( 82 960 480 - 8% open 21.12.2021):

     git clone https://github.com/blue-yonder/tsfresh
    
  • PyPi ( 260K / month 55 21.12.2021):

     pip install tsfresh
    
  • Conda ( 81K 21.12.2021):

     conda install -c conda-forge tsfresh
    
tslearn (31 2K) - A machine learning toolkit dedicated to time-series data. BSD-2
  • GitHub ( 36 250 380 250 - 28% open 06.12.2021):

     git clone https://github.com/tslearn-team/tslearn
    
  • PyPi ( 71K / month 19 16.08.2021):

     pip install tslearn
    
  • Conda ( 250K 16.08.2021):

     conda install -c conda-forge tslearn
    
pmdarima (31 1.1K) - A statistical library designed to fill the void in Pythons time series.. MIT
  • GitHub ( 19 190 1.7K 260 - 8% open 04.01.2022):

     git clone https://github.com/alkaline-ml/pmdarima
    
  • PyPi ( 930K / month 44 05.11.2021):

     pip install pmdarima
    
pytorch-forecasting (29 1.6K) - Time series forecasting with PyTorch. MIT
  • GitHub ( 29 240 370 - 37% open 12.01.2022):

     git clone https://github.com/jdb78/pytorch-forecasting
    
  • PyPi ( 18K / month 4 29.11.2021):

     pip install pytorch-forecasting
    
Darts (28 3.3K) - A python library for easy manipulation and forecasting of time series. Apache-2
  • GitHub ( 41 300 24 310 - 37% open 13.01.2022):

     git clone https://github.com/unit8co/darts
    
  • PyPi ( 2.7K / month 2 24.12.2021):

     pip install u8darts
    
  • Docker Hub ( 240 24.12.2021):

     docker pull unit8/darts
    
GluonTS (28 2.4K) - Probabilistic time series modeling in Python. Apache-2
  • GitHub ( 82 500 700 - 39% open 06.01.2022):

     git clone https://github.com/awslabs/gluon-ts
    
  • PyPi ( 60K / month 3 11.11.2021):

     pip install gluonts
    
STUMPY (28 2K) - STUMPY is a powerful and scalable Python library for modern time series.. BSD-3
  • GitHub ( 26 190 280 - 11% open 09.01.2022):

     git clone https://github.com/TDAmeritrade/stumpy
    
  • PyPi ( 260K / month 4 24.12.2021):

     pip install stumpy
    
  • Conda ( 35K 24.12.2021):

     conda install -c conda-forge stumpy
    
Streamz (27 1K) - Real-time stream processing for python. BSD-3
  • GitHub ( 45 130 260 250 - 41% open 24.12.2021):

     git clone https://github.com/python-streamz/streamz
    
  • PyPi ( 8.5K / month 29 04.10.2021):

     pip install streamz
    
  • Conda ( 230K 04.10.2021):

     conda install -c conda-forge streamz
    
pyts (25 1.1K) - A Python package for time series classification. BSD-3
  • GitHub ( 10 120 160 57 - 57% open 09.12.2021):

     git clone https://github.com/johannfaouzi/pyts
    
  • PyPi ( 84K / month 11 31.10.2021):

     pip install pyts
    
  • Conda ( 9.8K 31.10.2021):

     conda install -c conda-forge pyts
    
greykite (20 1.4K) - A flexible, intuitive and fast forecasting library. BSD-2
  • GitHub ( 7 63 8 57 - 12% open 15.12.2021):

     git clone https://github.com/linkedin/greykite
    
  • PyPi ( 6.9K / month 15.12.2021):

     pip install greykite
    
TSFEL (20 360) - An intuitive library to extract features from time series. BSD-3
  • GitHub ( 13 49 25 48 - 10% open 23.12.2021):

     git clone https://github.com/fraunhoferportugal/tsfel
    
  • PyPi ( 3.3K / month 14.02.2021):

     pip install tsfel
    
seglearn (19 480 ) - Python module for machine learning time series:. BSD-3
  • GitHub ( 13 52 11 28 - 17% open 12.03.2021):

     git clone https://github.com/dmbee/seglearn
    
  • PyPi ( 1K / month 1 13.03.2021):

     pip install seglearn
    
Auto TS (19 370) - Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost.. Apache-2
  • GitHub ( 6 67 61 - 14% open 27.12.2021):

     git clone https://github.com/AutoViML/Auto_TS
    
  • PyPi ( 2.2K / month 27.12.2021):

     pip install auto-ts
    
atspy (14 400) - AtsPy: Automated Time Series Models in Python (by @firmai). MIT
  • GitHub ( 5 80 5 20 - 90% open 18.12.2021):

     git clone https://github.com/firmai/atspy
    
  • PyPi ( 1.1K / month 24.04.2020):

     pip install atspy
    
Show 7 hidden projects...
  • PyFlux (24 1.9K ) - Open source time series library for Python. BSD-3
  • luminol (21 990 ) - Anomaly Detection and Correlation library. Apache-2
  • pydlm (20 400 ) - A python library for Bayesian time series modeling. BSD-3
  • tick (20 370 ) - Module for statistical learning, with a particular emphasis on time-.. BSD-3
  • matrixprofile-ts (19 670 ) - A Python library for detecting patterns and anomalies.. Apache-2
  • ADTK (18 770 ) - A Python toolkit for rule-based/unsupervised anomaly detection in time.. MPL-2.0
  • tsaug (14 220 ) - A Python package for time series augmentation. Apache-2

Medical Data

Back to top

Libraries for processing and analyzing medical data such as MRIs, EEGs, genomic data, and other medical imaging formats.

MNE (37 1.8K) - MNE: Magnetoencephalography (MEG) and Electroencephalography (EEG) in Python. BSD-3
  • GitHub ( 280 970 1.3K 3.9K - 9% open 12.01.2022):

     git clone https://github.com/mne-tools/mne-python
    
  • PyPi ( 24K / month 190 01.12.2021):

     pip install mne
    
  • Conda ( 180K 18.12.2021):

     conda install -c conda-forge mne
    
Nilearn (35 800) - Machine learning for NeuroImaging in Python. BSD-3
  • GitHub ( 180 440 14 1.3K 1.6K - 17% open 10.01.2022):

     git clone https://github.com/nilearn/nilearn
    
  • PyPi ( 16K / month 230 16.09.2021):

     pip install nilearn
    
  • Conda ( 130K 16.09.2021):

     conda install -c conda-forge nilearn
    
NIPYPE (34 610) - Workflows and interfaces for neuroimaging packages. Apache-2
  • GitHub ( 230 480 820 1.3K - 28% open 15.12.2021):

     git clone https://github.com/nipy/nipype
    
  • PyPi ( 29K / month 150 20.10.2021):

     pip install nipype
    
  • Conda ( 460K 20.10.2021):

     conda install -c conda-forge nipype
    
MONAI (33 2.7K) - AI Toolkit for Healthcare Imaging. Apache-2
  • GitHub ( 84 510 180 1.4K - 10% open 13.01.2022):

     git clone https://github.com/Project-MONAI/MONAI
    
  • PyPi ( 20K / month 11 10.01.2022):

     pip install monai
    
Hail (33 770) - Scalable genomic data analysis. MIT
  • GitHub ( 77 200 47 2K - 2% open 13.01.2022):

     git clone https://github.com/hail-is/hail
    
  • PyPi ( 15K / month 5 21.12.2021):

     pip install hail
    
Lifelines (31 1.8K) - Survival analysis in Python. MIT
  • GitHub ( 98 450 750 840 - 25% open 30.11.2021):

     git clone https://github.com/CamDavidsonPilon/lifelines
    
  • PyPi ( 310K / month 98 30.11.2021):

     pip install lifelines
    
  • Conda ( 180K 01.12.2021):

     conda install -c conda-forge lifelines
    
DIPY (31 480) - DIPY is the paragon 3D/4D+ imaging library in Python. Contains generic.. BSD-3
  • GitHub ( 130 330 490 760 - 16% open 03.12.2021):

     git clone https://github.com/dipy/dipy
    
  • PyPi ( 7.3K / month 80 06.05.2021):

     pip install dipy
    
  • Conda ( 280K 06.05.2021):

     conda install -c conda-forge dipy
    
NiBabel (31 460) - Python package to access a cacophony of neuro-imaging file formats. MIT
  • GitHub ( 93 220 6.1K 440 - 29% open 30.09.2021):

     git clone https://github.com/nipy/nibabel
    
  • PyPi ( 140K / month 960 28.11.2020):

     pip install nibabel
    
  • Conda ( 400K 29.11.2020):

     conda install -c conda-forge nibabel
    
DeepVariant (27 2.4K) - DeepVariant is an analysis pipeline that uses a deep neural.. BSD-3
  • GitHub ( 21 580 3.8K 450 - 1% open 10.12.2021):

     git clone https://github.com/google/deepvariant
    
  • Conda ( 37K 16.12.2021):

     conda install -c bioconda deepvariant
    
NIPY (24 310 ) - Neuroimaging in Python FMRI analysis package. BSD-3
  • GitHub ( 63 130 160 - 28% open 29.03.2021):

     git clone https://github.com/nipy/nipy
    
  • PyPi ( 1.1K / month 47 19.02.2018):

     pip install nipy
    
  • Conda ( 90K 04.05.2020):

     conda install -c conda-forge nipy
    
MedicalTorch (16 760 ) - A medical imaging framework for Pytorch. Apache-2
  • GitHub ( 8 110 11 23 - 60% open 16.04.2021):

     git clone https://github.com/perone/medicaltorch
    
  • PyPi ( 130 / month 24.11.2018):

     pip install medicaltorch
    
Medical Detection Toolkit (14 1.1K) - The Medical Detection Toolkit contains 2D + 3D.. Apache-2
  • GitHub ( 3 280 120 - 32% open 09.09.2021):

     git clone https://github.com/MIC-DKFZ/medicaldetectiontoolkit
    
Show 7 hidden projects...
  • NiftyNet (23 1.3K ) - [unmaintained] An open-source convolutional neural.. Apache-2
  • DLTK (21 1.3K ) - Deep Learning Toolkit for Medical Image Analysis. Apache-2
  • MedPy (21 380 ) - Medical image processing in Python. GPL-3.0
  • Brainiak (21 260 ) - Brain Imaging Analysis Kit. Apache-2
  • Glow (21 180) - An open-source toolkit for large-scale genomic analysis. Apache-2
  • DeepNeuro (13 110 ) - A deep learning python package for neuroimaging data. Made by:. MIT
  • MedicalNet (12 1.3K ) - Many studies have shown that the performance on deep learning is.. MIT

Tabular Data

Back to top

Libraries for processing tabular and structured data.

pytorch_tabular (21 510) - A standard framework for modelling Deep Learning Models.. MIT
  • GitHub ( 8 47 48 - 33% open 03.01.2022):

     git clone https://github.com/manujosephv/pytorch_tabular
    
  • PyPi ( 1.4K / month 01.09.2021):

     pip install pytorch_tabular
    
carefree-learn (19 360) - Deep Learning PyTorch. MIT
  • GitHub ( 1 30 2 04.01.2022):

     git clone https://github.com/carefree0910/carefree-learn
    
  • PyPi ( 52 / month 29.10.2021):

     pip install carefree-learn
    
deltapy (11 400) - DeltaPy - Tabular Data Augmentation (by @firmai). MIT
  • GitHub ( 4 40 2 3 - 66% open 18.12.2021):

     git clone https://github.com/firmai/deltapy
    
  • PyPi ( 52 / month 09.04.2020):

     pip install deltapy
    

Optical Character Recognition

Back to top

Libraries for optical character recognition (OCR) and text extraction from images or videos.

PaddleOCR (36 19K) - Awesome multilingual OCR toolkits based on PaddlePaddle.. Apache-2
  • GitHub ( 85 3.8K 490 3.8K - 26% open 10.01.2022):

     git clone https://github.com/PaddlePaddle/PaddleOCR
    
  • PyPi ( 28K / month 4 10.01.2022):

     pip install paddleocr
    
EasyOCR (33 14K ) - Ready-to-use OCR with 80+ supported languages and all popular.. Apache-2
  • GitHub ( 90 1.8K 980K 810 480 - 31% open 15.10.2021):

     git clone https://github.com/JaidedAI/EasyOCR
    
  • PyPi ( 120K / month 21 11.09.2021):

     pip install easyocr
    
Tesseract (32 4K) - Python-tesseract is an optical character recognition (OCR) tool for.. Apache-2
  • GitHub ( 38 560 280 - 3% open 05.01.2022):

     git clone https://github.com/madmaze/pytesseract
    
  • PyPi ( 490K / month 920 28.06.2021):

     pip install pytesseract
    
  • Conda ( 480K 05.06.2021):

     conda install -c conda-forge pytesseract
    
OCRmyPDF (30 5.8K) - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them.. MPL-2.0
  • GitHub ( 60 520 790 - 11% open 04.01.2022):

     git clone https://github.com/jbarlow83/OCRmyPDF
    
  • PyPi ( 20K / month 12 19.12.2021):

     pip install ocrmypdf
    
tesserocr (27 1.6K) - A Python wrapper for the tesseract-ocr API. MIT
  • GitHub ( 26 200 600 240 - 32% open 09.11.2021):

     git clone https://github.com/sirfz/tesserocr
    
  • PyPi ( 52K / month 61 19.06.2021):

     pip install tesserocr
    
  • Conda ( 63K 13.01.2021):

     conda install -c conda-forge tesserocr
    
keras-ocr (21 960) - A packaged and flexible version of the CRAFT text detector and.. MIT
  • GitHub ( 12 240 220K 160 - 32% open 24.11.2021):

     git clone https://github.com/faustomorales/keras-ocr
    
  • PyPi ( 4.3K / month 2 24.11.2021):

     pip install keras-ocr
    
calamari (21 890) - Line based ATR Engine based on OCRopy. Apache-2
  • GitHub ( 19 180 230 - 17% open 20.12.2021):

     git clone https://github.com/Calamari-OCR/calamari
    
  • PyPi ( 550 / month 2 13.11.2018):

     pip install calamari_ocr
    
attention-ocr (21 890) - A Tensorflow model for text recognition (CNN + seq2seq with.. MIT
  • GitHub ( 27 240 18 150 - 14% open 29.10.2021):

     git clone https://github.com/emedvedev/attention-ocr
    
  • PyPi ( 190 / month 19.04.2019):

     pip install aocr
    
Mozart (10 350 ) - An optical music recognition (OMR) system. Converts sheet.. Apache-2
  • GitHub ( 5 49 9 - 33% open 05.05.2021):

     git clone https://github.com/aashrafh/Mozart
    
Show 2 hidden projects...
  • pdftabextract (19 2K ) - A set of tools for extracting tables from PDF files.. Apache-2
  • doc2text (19 1.3K ) - Detect text blocks and OCR poorly scanned PDFs in bulk. Python.. MIT

Data Containers & Structures

Back to top

General-purpose data containers & structures as well as utilities & extensions for pandas.

pandas (52 32K) - Flexible and powerful data analysis / manipulation library for.. BSD-3
  • GitHub ( 2.9K 14K 130K 610K 22K - 15% open 13.01.2022):

     git clone https://github.com/pandas-dev/pandas
    
  • PyPi ( 68M / month 57K 06.01.2022):

     pip install pandas
    
  • Conda ( 22M 07.01.2022):

     conda install -c conda-forge pandas
    
numpy (50 19K) - The fundamental package for scientific computing with Python. BSD-3
  • GitHub ( 1.4K 6.3K 440K 940K 10K - 22% open 13.01.2022):

     git clone https://github.com/numpy/numpy
    
  • PyPi ( 83M / month 120K 31.12.2021):

     pip install numpy
    
  • Conda ( 28M 03.01.2022):

     conda install -c conda-forge numpy
    
Arrow (43 8.9K) - Apache Arrow is a multi-language toolbox for accelerated data.. Apache-2
  • GitHub ( 790 2.2K 61 890 - 22% open 13.01.2022):

     git clone https://github.com/apache/arrow
    
  • PyPi ( 33M / month 1.3K 18.11.2021):

     pip install pyarrow
    
  • Conda ( 860K 26.10.2021):

     conda install -c conda-forge arrow
    
h5py (40 1.7K) - HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5.. BSD-3
  • GitHub ( 170 430 1.7K 150K 1.3K - 17% open 08.01.2022):

     git clone https://github.com/h5py/h5py
    
  • PyPi ( 11M / month 14K 16.11.2021):

     pip install h5py
    
  • Conda ( 6.9M 26.11.2021):

     conda install -c conda-forge h5py
    
xarray (38 2.4K) - N-D labeled arrays and datasets in Python. Apache-2
  • GitHub ( 360 750 8.7K 3.2K - 29% open 13.01.2022):

     git clone https://github.com/pydata/xarray
    
  • PyPi ( 870K / month 1.3K 10.12.2021):

     pip install xarray
    
  • Conda ( 4.4M 10.12.2021):

     conda install -c conda-forge xarray
    
Modin (34 6.7K) - Modin: Speed up your Pandas workflows by changing a single line of.. Apache-2
  • GitHub ( 87 470 200K 540 2.3K - 29% open 12.01.2022):

     git clone https://github.com/modin-project/modin
    
  • PyPi ( 170K / month 20 19.12.2021):

     pip install modin
    
PyTables (34 1.1K) - A Python package to manage extremely large amounts of data. BSD-3
  • GitHub ( 100 220 160 680 - 26% open 07.01.2022):

     git clone https://github.com/PyTables/PyTables
    
  • PyPi ( 630K / month 2.3K 28.12.2021):

     pip install tables
    
  • Conda ( 3.7M 29.11.2021):

     conda install -c conda-forge pytables
    
numexpr (33 1.7K) - Fast numerical array expression evaluator for Python, NumPy, PyTables,.. MIT
  • GitHub ( 58 170 320 - 18% open 07.01.2022):

     git clone https://github.com/pydata/numexpr
    
  • PyPi ( 1.4M / month 3K 15.12.2021):

     pip install numexpr
    
  • Conda ( 3.5M 09.12.2021):

     conda install -c conda-forge numexpr
    
zarr (32 840) - An implementation of chunked, compressed, N-dimensional arrays for Python. MIT
  • GitHub ( 55 130 990 480 - 41% open 13.01.2022):

     git clone https://github.com/zarr-developers/zarr-python
    
  • PyPi ( 52K / month 180 19.11.2021):

     pip install zarr
    
  • Conda ( 1.2M 19.11.2021):

     conda install -c conda-forge zarr
    
TinyDB (31 4.8K) - TinyDB is a lightweight document oriented database optimized for your.. MIT
  • GitHub ( 72 420 280 - 3% open 10.01.2022):

     git clone https://github.com/msiemens/tinydb
    
  • PyPi ( 290K / month 780 23.09.2021):

     pip install tinydb
    
  • Conda ( 150K 23.09.2021):

     conda install -c conda-forge tinydb
    
Koalas (31 3.1K) - Koalas: pandas API on Apache Spark. Apache-2
  • GitHub ( 51 320 1K 170 580 - 16% open 21.10.2021):

     git clone https://github.com/databricks/koalas
    
  • PyPi ( 2M / month 7 19.10.2021):

     pip install koalas
    
  • Conda ( 110K 20.10.2021):

     conda install -c conda-forge koalas
    
Bottleneck (31 690 ) - Fast NumPy array functions written in C. BSD-2
  • GitHub ( 21 75 30K 220 - 17% open 24.01.2021):

     git clone https://github.com/pydata/bottleneck
    
  • PyPi ( 410K / month 1.6K 21.02.2020):

     pip install Bottleneck
    
  • Conda ( 1.8M 04.11.2021):

     conda install -c conda-forge bottleneck
    
Vaex (30 6.8K) - Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization.. MIT
  • GitHub ( 62 530 220 970 - 36% open 13.01.2022):

     git clone https://github.com/vaexio/vaex
    
  • PyPi ( 16K / month 16 18.12.2021):

     pip install vaex
    
  • Conda ( 120K 30.12.2021):

     conda install -c conda-forge vaex
    
polars (30 4.2K) - Fast multi-threaded DataFrame library in Rust | Python | Node.js. MIT
  • GitHub ( 65 230 870 - 9% open 12.01.2022):

     git clone https://github.com/pola-rs/polars
    
  • PyPi ( 34K / month 8 10.01.2022):

     pip install polars
    
Arctic (29 2.6K) - Arctic is a high performance datastore for numeric data. LGPL-2.1
  • GitHub ( 72 520 180 150 540 - 17% open 10.01.2022):

     git clone https://github.com/man-group/arctic
    
  • PyPi ( 5.4K / month 34 10.01.2022):

     pip install arctic
    
  • Conda ( 17K 16.12.2019):

     conda install -c conda-forge arctic
    
datatable (29 1.4K) - A Python package for manipulating 2-dimensional tabular data.. MPL-2.0
  • GitHub ( 32 120 1.2K 1.4K - 9% open 13.01.2022):

     git clone https://github.com/h2oai/datatable
    
  • PyPi ( 68K / month 12 01.07.2021):

     pip install datatable
    
datasketch (28 1.6K) - MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog,.. MIT
  • GitHub ( 20 230 18 340 130 - 23% open 27.12.2021):

     git clone https://github.com/ekzhu/datasketch
    
  • PyPi ( 380K / month 54 27.12.2021):

     pip install datasketch
    
swifter (26 1.9K ) - A package which efficiently applies any function to a pandas.. MIT
  • GitHub ( 14 86 450 110 - 21% open 25.06.2021):

     git clone https://github.com/jmcarpenter2/swifter
    
  • PyPi ( 91K / month 27 25.06.2021):

     pip install swifter
    
  • Conda ( 130K 26.06.2021):

     conda install -c conda-forge swifter
    
Pandarallel (24 1.9K) - A simple and efficient tool to parallelize Pandas.. BSD-3
  • GitHub ( 18 120 370 140 - 54% open 17.10.2021):

     git clone https://github.com/nalepae/pandarallel
    
  • PyPi ( 180K / month 13 17.10.2021):

     pip install pandarallel
    
Pandas Summary (21 380) - A library for managing, validating, summarizing, and.. Apache-2
  • GitHub ( 8 37 3 14 - 50% open 06.01.2022):

     git clone https://github.com/mouradmourafiq/pandas-summary
    
  • PyPi ( 38K / month 57 25.11.2021):

     pip install pandas-summary
    
Bounter (18 930 ) - Efficient Counter that uses a limited (bounded) amount of memory.. MIT
  • GitHub ( 8 48 25 24 - 62% open 24.05.2021):

     git clone https://github.com/RaRe-Technologies/bounter
    
  • PyPi ( 190 / month 8 17.08.2020):

     pip install bounter
    
PandaPy (10 490) - PandaPy has the speed of NumPy and the usability of Pandas 10x to 50x.. MIT
  • GitHub ( 3 56 1 3 - 66% open 20.10.2021):

     git clone https://github.com/firmai/pandapy
    
  • PyPi ( 70 / month 25.01.2020):

     pip install pandapy
    
Show 7 hidden projects...
  • Blaze (31 3K ) - NumPy and Pandas interface to Big Data. BSD-3
  • sklearn-pandas (27 2.5K ) - Pandas integration with sklearn. Zlib
  • bcolz (26 940 ) - A columnar data container that can be compressed. BSD-3
  • StaticFrame (26 260) - Immutable and grow-only Pandas-like DataFrames with a more explicit.. MIT
  • pandasql (25 1.1K ) - sqldf for pandas. MIT
  • pickleDB (23 630 ) - pickleDB is an open source key-value store using Pythons json module. BSD-3
  • fletcher (19 220 ) - Pandas ExtensionDType/Array backed by Apache Arrow. MIT

Data Loading & Extraction

Back to top

Libraries for loading, collecting, and extracting data from a variety of data sources and formats.

 best-of-python - Data Extraction ( 1.9K) - Collection of data-loading and -extraction libraries.


Web Scraping & Crawling

Back to top

Libraries for web scraping, crawling, downloading, and mining as well as libraries.

 best-of-web-python - Web Scraping ( 1.4K) - Collection of web-scraping and crawling libraries.


Data Pipelines & Streaming

Back to top

Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.

Celery (46 19K) - Asynchronous task queue/job queue based on distributed message passing. BSD-3
  • GitHub ( 1.2K 4.2K 63K 4.7K - 10% open 13.01.2022):

     git clone https://github.com/celery/celery
    
  • PyPi ( 4.2M / month 15K 29.12.2021):

     pip install celery
    
  • Conda ( 770K 09.01.2022):

     conda install -c conda-forge celery
    
Airflow (44 25K ) - Platform to programmatically author, schedule, and monitor.. Apache-2
  • GitHub ( 2.2K 9.8K 230K 4.9K - 18% open 13.01.2022):

     git clone https://github.com/apache/airflow
    
  • PyPi ( 3.5M / month 440 21.12.2021):

     pip install apache-airflow
    
  • Conda ( 520K 22.12.2021):

     conda install -c conda-forge airflow
    
  • Docker Hub ( 59M 310 21.12.2021):

     docker pull apache/airflow
    
Beam (39 5.2K) - Unified programming model to define and execute data processing.. Apache-2
  • GitHub ( 1.2K 3.3K 13.01.2022):

     git clone https://github.com/apache/beam
    
  • PyPi ( 22M / month 140 30.12.2021):

     pip install apache-beam
    
luigi (38 15K) - Luigi is a Python module that helps you build complex pipelines of batch.. Apache-2
  • GitHub ( 580 2.3K 1.6K 970 - 10% open 11.01.2022):

     git clone https://github.com/spotify/luigi
    
  • PyPi ( 570K / month 400 23.09.2020):

     pip install luigi
    
  • Conda ( 8.8K 2 17.04.2021):

     conda install -c anaconda luigi
    
Dagster (37 4.2K) - An orchestration platform for the development, production, and.. Apache-2
  • GitHub ( 180 510 290 3.6K - 24% open 13.01.2022):

     git clone https://github.com/dagster-io/dagster
    
  • PyPi ( 150K / month 83 06.01.2022):

     pip install dagster
    
  • Conda ( 430K 07.01.2022):

     conda install -c conda-forge dagster
    
joblib (37 2.6K ) - Computing with Python functions. BSD-3
  • GitHub ( 110 320 150K 710 - 46% open 08.11.2021):

     git clone https://github.com/joblib/joblib
    
  • PyPi ( 24M / month 4.9K 07.10.2021):

     pip install joblib
    
  • Conda ( 6.7M 07.10.2021):

     conda install -c conda-forge joblib
    
rq (36 8.1K) - Simple job queues for Python. BSD-3
  • GitHub ( 250 1.3K 9.4K 940 - 17% open 03.01.2022):

     git clone https://github.com/rq/rq
    
  • PyPi ( 410K / month 1.7K 07.12.2021):

     pip install rq
    
  • Conda ( 67K 30.06.2021):

     conda install -c conda-forge rq
    
Prefect (36 8.1K) - The easiest way to automate your data. Apache-2
  • GitHub ( 290 780 550 2K - 19% open 12.01.2022):

     git clone https://github.com/PrefectHQ/prefect
    
  • PyPi ( 140K / month 40 12.01.2022):

     pip install prefect
    
  • Conda ( 230K 23.12.2021):

     conda install -c conda-forge prefect
    
dbt (36 4K ) - dbt enables data analysts and engineers to transform their data using.. Apache-2
  • GitHub ( 190 720 120 250 2.4K - 10% open 12.01.2022):

     git clone https://github.com/fishtown-analytics/dbt
    
  • PyPi ( 570K / month 29 06.12.2021):

     pip install dbt
    
  • Conda ( 190K 09.12.2021):

     conda install -c conda-forge dbt
    
Great Expectations (35 5.9K ) - Always know what to expect from your data. Apache-2
  • GitHub ( 260 800 1.2K - 15% open 13.01.2022):

     git clone https://github.com/great-expectations/great_expectations
    
  • PyPi ( 2.3M / month 27 06.01.2022):

     pip install great_expectations
    
Kedro (33 4.8K) - A Python framework for creating reproducible, maintainable and modular.. Apache-2
  • GitHub ( 140 540 680 610 - 8% open 13.01.2022):

     git clone https://github.com/quantumblacklabs/kedro
    
  • PyPi ( 220K / month 35 09.12.2021):

     pip install kedro
    
Activeloop (31 4.2K) - Dataset format for AI. Build, manage, & visualize datasets for.. MPL-2.0
  • GitHub ( 89 340 140 330 - 20% open 12.01.2022):

     git clone https://github.com/activeloopai/Hub
    
  • PyPi ( 2.1K / month 52 22.12.2021):

     pip install hub
    
huey (31 3.8K) - a little task queue for python. MIT
  • GitHub ( 66 330 850 10.01.2022):

     git clone https://github.com/coleifer/huey
    
  • PyPi ( 45K / month 160 28.12.2021):

     pip install huey
    
  • Conda ( 23K 16.10.2019):

     conda install -c conda-forge huey
    
mleap (31 1.3K) - MLeap: Deploy ML Pipelines to Production. Apache-2
  • GitHub ( 71 290 170 440 - 20% open 12.01.2022):

     git clone https://github.com/combust/mleap
    
  • PyPi ( 200K / month 25 12.01.2022):

     pip install mleap
    
  • Conda ( 43K 12.01.2022):

     conda install -c conda-forge mleap
    
TFX (30 1.7K) - TFX is an end-to-end platform for deploying production ML pipelines. Apache-2
  • GitHub ( 130 520 730 - 34% open 13.01.2022):

     git clone https://github.com/tensorflow/tfx
    
  • PyPi ( 340K / month 5 14.12.2021):

     pip install tfx
    
streamparse (28 1.4K) - Run Python in Apache Storm topologies. Pythonic API, CLI.. Apache-2
  • GitHub ( 42 220 53 330 - 21% open 10.01.2022):

     git clone https://github.com/Parsely/streamparse
    
  • PyPi ( 2.7K / month 27 10.01.2022):

     pip install streamparse
    
petl (28 960) - Python Extract Transform and Load Tables of Data. MIT
  • GitHub ( 52 170 610 420 - 16% open 04.01.2022):

     git clone https://github.com/petl-developers/petl
    
  • PyPi ( 39K / month 73 27.03.2021):

     pip install petl
    
  • Conda ( 63K 05.04.2021):

     conda install -c conda-forge petl
    
Optimus (27 1.2K) - Agile Data Preparation Workflows madeeasy with Pandas, Dask,.. Apache-2
  • GitHub ( 25 200 220 - 13% open 12.01.2022):

     git clone https://github.com/hi-primus/optimus
    
  • PyPi ( 12K / month 30.05.2019):

     pip install optimuspyspark
    
PyFunctional (26 2K) - Python library for creating data pipelines with chain functional.. MIT
  • GitHub ( 25 110 380 120 - 4% open 05.11.2021):

     git clone https://github.com/EntilZha/PyFunctional
    
  • PyPi ( 71K / month 12 12.01.2021):

     pip install pyfunctional
    
zenml (26 1.6K) - ZenML : MLOps framework to create reproducible pipelines. Apache-2
  • GitHub ( 23 93 56 - 16% open 13.01.2022):

     git clone https://github.com/maiot-io/zenml
    
  • PyPi ( 570 / month 13.01.2022):

     pip install zenml
    
ploomber (26 870) - Write maintainable, production-ready pipelines using Jupyter or your.. Apache-2
  • GitHub ( 25 66 26 430 - 21% open 12.01.2022):

     git clone https://github.com/ploomber/ploomber
    
  • PyPi ( 4.3K / month 4 08.01.2022):

     pip install ploomber
    
bonobo (24 1.5K ) - Extract Transform Load for Python 3.5+. Apache-2
  • GitHub ( 37 120 130 200 - 46% open 10.03.2021):

     git clone https://github.com/python-bonobo/bonobo
    
  • PyPi ( 3.2K / month 33 20.07.2019):

     pip install bonobo
    
arq (24 1.1K) - Fast job queuing and RPC in python with asyncio and redis. MIT
  • GitHub ( 32 84 160 120 - 27% open 15.10.2021):

     git clone https://github.com/samuelcolvin/arq
    
  • PyPi ( 15K / month 10 02.09.2021):

     pip install arq
    
whylogs (23 720) - Open standard for end-to-end data and ML monitoring for any scale in.. Apache-2
  • GitHub ( 26 39 47 110 - 44% open 13.01.2022):

     git clone https://github.com/whylabs/whylogs
    
  • PyPi ( 3.3K / month 08.01.2022):

     pip install whylogs
    
pdpipe (23 640) - Easy pipelines for pandas DataFrames. MIT
  • GitHub ( 9 30 38 40 - 35% open 26.12.2021):

     git clone https://github.com/pdpipe/pdpipe
    
  • PyPi ( 2.2K / month 5 26.12.2021):

     pip install pdpipe
    
Pypeline (22 1.3K) - Concurrent data pipelines in Python . MIT
  • GitHub ( 12 77 57 - 26% open 06.01.2022):

     git clone https://github.com/cgarciae/pypeln
    
  • PyPi ( 4.7K / month 9 06.01.2022):

     pip install pypeln
    
TaskTiger (22 1.1K) - Python task queue using Redis. MIT
  • GitHub ( 25 62 22 68 - 48% open 02.12.2021):

     git clone https://github.com/closeio/tasktiger
    
  • PyPi ( 1.6K / month 10 02.12.2021):

     pip install tasktiger
    
riko (20 1.6K) - A Python stream processing engine modeled after Yahoo! Pipes. MIT
  • GitHub ( 18 75 30 - 73% open 28.12.2021):

     git clone https://github.com/nerevu/riko
    
  • PyPi ( 220 / month 1 28.12.2021):

     pip install riko
    
spark-deep-learning (19 1.9K) - Deep Learning Pipelines for Apache Spark. Apache-2
  • GitHub ( 16 460 19 100 - 73% open 19.08.2021):

     git clone https://github.com/databricks/spark-deep-learning
    
Databolt Flow (19 930) - Python library for building highly effective data science workflows. MIT
  • GitHub ( 12 69 17 23 - 43% open 28.09.2021):

     git clone https://github.com/d6t/d6tflow
    
  • PyPi ( 200 / month 06.10.2021):

     pip install d6tflow
    
Mara Pipelines (16 1.8K) - A lightweight opinionated ETL framework, halfway between plain.. MIT
  • GitHub ( 16 86 8 24 - 45% open 18.09.2021):

     git clone https://github.com/mara/mara-pipelines
    
  • PyPi ( 79 / month 23.01.2021):

     pip install mara-pipelines
    
kale (16 470) - Kubeflows superfood for Data Scientists. Apache-2
  • GitHub ( 10 89 160 - 53% open 20.10.2021):

     git clone https://github.com/kubeflow-kale/kale
    
  • PyPi ( 980 / month 19.05.2021):

     pip install kubeflow-kale
    
Show 10 hidden projects...
  • mrjob (32 2.6K ) - Run MapReduce jobs on Hadoop or Amazon Web Services. Apache-2
  • faust (31 5.9K ) - Python Stream Processing. BSD-3
  • dbnd (25 210) - DBND is an agile pipeline framework that helps data engineering teams.. Apache-2
  • dpark (22 2.7K ) - Python clone of Spark, a MapReduce alike framework in Python. BSD-3
  • pysparkling (22 240 ) - A pure Python implementation of Apache Sparks RDD and DStream.. MIT
  • mrq (21 860 ) - Mr. Queue - A distributed worker task queue in Python using Redis & gevent. MIT
  • BatchFlow (20 170) - BatchFlow helps you conveniently work with random or sequential.. Apache-2
  • bodywork-core (18 320) - ML pipeline orchestration and model deployments on.. AGPL-3.0
  • flupy (17 170 ) - Fluent data pipelines for python and your shell. MIT
  • Botflow (15 1.2K ) - Python Fast Dataflow programming framework for Data pipeline work(.. BSD-3

Distributed Machine Learning

Back to top

Libraries that provide capabilities to distribute and parallelize machine learning tasks across large-scale compute infrastructure.

Ray (43 19K) - An open source framework that provides a simple, universal API for.. Apache-2
  • GitHub ( 610 3.2K 3.7K 9.2K - 24% open 13.01.2022):

     git clone https://github.com/ray-project/ray
    
  • PyPi ( 620K / month 230 11.01.2022):

     pip install ray
    
dask (43 9.4K) - Parallel computing with task scheduling. BSD-3
  • GitHub ( 500 1.4K 33K 4.1K - 18% open 13.01.2022):

     git clone https://github.com/dask/dask
    
  • PyPi ( 4.5M / month 2.5K 10.12.2021):

     pip install dask
    
  • Conda ( 4.8M 11.12.2021):

     conda install -c conda-forge dask
    
dask.distributed (40 1.3K) - A distributed task scheduler for Dask. BSD-3
  • GitHub ( 260 580 22K 2.5K - 34% open 13.01.2022):

     git clone https://github.com/dask/distributed
    
  • PyPi ( 4.4M / month 1.1K 10.12.2021):

     pip install distributed
    
  • Conda ( 6.1M 11.12.2021):

     conda install -c conda-forge distributed
    
horovod (36 12K) - Distributed training framework for TensorFlow, Keras, PyTorch, and.. Apache-2
  • GitHub ( 140 1.9K 480 1.9K - 13% open 10.01.2022):

     git clone https://github.com/horovod/horovod
    
  • PyPi ( 38K / month 29 06.10.2021):

     pip install horovod
    
ipyparallel (33 2.1K) - IPython Parallel: Interactive Parallel Computing in Python. BSD-3
  • GitHub ( 100 850 1.8K 320 - 15% open 10.01.2022):

     git clone https://github.com/ipython/ipyparallel
    
  • PyPi ( 44K / month 280 22.12.2021):

     pip install ipyparallel
    
  • Conda ( 550K 23.12.2021):

     conda install -c conda-forge ipyparallel
    
BigDL (31 3.8K) - Building Large-Scale AI Applications for Distributed Big Data. Apache-2
  • GitHub ( 130 960 32 1.1K - 30% open 13.01.2022):

     git clone https://github.com/intel-analytics/BigDL
    
  • PyPi ( 5.7K / month 1 13.01.2022):

     pip install bigdl
    
  • Maven ( 4 20.04.2021):

     <dependency>
     	<groupId>com.intel.analytics.bigdl</groupId>
     	<artifactId>bigdl-SPARK_2.4</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
DeepSpeed (30 6.1K) - DeepSpeed is a deep learning optimization library that makes.. MIT
  • GitHub ( 87 660 140 760 - 49% open 12.01.2022):

     git clone https://github.com/microsoft/DeepSpeed
    
  • PyPi ( 43K / month 9 04.01.2022):

     pip install deepspeed
    
  • Docker Hub ( 13K 3 05.05.2021):

     docker pull deepspeed/deepspeed
    
FairScale (29 1.6K) - PyTorch extensions for high performance and large scale training. BSD-3
  • GitHub ( 53 140 140 270 - 23% open 13.01.2022):

     git clone https://github.com/facebookresearch/fairscale
    
  • PyPi ( 38K / month 13 21.12.2021):

     pip install fairscale
    
dask-ml (29 780) - Scalable Machine Learning with Dask. BSD-3
  • GitHub ( 70 220 540 460 - 48% open 05.01.2022):

     git clone https://github.com/dask/dask-ml
    
  • PyPi ( 47K / month 55 30.11.2021):

     pip install dask-ml
    
  • Conda ( 260K 30.11.2021):

     conda install -c conda-forge dask-ml
    
metrics (29 610) - Machine learning metrics for distributed, scalable PyTorch.. Apache-2
  • GitHub ( 110 140 360 1.6K 260 - 17% open 13.01.2022):

     git clone https://github.com/PyTorchLightning/metrics
    
  • PyPi ( 2.8K / month 14 28.04.2018):

     pip install metrics
    
petastorm (28 1.3K) - Petastorm library enables single machine or distributed training.. Apache-2
  • GitHub ( 43 220 310 54 270 - 50% open 10.01.2022):

     git clone https://github.com/uber/petastorm
    
  • PyPi ( 140K / month 4 04.09.2021):

     pip install petastorm
    
mpi4py (28 500) - Python bindings for MPI. BSD-2
  • GitHub ( 20 74 2.9K 62 - 22% open 10.01.2022):

     git clone https://github.com/mpi4py/mpi4py
    
  • PyPi ( 200K / month 580 15.12.2021):

     pip install mpi4py
    
  • Conda ( 880K 25.11.2021):

     conda install -c conda-forge mpi4py
    
MMLSpark (27 3.1K) - Simple and Distributed Machine Learning. MIT
  • GitHub ( 81 640 490 - 40% open 12.01.2022):

     git clone https://github.com/Azure/mmlspark
    
  • PyPi ( 41K / month 18.03.2020):

     pip install mmlspark
    
analytics-zoo (27 2.5K) - Distributed Tensorflow, Keras and PyTorch on Apache.. Apache-2
  • GitHub ( 100 710 3 1.4K - 40% open 06.01.2022):

     git clone https://github.com/intel-analytics/analytics-zoo
    
  • PyPi ( 4.2K / month 12.01.2022):

     pip install analytics-zoo
    
Elephas (27 1.5K) - Distributed Deep learning with Keras & Spark. MIT keras
  • GitHub ( 27 290 52 160 - 15% open 17.08.2021):

     git clone https://github.com/maxpumperla/elephas
    
  • PyPi ( 17K / month 3 17.08.2021):

     pip install elephas
    
TensorFlowOnSpark (26 3.8K) - TensorFlowOnSpark brings TensorFlow programs to.. Apache-2
  • GitHub ( 34 960 360 - 1% open 10.01.2022):

     git clone https://github.com/yahoo/TensorFlowOnSpark
    
  • PyPi ( 380K / month 5 25.05.2021):

     pip install tensorflowonspark
    
Mesh (26 1.2K) - Mesh TensorFlow: Model Parallelism Made Easier. Apache-2
  • GitHub ( 44 200 620 100 - 86% open 18.10.2021):

     git clone https://github.com/tensorflow/mesh
    
  • PyPi ( 160K / month 32 24.03.2021):

     pip install mesh-tensorflow
    
Hivemind (21 900) - Decentralized deep learning in PyTorch. Built to train models on.. MIT
  • GitHub ( 19 58 4 110 - 36% open 09.01.2022):

     git clone https://github.com/learning-at-home/hivemind
    
  • PyPi ( 320 / month 1 20.12.2021):

     pip install hivemind
    
Submit it (21 520) - Python 3.6+ toolbox for submitting jobs to Slurm. MIT
  • GitHub ( 17 49 57 - 43% open 09.12.2021):

     git clone https://github.com/facebookincubator/submitit
    
  • PyPi ( 11K / month 6 30.11.2021):

     pip install submitit
    
  • Conda ( 5.1K 10.02.2021):

     conda install -c conda-forge submitit
    
Apache Singa (20 2.4K) - a distributed deep learning platform. Apache-2
  • GitHub ( 76 720 1 89 - 44% open 10.08.2021):

     git clone https://github.com/apache/singa
    
  • Conda ( 410 09.08.2021):

     conda install -c nusdbsystem singa
    
  • Docker Hub ( 210 4 04.06.2019):

     docker pull apache/singa
    
BytePS (19 3K) - A high performance and generic framework for distributed DNN training. Apache-2
  • GitHub ( 19 430 260 - 39% open 12.01.2022):

     git clone https://github.com/bytedance/byteps
    
  • PyPi ( 63 / month 02.08.2021):

     pip install byteps
    
  • Docker Hub ( 1.2K 03.03.2020):

     docker pull bytepsimage/tensorflow
    
Fiber (18 950 ) - Distributed Computing for AI Made Simple. Apache-2
  • GitHub ( 5 100 31 28 - 71% open 15.03.2021):

     git clone https://github.com/uber/fiber
    
  • PyPi ( 1.5K / month 1 09.07.2020):

     pip install fiber
    
Show 7 hidden projects...
  • DEAP (32 4.6K) - Distributed Evolutionary Algorithms in Python. LGPL-3.0
  • TensorFrames (20 760 ) - [DEPRECATED] Tensorflow wrapper for DataFrames on.. Apache-2
  • launchpad (20 250) - Launchpad is a library that simplifies writing distributed.. Apache-2
  • somoclu (20 230) - Massively parallel self-organizing maps: accelerate training on multicore.. MIT
  • sk-dist (19 270) - Distributed scikit-learn meta-estimators in PySpark. Apache-2
  • LazyCluster (12 43) - Distributed machine learning made simple. Apache-2
  • autodist (11 120 ) - Simple Distributed Deep Learning on TensorFlow. Apache-2

Hyperparameter Optimization & AutoML

Back to top

Libraries for hyperparameter optimization, automl and neural architecture search.

Optuna (38 5.8K) - A hyperparameter optimization framework. MIT
  • GitHub ( 170 640 2.5K 1.1K - 13% open 12.01.2022):

     git clone https://github.com/optuna/optuna
    
  • PyPi ( 640K / month 160 06.12.2021):

     pip install optuna
    
  • Conda ( 50K 04.10.2021):

     conda install -c conda-forge optuna
    
NNI (35 11K) - An open source AutoML toolkit for automate machine learning lifecycle,.. MIT
  • GitHub ( 150 1.5K 170 1.5K - 16% open 13.01.2022):

     git clone https://github.com/microsoft/nni
    
  • PyPi ( 5.1K / month 28 13.01.2022):

     pip install nni
    
featuretools (35 5.9K) - An open source python library for automated feature engineering. BSD-3
  • GitHub ( 57 780 910 710 - 22% open 11.01.2022):

     git clone https://github.com/alteryx/featuretools
    
  • PyPi ( 1.1M / month 62 11.01.2022):

     pip install featuretools
    
  • Conda ( 76K 11.01.2022):

     conda install -c conda-forge featuretools
    
Hyperopt (34 6K) - Distributed Asynchronous Hyperparameter Optimization in Python. BSD-3
  • GitHub ( 93 920 5.5K 590 - 60% open 29.11.2021):

     git clone https://github.com/hyperopt/hyperopt
    
  • PyPi ( 1.7M / month 400 17.11.2021):

     pip install hyperopt
    
  • Conda ( 320K 14.10.2020):

     conda install -c conda-forge hyperopt
    
auto-sklearn (33 6K ) - Automated Machine Learning with scikit-learn. BSD-3
  • GitHub ( 77 1.1K 3 250 830 - 13% open 24.12.2021):

     git clone https://github.com/automl/auto-sklearn
    
  • PyPi ( 23K / month 30 11.01.2022):

     pip install auto-sklearn
    
Keras Tuner (33 2.5K) - Hyperparameter tuning for humans. Apache-2
  • GitHub ( 41 320 1.1K 360 - 45% open 12.01.2022):

     git clone https://github.com/keras-team/keras-tuner
    
  • PyPi ( 520K / month 34 05.11.2021):

     pip install keras-tuner
    
Ax (33 1.7K) - Adaptive Experimentation Platform. MIT
  • GitHub ( 110 180 230 350 - 11% open 12.01.2022):

     git clone https://github.com/facebook/Ax
    
  • PyPi ( 90K / month 14 16.12.2021):

     pip install ax-platform
    
AutoKeras (32 8.3K) - AutoML library for deep learning. Apache-2
  • GitHub ( 130 1.3K 2.2K 270 790 - 8% open 22.12.2021):

     git clone https://github.com/keras-team/autokeras
    
  • PyPi ( 40K / month 9 02.11.2021):

     pip install autokeras
    
scikit-optimize (32 2.3K) - Sequential model-based optimization with a `scipy.optimize`.. BSD-3
  • GitHub ( 75 420 2.3K 620 - 37% open 12.10.2021):

     git clone https://github.com/scikit-optimize/scikit-optimize
    
  • PyPi ( 380K / month 170 12.10.2021):

     pip install scikit-optimize
    
  • Conda ( 520K 15.12.2021):

     conda install -c conda-forge scikit-optimize
    
BoTorch (32 2.2K) - Bayesian optimization in PyTorch. MIT
  • GitHub ( 67 230 200 240 - 23% open 12.01.2022):

     git clone https://github.com/pytorch/botorch
    
  • PyPi ( 110K / month 12 09.12.2021):

     pip install botorch
    
AutoGluon (30 4K) - AutoGluon: AutoML for Text, Image, and Tabular Data. Apache-2
  • GitHub ( 64 540 89 600 - 22% open 13.01.2022):

     git clone https://github.com/awslabs/autogluon
    
  • PyPi ( 20K / month 4 13.01.2022):

     pip install autogluon
    
nevergrad (30 3.2K) - A Python toolbox for performing gradient-free optimization. MIT
  • GitHub ( 46 300 270 250 - 38% open 13.01.2022):

     git clone https://github.com/facebookresearch/nevergrad
    
  • PyPi ( 16K / month 17 10.11.2021):

     pip install nevergrad
    
  • Conda ( 20K 14.06.2021):

     conda install -c conda-forge nevergrad
    
Hyperas (27 2.1K) - Keras + Hyperopt: A very simple wrapper for convenient.. MIT
  • GitHub ( 21 300 220 250 - 36% open 19.11.2021):

     git clone https://github.com/maxpumperla/hyperas
    
  • PyPi ( 13K / month 24 28.02.2019):

     pip install hyperas
    
mljar-supervised (27 1.7K) - Python package for AutoML on Tabular Data with Feature.. MIT
  • GitHub ( 14 240 35 450 - 17% open 06.12.2021):

     git clone https://github.com/mljar/mljar-supervised
    
  • PyPi ( 23K / month 01.10.2021):

     pip install mljar-supervised
    
SMAC3 (26 650) - Sequential Model-based Algorithm Configuration. BSD-3
  • GitHub ( 38 170 360 - 19% open 05.11.2021):

     git clone https://github.com/automl/SMAC3
    
  • PyPi ( 18K / month 32 05.11.2021):

     pip install smac
    
AdaNet (24 3.4K) - Fast and flexible AutoML with learning guarantees. Apache-2
  • GitHub ( 27 520 42 110 - 58% open 30.08.2021):

     git clone https://github.com/tensorflow/adanet
    
  • PyPi ( 840 / month 2 09.07.2020):

     pip install adanet
    
Hyperactive (23 340) - An optimization and data collection toolbox for convenient and fast.. MIT
  • GitHub ( 4 32 97 11 41 - 17% open 13.01.2022):

     git clone https://github.com/SimonBlanke/Hyperactive
    
  • PyPi ( 480 / month 3 05.12.2021):

     pip install hyperactive
    
Talos (22 1.5K ) - Hyperparameter Optimization for TensorFlow, Keras and PyTorch. MIT
  • GitHub ( 19 250 140 390 - 8% open 27.05.2021):

     git clone https://github.com/autonomio/talos
    
  • PyPi ( 780 / month 6 09.11.2020):

     pip install talos
    
Neuraxle (22 490) - A Sklearn-like Framework for Hyperparameter Tuning and AutoML in.. Apache-2
  • GitHub ( 7 52 24 340 - 41% open 01.11.2021):

     git clone https://github.com/Neuraxio/Neuraxle
    
  • PyPi ( 220 / month 1 17.10.2021):

     pip install neuraxle
    
Auto ViML (20 320) - Automatically Build Multiple ML Models with a Single Line of Code... Apache-2
  • GitHub ( 6 70 15 18 - 22% open 06.12.2021):

     git clone https://github.com/AutoViML/Auto_ViML
    
  • PyPi ( 1.5K / month 2 06.12.2021):

     pip install autoviml
    
sklearn-deap (19 670) - Use evolutionary algorithms instead of gridsearch in scikit-.. MIT
  • GitHub ( 22 110 31 55 - 38% open 30.07.2021):

     git clone https://github.com/rsteca/sklearn-deap
    
  • PyPi ( 730 / month 2 30.07.2021):

     pip install sklearn-deap
    
AlphaPy (18 690) - Automated Machine Learning [AutoML] with Python, scikit-learn, Keras,.. Apache-2
  • GitHub ( 3 150 3 41 - 29% open 23.10.2021):

     git clone https://github.com/ScottfreeLLC/AlphaPy
    
  • PyPi ( 120 / month 29.08.2020):

     pip install alphapy
    
HyperparameterHunter (16 680 ) - Easy hyperparameter optimization and automatic result.. MIT
  • GitHub ( 4 87 330 120 - 30% open 20.01.2021):

     git clone https://github.com/HunterMcGushion/hyperparameter_hunter
    
  • PyPi ( 91 / month 2 06.08.2019):

     pip install hyperparameter-hunter
    
model_search (11 3.2K ) - AutoML algorithms for model architecture search at scale. Apache-2
  • GitHub ( 1 310 50 - 70% open 17.03.2021):

     git clone https://github.com/google/model_search
    
Show 23 hidden projects...
  • Bayesian Optimization (32 5.7K ) - A Python implementation of global optimization with.. MIT
  • TPOT (31 8.4K ) - A Python Automated Machine Learning tool that optimizes.. LGPL-3.0
  • GPyOpt (26 790 ) - Gaussian Process Optimization using GPy. BSD-3
  • Orion (26 220) - Asynchronous Distributed Hyperparameter Optimization. BSD-3
  • auto_ml (23 1.6K ) - [UNMAINTAINED] Automated machine learning for analytics & production. MIT
  • MLBox (22 1.3K ) - MLBox is a powerful Automated Machine Learning python library. BSD-1-Clause
  • HpBandSter (22 510 ) - a distributed Hyperband implementation on Steroids. BSD-3
  • optunity (22 380 ) - optimization routines for hyperparameter tuning. BSD-3
  • lazypredict (22 280) - Lazy Predict help build a lot of basic models without much code.. MIT
  • Test Tube (20 710 ) - Python library to easily log experiments and parallelize.. MIT
  • Dragonfly (19 620 ) - An open source python library for scalable Bayesian optimisation. MIT
  • Sherpa (18 310 ) - Hyperparameter optimization that enables researchers to.. GPL-3.0
  • Advisor (17 1.4K ) - Open-source implementation of Google Vizier for hyper parameters.. Apache-2
  • Auto Tune Models (17 510 ) - Auto Tune Models - A multi-tenant, multi-data system for.. MIT
  • featurewiz (17 120) - Use advanced feature engineering strategies and select best.. Apache-2
  • automl-gs (16 1.8K ) - Provide an input CSV and a target field to predict, generate a.. MIT
  • Xcessiv (16 1.3K ) - A web-based application for quick, scalable, and automated.. Apache-2
  • Parfit (15 200 ) - A package for parallelizing the fit and flexibly scoring of.. MIT
  • ENAS (13 2.5K ) - PyTorch implementation of Efficient Neural Architecture Search via.. Apache-2
  • Auptimizer (13 180 ) - An automatic ML model optimization tool. GPL-3.0
  • Hypermax (12 99 ) - Better, faster hyper-parameter optimization. BSD-3
  • Devol (11 940 ) - Genetic neural architecture search with Keras. MIT
  • Hypertunity (9 120 ) - A toolset for black-box hyperparameter optimisation. Apache-2

Reinforcement Learning

Back to top

Libraries for building and evaluating reinforcement learning & agent-based systems.

OpenAI Gym (42 26K) - A toolkit for developing and comparing reinforcement learning.. MIT
  • GitHub ( 330 7.3K 26K 1.5K - 7% open 11.01.2022):

     git clone https://github.com/openai/gym
    
  • PyPi ( 970K / month 2.3K 06.10.2021):

     pip install gym
    
TF-Agents (33 2.2K) - TF-Agents: A reliable, scalable and easy to use TensorFlow.. Apache-2
  • GitHub ( 120 590 670 520 - 20% open 12.01.2022):

     git clone https://github.com/tensorflow/agents
    
  • PyPi ( 70K / month 14 15.11.2021):

     pip install tf-agents
    
Dopamine (31 9.7K) - Dopamine is a research framework for fast prototyping of.. Apache-2
  • GitHub ( 14 1.3K 160 - 49% open 14.12.2021):

     git clone https://github.com/google/dopamine
    
  • PyPi ( 1.2M / month 37 13.12.2021):

     pip install dopamine-rl
    
TensorForce (29 3.1K) - Tensorforce: a TensorFlow library for applied.. Apache-2
  • GitHub ( 82 510 630 - 1% open 08.01.2022):

     git clone https://github.com/tensorforce/tensorforce
    
  • PyPi ( 1.4K / month 26 07.09.2019):

     pip install tensorforce
    
FinRL (28 3.2K) - FinRL: Financial Reinforcement Learning Framework. Please star. MIT
  • GitHub ( 50 860 8 280 - 29% open 13.01.2022):

     git clone https://github.com/AI4Finance-LLC/FinRL
    
  • PyPi ( 360 / month 08.01.2022):

     pip install finrl
    
Acme (28 2.5K) - A library of reinforcement learning components and agents. Apache-2
  • GitHub ( 54 290 49 160 - 25% open 13.01.2022):

     git clone https://github.com/deepmind/acme
    
  • PyPi ( 2.6K / month 2 08.01.2022):

     pip install dm-acme
    
ViZDoom (28 1.3K) - Doom-based AI Research Platform for Reinforcement Learning from Raw.. MIT
  • GitHub ( 45 310 11K 120 430 - 20% open 10.01.2022):

     git clone https://github.com/mwydmuch/ViZDoom
    
  • PyPi ( 930 / month 14 22.11.2021):

     pip install vizdoom
    
TensorLayer (27 6.8K) - Deep Learning and Reinforcement Learning Library for.. Apache-2
  • GitHub ( 130 1.5K 1.3K 460 - 4% open 29.10.2021):

     git clone https://github.com/tensorlayer/tensorlayer
    
  • PyPi ( 2.3K / month 39 19.06.2020):

     pip install tensorlayer
    
PARL (27 2.4K) - A high-performance distributed training framework for Reinforcement.. Apache-2
  • GitHub ( 28 600 86 310 - 23% open 07.01.2022):

     git clone https://github.com/PaddlePaddle/PARL
    
  • PyPi ( 760 / month 30.12.2021):

     pip install parl
    
Stable Baselines (25 3.4K) - A fork of OpenAI Baselines, implementations of reinforcement.. MIT
  • GitHub ( 110 660 930 - 13% open 25.08.2021):

     git clone https://github.com/hill-a/stable-baselines
    
  • PyPi ( 9.1K / month 34 06.04.2021):

     pip install stable-baselines
    
garage (24 1.4K) - A toolkit for reproducible reinforcement learning research. MIT
  • GitHub ( 78 240 26 1K - 20% open 20.10.2021):

     git clone https://github.com/rlworkgroup/garage
    
  • PyPi ( 740 / month 2 23.03.2021):

     pip install garage
    
TRFL (23 3.1K) - TensorFlow Reinforcement Learning. Apache-2
  • GitHub ( 13 370 69 22 - 27% open 16.08.2021):

     git clone https://github.com/deepmind/trfl
    
  • PyPi ( 2.7K / month 3 16.08.2021):

     pip install trfl
    
ChainerRL (23 1K ) - ChainerRL is a deep reinforcement learning library built on top of.. MIT
  • GitHub ( 29 210 110 220 - 33% open 17.04.2021):

     git clone https://github.com/chainer/chainerrl
    
  • PyPi ( 450 / month 8 14.02.2020):

     pip install chainerrl
    
Coach (22 2.1K ) - Reinforcement Learning Coach by Intel AI Lab enables easy.. Apache-2
  • GitHub ( 35 410 270 - 32% open 28.06.2021):

     git clone https://github.com/IntelLabs/coach
    
  • PyPi ( 150 / month 2 10.10.2019):

     pip install rl_coach
    
RLax (21 730) - A library of reinforcement learning building blocks in JAX. Apache-2 jax
  • GitHub ( 16 56 32 18 - 50% open 12.01.2022):

     git clone https://github.com/deepmind/rlax
    
  • PyPi ( 1.9K / month 19.11.2021):

     pip install rlax
    
ReAgent (20 3.1K) - A platform for Reasoning systems (Reinforcement Learning,.. BSD-3
  • GitHub ( 120 430 120 - 35% open 29.12.2021):

     git clone https://github.com/facebookresearch/ReAgent
    
PFRL (20 770) - PFRL: a PyTorch-based deep reinforcement learning library. MIT
  • GitHub ( 15 100 30 60 - 45% open 06.12.2021):

     git clone https://github.com/pfnet/pfrl
    
  • PyPi ( 4.4K / month 1 07.07.2021):

     pip install pfrl
    
Show 4 hidden projects...
  • baselines (30 12K ) - OpenAI Baselines: high-quality implementations of reinforcement.. MIT
  • keras-rl (29 5.2K ) - Deep Reinforcement Learning for Keras. MIT
  • DeepMind Lab (19 6.6K) - A customisable 3D platform for agent-based AI research. GPL-2.0
  • Maze (16 200) - Maze Applied Reinforcement Learning Framework. Custom

Recommender Systems

Back to top

Libraries for building and evaluating recommendation systems.

implicit (31 2.6K) - Fast Python Collaborative Filtering for Implicit Feedback Datasets. MIT
  • GitHub ( 30 510 530 390 - 22% open 12.01.2022):

     git clone https://github.com/benfred/implicit
    
  • PyPi ( 110K / month 29 29.08.2021):

     pip install implicit
    
  • Conda ( 330K 29.08.2021):

     conda install -c conda-forge implicit
    
TF Recommenders (29 1.2K) - TensorFlow Recommenders is a library for building.. Apache-2
  • GitHub ( 30 160 64 220 - 52% open 07.01.2022):

     git clone https://github.com/tensorflow/recommenders
    
  • PyPi ( 200K / month 1 23.08.2021):

     pip install tensorflow-recommenders
    
Recommenders (28 12K) - Best Practices on Recommendation Systems. MIT
  • GitHub ( 100 2.1K 93 9 650 - 20% open 13.01.2022):

     git clone https://github.com/microsoft/recommenders
    
lightfm (28 3.9K) - A Python implementation of LightFM, a hybrid recommendation algorithm. Apache-2
  • GitHub ( 44 610 620 440 - 20% open 31.12.2021):

     git clone https://github.com/lyst/lightfm
    
  • PyPi ( 190K / month 45 27.11.2020):

     pip install lightfm
    
  • Conda ( 110K 07.02.2021):

     conda install -c conda-forge lightfm
    
TF Ranking (28 2.4K) - Learning to Rank in TensorFlow. Apache-2
  • GitHub ( 25 410 280 - 16% open 22.11.2021):

     git clone https://github.com/tensorflow/ranking
    
  • PyPi ( 48K / month 11 16.11.2021):

     pip install tensorflow_ranking
    
RecBole (25 1.6K) - A unified, comprehensive and efficient recommendation library. MIT
  • GitHub ( 43 270 280 - 20% open 10.01.2022):

     git clone https://github.com/RUCAIBox/RecBole
    
  • PyPi ( 670 / month 16.09.2021):

     pip install recbole
    
  • Conda ( 980 16.09.2021):

     conda install -c aibox recbole
    
Cornac (25 510) - A Comparative Framework for Multimodal Recommender Systems. Apache-2
  • GitHub ( 13 84 72 80 - 6% open 30.09.2021):

     git clone https://github.com/PreferredAI/cornac
    
  • PyPi ( 7.1K / month 14 26.09.2021):

     pip install cornac
    
  • Conda ( 200K 15.11.2021):

     conda install -c conda-forge cornac
    
fastFM (22 960 ) - fastFM: A Library for Factorization Machines. BSD-3
  • GitHub ( 20 190 430 93 110 - 43% open 24.03.2021):

     git clone https://github.com/ibayer/fastFM
    
  • PyPi ( 600 / month 8 23.11.2017):

     pip install fastfm
    
Case Recommender (18 380) - Case Recommender: A Flexible and Extensible Python.. MIT
  • GitHub ( 11 77 9 27 - 25% open 25.11.2021):

     git clone https://github.com/caserec/CaseRecommender
    
  • PyPi ( 540 / month 25.11.2021):

     pip install caserecommender
    
recmetrics (18 340) - A library of metrics for evaluating recommender systems. MIT
  • GitHub ( 13 76 22 17 - 41% open 27.10.2021):

     git clone https://github.com/statisticianinstilettos/recmetrics
    
  • PyPi ( 600 / month 24.09.2021):

     pip install recmetrics
    
Show 5 hidden projects...
  • scikit-surprise (28 5.2K ) - A Python scikit for building and analyzing recommender.. BSD-3
  • tensorrec (22 1.2K ) - A TensorFlow recommendation algorithm and framework in.. Apache-2
  • lkpy (21 190) - Python recommendation toolkit. MIT
  • Spotlight (18 2.6K ) - Deep recommender models using PyTorch. MIT
  • OpenRec (16 390 ) - OpenRec is an open-source and modular library for neural network-.. Apache-2

Privacy Machine Learning

Back to top

Libraries for encrypted and privacy-preserving machine learning using methods like federated learning & differential privacy.

PySyft (35 7.9K) - A library for answering questions using data you cannot see. Apache-2
  • GitHub ( 430 1.7K 3.1K - 9% open 12.01.2022):

     git clone https://github.com/OpenMined/PySyft
    
  • PyPi ( 3.3K / month 5 01.12.2021):

     pip install syft
    
FATE (28 3.9K) - An Industrial Grade Federated Learning Framework. Apache-2
  • GitHub ( 69 1.1K 1.1K - 32% open 13.01.2022):

     git clone https://github.com/FederatedAI/FATE
    
Opacus (28 1K) - Training PyTorch models with differential privacy. Apache-2
  • GitHub ( 40 170 40 72 120 - 20% open 04.01.2022):

     git clone https://github.com/pytorch/opacus
    
  • PyPi ( 2.8K / month 10 04.01.2022):

     pip install opacus
    
TensorFlow Privacy (26 1.5K) - Library for training machine learning models with.. Apache-2
  • GitHub ( 43 330 59 150 - 41% open 10.01.2022):

     git clone https://github.com/tensorflow/privacy
    
  • PyPi ( 22K / month 6 01.09.2021):

     pip install tensorflow-privacy
    
CrypTen (24 980) - A framework for Privacy Preserving Machine Learning. MIT
  • GitHub ( 25 150 12 140 - 23% open 15.12.2021):

     git clone https://github.com/facebookresearch/CrypTen
    
  • PyPi ( 240 / month 09.09.2021):

     pip install crypten
    
Show 1 hidden projects...

Workflow & Experiment Tracking

Back to top

Libraries to organize, track, and visualize machine learning experiments.

Tensorboard (42 5.7K) - TensorFlows Visualization Toolkit. Apache-2
  • GitHub ( 280 1.5K 91K 1.6K - 34% open 13.01.2022):

     git clone https://github.com/tensorflow/tensorboard
    
  • PyPi ( 11M / month 2.3K 13.10.2021):

     pip install tensorboard
    
  • Conda ( 2.7M 10.11.2021):

     conda install -c conda-forge tensorboard
    
mlflow (40 11K) - Open source platform for the machine learning lifecycle. Apache-2
  • GitHub ( 350 2.4K 2.1K - 44% open 13.01.2022):

     git clone https://github.com/mlflow/mlflow
    
  • PyPi ( 15M / month 260 30.11.2021):

     pip install mlflow
    
  • Conda ( 460K 08.12.2021):

     conda install -c conda-forge mlflow
    
DVC (36 9.1K) - Data Version Control | Git for Data & Models | ML Experiments Management. Apache-2
  • GitHub ( 260 880 54K 3.5K - 17% open 13.01.2022):

     git clone https://github.com/iterative/dvc
    
  • PyPi ( 260K / month 46 22.12.2021):

     pip install dvc
    
  • Conda ( 970K 25.12.2021):

     conda install -c conda-forge dvc
    
PyCaret (36 4.8K ) - An open-source, low-code machine learning library in Python. MIT
  • GitHub ( 69 1.1K 510 1.7K 1.3K - 16% open 12.01.2022):

     git clone https://github.com/pycaret/pycaret
    
  • PyPi ( 280K / month 11 12.01.2022):

     pip install pycaret
    
SageMaker SDK (36 1.5K) - A library for training and deploying machine learning.. Apache-2
  • GitHub ( 240 730 1K 980 - 33% open 12.01.2022):

     git clone https://github.com/aws/sagemaker-python-sdk
    
  • PyPi ( 2.1M / month 44 10.01.2022):

     pip install sagemaker
    
tensorboardX (34 7.2K) - tensorboard for pytorch (and chainer, mxnet, numpy, ...). MIT
  • GitHub ( 70 840 340 16K 440 - 16% open 26.12.2021):

     git clone https://github.com/lanpa/tensorboardX
    
  • PyPi ( 540K / month 860 21.11.2021):

     pip install tensorboardX
    
  • Conda ( 630K 10.08.2021):

     conda install -c conda-forge tensorboardx
    
wandb client (34 3.6K) - A tool for visualizing and tracking your machine learning.. MIT
  • GitHub ( 98 280 1.6K - 22% open 12.01.2022):

     git clone https://github.com/wandb/client
    
  • PyPi ( 490K / month 190 17.12.2021):

     pip install wandb
    
AzureML SDK (34 2.8K) - Python notebooks with ML and deep learning examples with Azure.. MIT
  • GitHub ( 57 2K 430 1.2K - 16% open 13.12.2021):

     git clone https://github.com/Azure/MachineLearningNotebooks
    
  • PyPi ( 1.8M / month 45 13.12.2021):

     pip install azureml-sdk
    
snakemake (33 1.2K) - This is the development home of the workflow management system.. MIT
  • GitHub ( 230 280 1K 820 - 64% open 11.01.2022):

     git clone https://github.com/snakemake/snakemake
    
  • PyPi ( 49K / month 200 11.01.2022):

     pip install snakemake
    
  • Conda ( 370K 12.01.2022):

     conda install -c bioconda snakemake
    
Metaflow (32 5.2K) - Build and manage real-life data science projects with ease!. Apache-2
  • GitHub ( 44 460 240 410 - 53% open 11.01.2022):

     git clone https://github.com/Netflix/metaflow
    
  • PyPi ( 45K / month 4 11.01.2022):

     pip install metaflow
    
  • Conda ( 30K 11.01.2022):

     conda install -c conda-forge metaflow
    
ClearML (32 2.9K) - ClearML - Auto-Magical CI/CD to streamline your ML workflow... Apache-2
  • GitHub ( 44 400 380 170 440 - 33% open 13.01.2022):

     git clone https://github.com/allegroai/clearml
    
  • PyPi ( 59K / month 4 08.01.2022):

     pip install clearml
    
  • Docker Hub ( 30K 05.10.2020):

     docker pull allegroai/trains
    
VisualDL (31 4.3K) - Deep Learning Visualization Toolkit. Apache-2
  • GitHub ( 31 580 160 900 390 - 14% open 30.12.2021):

     git clone https://github.com/PaddlePaddle/VisualDL
    
  • PyPi ( 42K / month 23 06.01.2022):

     pip install visualdl
    
Catalyst (31 2.8K) - Accelerated deep learning R&D. Apache-2
  • GitHub ( 100 350 460 330 - 1% open 28.12.2021):

     git clone https://github.com/catalyst-team/catalyst
    
  • PyPi ( 8.9K / month 29 28.12.2021):

     pip install catalyst
    
aim (31 1.9K) - Aim an easy-to-use and performant open-source experiment tracker. Apache-2
  • GitHub ( 25 110 49 300 - 37% open 13.01.2022):

     git clone https://github.com/aimhubio/aim
    
  • PyPi ( 8.5K / month 2 10.01.2022):

     pip install aim
    
sacred (29 3.7K) - Sacred is a tool to help you configure, organize, log and reproduce.. MIT
  • GitHub ( 95 340 1.2K 520 - 18% open 05.11.2021):

     git clone https://github.com/IDSIA/sacred
    
  • PyPi ( 30K / month 100 14.12.2020):

     pip install sacred
    
kaggle (28 4.5K ) - Official Kaggle API. Apache-2
  • GitHub ( 36 870 330 - 56% open 15.03.2021):

     git clone https://github.com/Kaggle/kaggle-api
    
  • PyPi ( 110K / month 320 13.03.2021):

     pip install kaggle
    
  • Conda ( 76K 17.12.2021):

     conda install -c conda-forge kaggle
    
ml-metadata (28 420) - For recording and retrieving metadata associated with ML.. Apache-2
  • GitHub ( 13 81 1.7K 170 78 - 25% open 24.12.2021):

     git clone https://github.com/google/ml-metadata
    
  • PyPi ( 460K / month 18 30.11.2021):

     pip install ml-metadata
    
livelossplot (25 1.2K) - Live training loss plot in Jupyter Notebook for Keras,.. MIT
  • GitHub ( 17 140 700 74 - 5% open 12.10.2021):

     git clone https://github.com/stared/livelossplot
    
  • PyPi ( 65K / month 8 03.02.2021):

     pip install livelossplot
    
Labml (25 940) - Monitor deep learning model training and hardware usage from your mobile.. MIT
  • GitHub ( 6 69 41 25 - 52% open 13.01.2022):

     git clone https://github.com/labmlai/labml
    
  • PyPi ( 16K / month 6 13.01.2022):

     pip install labml
    
Guild AI (25 650) - Experiment tracking, ML developer tools. Apache-2
  • GitHub ( 18 59 42 300 - 40% open 12.01.2022):

     git clone https://github.com/guildai/guildai
    
  • PyPi ( 2.8K / month 07.01.2022):

     pip install guildai
    
TNT (23 1.4K ) - Simple tools for logging and visualizing, loading and training. BSD-3
  • GitHub ( 35 190 65 - 46% open 05.01.2021):

     git clone https://github.com/pytorch/tnt
    
  • PyPi ( 17K / month 36 29.07.2018):

     pip install torchnet
    
Studio.ml (22 370) - Studio: Simplify and expedite model building process. Apache-2
  • GitHub ( 21 52 5 250 - 22% open 14.09.2021):

     git clone https://github.com/studioml/studio
    
  • PyPi ( 260 / month 14.09.2021):

     pip install studioml
    
TensorWatch (21 3.2K ) - Debugging, monitoring and visualization for Python Machine.. MIT
  • GitHub ( 13 340 65 65 - 76% open 15.01.2021):

     git clone https://github.com/microsoft/tensorwatch
    
  • PyPi ( 2.9K / month 6 04.03.2020):

     pip install tensorwatch
    
quinn (19 300 ) - pyspark methods to enhance developer productivity. Apache-2
  • GitHub ( 6 40 26 - 65% open 09.02.2021):

     git clone https://github.com/MrPowers/quinn
    
  • PyPi ( 470K / month 4 06.02.2021):

     pip install quinn
    
keepsake (18 1.5K ) - Version control for machine learning. Apache-2
  • GitHub ( 16 59 190 - 65% open 07.05.2021):

     git clone https://github.com/replicate/keepsake
    
  • PyPi ( 730 / month 11.03.2021):

     pip install keepsake
    
Show 11 hidden projects...
  • SKLL (25 530) - SciKit-Learn Laboratory (SKLL) makes it easy to run machine.. BSD-1-Clause
  • knockknock (22 2.3K ) - Knock Knock: Get notified when your training ends with only two.. MIT
  • hiddenlayer (21 1.6K ) - Neural network graphs and training metrics for.. MIT
  • lore (21 1.5K ) - Lore makes machine learning approachable for Software Engineers and.. MIT
  • TensorBoard Logger (21 620 ) - Log TensorBoard events without touching TensorFlow. MIT
  • gokart (21 230) - Gokart solves reproducibility, task dependencies, constraints of good code,.. MIT
  • MXBoard (20 330 ) - Logging MXNet data for visualization in TensorBoard. Apache-2
  • datmo (19 340 ) - Open source production model management tool for data scientists. MIT
  • steppy (16 130 ) - Lightweight, Python library for fast and reproducible experimentation. MIT
  • ModelChimp (13 120) - Experiment tracking for machine and deep learning projects. BSD-2
  • traintool (8 9 ) - Train off-the-shelf machine learning models in one.. Apache-2

Model Serialization & Deployment

Back to top

Libraries to serialize models to files, convert between a variety of model formats, and optimize models for deployment.

onnx (40 12K) - Open standard for machine learning interoperability. Apache-2
  • GitHub ( 220 2.3K 17K 5.2K 1.8K - 26% open 13.01.2022):

     git clone https://github.com/onnx/onnx
    
  • PyPi ( 1.2M / month 320 26.10.2021):

     pip install onnx
    
  • Conda ( 340K 14.12.2021):

     conda install -c conda-forge onnx
    
Core ML Tools (31 2.5K) - Core ML tools contain supporting tools for Core ML model.. BSD-3
  • GitHub ( 120 390 3.9K 730 850 - 37% open 10.01.2022):

     git clone https://github.com/apple/coremltools
    
  • PyPi ( 95K / month 140 09.11.2021):

     pip install coremltools
    
TorchServe (30 2.4K) - Model Serving on PyTorch. Apache-2
  • GitHub ( 95 430 790 770 - 16% open 07.01.2022):

     git clone https://github.com/pytorch/serve
    
  • PyPi ( 9.2K / month 8 29.12.2021):

     pip install torchserve
    
  • Conda ( 18K 29.12.2021):

     conda install -c pytorch torchserve
    
  • Docker Hub ( 960K 11 29.12.2021):

     docker pull pytorch/torchserve
    
BentoML (29 3.1K) - Model Serving Made Easy. Apache-2
  • GitHub ( 90 360 930 160 520 - 17% open 03.12.2021):

     git clone https://github.com/bentoml/BentoML
    
  • PyPi ( 10K / month 1 14.12.2021):

     pip install bentoml
    
huggingface_hub (29 310) - All the open source things related to the Hugging Face Hub. Apache-2
  • GitHub ( 50 64 200 - 52% open 13.01.2022):

     git clone https://github.com/huggingface/huggingface_hub
    
  • PyPi ( 1.9M / month 51 11.01.2022):

     pip install huggingface_hub
    
  • Conda ( 24K 12.01.2022):

     conda install -c conda-forge huggingface_hub
    
cortex (26 7.6K) - Production infrastructure for machine learning at scale. Apache-2
  • GitHub ( 23 590 1.1K - 9% open 13.01.2022):

     git clone https://github.com/cortexlabs/cortex
    
  • PyPi ( 920 / month 1 10.01.2022):

     pip install cortex
    
Hummingbird (26 2.7K) - Hummingbird compiles trained ML models into tensor computation for.. MIT
  • GitHub ( 27 210 150 20 230 - 23% open 12.01.2022):

     git clone https://github.com/microsoft/hummingbird
    
  • PyPi ( 1.5K / month 14.12.2021):

     pip install hummingbird-ml
    
m2cgen (24 2K) - Transform ML models into a native code (Java, C, Python, Go, JavaScript,.. MIT
  • GitHub ( 12 170 9 87 - 41% open 13.01.2022):

     git clone https://github.com/BayesWitnesses/m2cgen
    
  • PyPi ( 58K / month 18.09.2020):

     pip install m2cgen
    
pytorch2keras (18 770) - PyTorch to Keras model convertor. MIT
  • GitHub ( 13 130 26 120 - 42% open 06.08.2021):

     git clone https://github.com/gmalivenko/pytorch2keras
    
  • PyPi ( 620 / month 1 14.05.2020):

     pip install pytorch2keras
    
tfdeploy (16 340 ) - Deploy tensorflow graphs for fast evaluation and export to.. BSD-3
  • GitHub ( 4 38 34 - 32% open 08.01.2021):

     git clone https://github.com/riga/tfdeploy
    
  • PyPi ( 66 / month 2 30.03.2017):

     pip install tfdeploy
    
Show 4 hidden projects...
  • mmdnn (25 5.5K ) - MMdnn is a set of tools to help users inter-operate among different deep.. MIT
  • Larq Compute Engine (20 180) - Highly optimized inference engine for Binarized.. Apache-2
  • sklearn-porter (19 1.1K ) - Transpile trained scikit-learn estimators to C, Java,.. MIT
  • backprop (13 220 ) - Backprop makes it simple to use, finetune, and deploy state-of-.. Apache-2

Model Interpretability

Back to top

Libraries to visualize, explain, debug, evaluate, and interpret machine learning models.

shap (39 15K) - A game theoretic approach to explain the output of any machine learning model. MIT
  • GitHub ( 160 2.2K 4.2K 1.8K - 68% open 04.12.2021):

     git clone https://github.com/slundberg/shap
    
  • PyPi ( 4.7M / month 220 20.10.2021):

     pip install shap
    
  • Conda ( 750K 24.10.2021):

     conda install -c conda-forge shap
    
arviz (34 1.1K) - Exploratory analysis of Bayesian models with Python. Apache-2
  • GitHub ( 110 260 110 1.8K 750 - 25% open 07.01.2022):

     git clone https://github.com/arviz-devs/arviz
    
  • PyPi ( 240K / month 71 03.10.2021):

     pip install arviz
    
  • Conda ( 600K 03.10.2021):

     conda install -c conda-forge arviz
    
Lime (32 9.5K) - Lime: Explaining the predictions of any machine learning classifier. BSD-2
  • GitHub ( 61 1.5K 1.9K 550 - 4% open 29.07.2021):

     git clone https://github.com/marcotcr/lime
    
  • PyPi ( 1.2M / month 110 26.06.2020):

     pip install lime
    
  • Conda ( 90K 28.06.2020):

     conda install -c conda-forge lime
    
InterpretML (31 4.4K) - Fit interpretable models. Explain blackbox machine learning. MIT
  • GitHub ( 28 540 140 270 - 31% open 13.01.2022):

     git clone https://github.com/interpretml/interpret
    
  • PyPi ( 53K / month 8 23.09.2021):

     pip install interpret
    
Captum (31 2.9K) - Model interpretability and understanding for PyTorch. BSD-3
  • GitHub ( 77 300 360 320 - 24% open 14.12.2021):

     git clone https://github.com/pytorch/captum
    
  • PyPi ( 25K / month 17 02.11.2021):

     pip install captum
    
pyLDAvis (30 1.6K ) - Python library for interactive topic model visualization... BSD-3
  • GitHub ( 32 320 3K 160 - 52% open 24.03.2021):

     git clone https://github.com/bmabey/pyLDAvis
    
  • PyPi ( 470K / month 130 24.03.2021):

     pip install pyldavis
    
  • Conda ( 33K 24.03.2021):

     conda install -c conda-forge pyldavis
    
Model Analysis (30 1.1K) - Model analysis tools for TensorFlow. Apache-2
  • GitHub ( 36 240 72 - 34% open 12.01.2022):

     git clone https://github.com/tensorflow/model-analysis
    
  • PyPi ( 6.5M / month 21 02.12.2021):

     pip install tensorflow-model-analysis
    
shapash (29 1.5K) - Shapash makes Machine Learning models transparent and.. Apache-2
  • GitHub ( 31 200 59 96 - 11% open 04.01.2022):

     git clone https://github.com/MAIF/shapash
    
  • PyPi ( 13K / month 06.12.2021):

     pip install shapash
    
DoWhy (27 3.5K) - DoWhy is a Python library for causal inference that supports explicit.. MIT
  • GitHub ( 47 540 24 80 170 - 31% open 11.01.2022):

     git clone https://github.com/Microsoft/dowhy
    
  • PyPi ( 50K / month 3 10.01.2022):

     pip install dowhy
    
  • Conda ( 4.2K 28.04.2021):

     conda install -c conda-forge dowhy
    
dtreeviz (27 1.9K) - A python library for decision tree visualization and model interpretation. MIT
  • GitHub ( 17 240 290 110 - 17% open 03.01.2022):

     git clone https://github.com/parrt/dtreeviz
    
  • PyPi ( 56K / month 13 10.11.2021):

     pip install dtreeviz
    
Alibi (27 1.5K) - Algorithms for explaining machine learning models. Apache-2
  • GitHub ( 18 160 130 250 - 41% open 12.01.2022):

     git clone https://github.com/SeldonIO/alibi
    
  • PyPi ( 35K / month 17 18.11.2021):

     pip install alibi
    
yellowbrick (26 3.5K) - Visual analysis and diagnostic tools to facilitate machine.. Apache-2
  • GitHub ( 100 500 640 - 14% open 05.01.2022):

     git clone https://github.com/DistrictDataLabs/yellowbrick
    
  • PyPi ( 260K / month 47 13.02.2021):

     pip install yellowbrick
    
Lucid (25 4.4K ) - A collection of infrastructure and tools for research in.. Apache-2
  • GitHub ( 40 590 600 170 - 41% open 19.03.2021):

     git clone https://github.com/tensorflow/lucid
    
  • PyPi ( 930 / month 6 19.03.2021):

     pip install lucid
    
checklist (25 1.6K) - Beyond Accuracy: Behavioral Testing of NLP models with CheckList. MIT
  • GitHub ( 12 150 67 91 - 13% open 28.09.2021):

     git clone https://github.com/marcotcr/checklist
    
  • PyPi ( 22K / month 3 24.05.2021):

     pip install checklist
    
CausalNex (25 1.4K) - A Python library that helps data scientists to infer.. Apache-2
  • GitHub ( 22 150 36 100 - 14% open 11.11.2021):

     git clone https://github.com/quantumblacklabs/causalnex
    
  • PyPi ( 4.8K / month 2 11.11.2021):

     pip install causalnex
    
fairlearn (25 1.2K) - A Python package to assess and improve fairness of machine.. MIT
  • GitHub ( 61 270 330 - 40% open 15.12.2021):

     git clone https://github.com/fairlearn/fairlearn
    
  • PyPi ( 24K / month 9 07.07.2021):

     pip install fairlearn
    
  • Conda ( 17K 07.07.2021):

     conda install -c conda-forge fairlearn
    
responsible-ai-widgets (25 380) - This project provides responsible AI user interfaces.. MIT
  • GitHub ( 24 91 18 220 - 21% open 13.01.2022):

     git clone https://github.com/microsoft/responsible-ai-widgets
    
  • PyPi ( 3.7K / month 2 05.01.2022):

     pip install raiwidgets
    
LIT (24 2.7K) - The Language Interpretability Tool: Interactively analyze NLP models for.. Apache-2
  • GitHub ( 18 270 7 130 - 50% open 21.12.2021):

     git clone https://github.com/PAIR-code/lit
    
  • PyPi ( 650 / month 21.12.2021):

     pip install lit-nlp
    
Fairness 360 (24 1.6K) - A comprehensive set of fairness metrics for datasets and.. Apache-2
  • GitHub ( 46 500 130 120 - 50% open 18.11.2021):

     git clone https://github.com/Trusted-AI/AIF360
    
  • PyPi ( 5.5K / month 8 04.03.2021):

     pip install aif360
    
Explainability 360 (23 1K) - Interpretability and explainability of data and machine.. Apache-2
  • GitHub ( 29 210 37 63 - 60% open 12.10.2021):

     git clone https://github.com/Trusted-AI/AIX360
    
  • PyPi ( 1.2K / month 1 28.10.2020):

     pip install aix360
    
explainerdashboard (23 960) - Quickly build Explainable AI dashboards that show the inner.. MIT
  • GitHub ( 12 120 51 150 - 11% open 24.12.2021):

     git clone https://github.com/oegedijk/explainerdashboard
    
  • PyPi ( 5.2K / month 24.10.2021):

     pip install explainerdashboard
    
tf-explain (23 890) - Interpretability Methods for tf.keras models with Tensorflow 2.x. MIT
  • GitHub ( 16 91 95 89 - 43% open 30.11.2021):

     git clone https://github.com/sicara/tf-explain
    
  • PyPi ( 1.4K / month 6 18.11.2021):

     pip install tf-explain
    
keract (22 960) - Layers Outputs and Gradients in Keras. Made easy. MIT
  • GitHub ( 16 180 110 84 - 3% open 12.01.2022):

     git clone https://github.com/philipperemy/keract
    
  • PyPi ( 1.3K / month 5 19.06.2021):

     pip install keract
    
DiCE (22 750) - Generate Diverse Counterfactual Explanations for any machine.. MIT
  • GitHub ( 12 100 99 - 46% open 07.01.2022):

     git clone https://github.com/interpretml/DiCE
    
  • PyPi ( 16K / month 3 27.09.2021):

     pip install dice-ml
    
TreeInterpreter (22 690 ) - Package for interpreting scikit-learns decision tree.. BSD-3
  • GitHub ( 11 130 190 28 - 85% open 28.02.2021):

     git clone https://github.com/andosa/treeinterpreter
    
  • PyPi ( 160K / month 10 10.01.2021):

     pip install treeinterpreter
    
What-If Tool (22 620) - Source code/webpage/demos for the What-If Tool. Apache-2
  • GitHub ( 20 120 100 - 54% open 05.01.2022):

     git clone https://github.com/PAIR-code/what-if-tool
    
  • PyPi ( 6.2K / month 3 12.10.2021):

     pip install witwidget
    
  • npm ( 3.2K / month 12.10.2021):

     npm install wit-widget
    
random-forest-importances (22 490 ) - Code to compute permutation and drop-column.. MIT
  • GitHub ( 14 110 88 34 - 17% open 30.01.2021):

     git clone https://github.com/parrt/random-forest-importances
    
  • PyPi ( 13K / month 5 28.01.2021):

     pip install rfpimp
    
ecco (21 1.3K) - Explain, analyze, and visualize NLP language models. Ecco creates.. BSD-3
  • GitHub ( 9 79 2 3 28 - 28% open 12.01.2022):

     git clone https://github.com/jalammar/ecco
    
  • PyPi ( 560 / month 1 09.01.2022):

     pip install ecco
    
imodels (21 430) - Interpretable ML package for concise, transparent, and accurate predictive.. MIT
  • GitHub ( 7 43 10 18 - 22% open 09.01.2022):

     git clone https://github.com/csinva/imodels
    
  • PyPi ( 640 / month 07.12.2021):

     pip install imodels
    
iNNvestigate (20 940) - A toolbox to iNNvestigate neural networks predictions!. BSD-2
  • GitHub ( 19 200 230 - 30% open 03.08.2021):

     git clone https://github.com/albermax/innvestigate
    
  • PyPi ( 370 / month 1 14.11.2020):

     pip install innvestigate
    
deeplift (20 600) - Public facing deeplift repo. MIT
  • GitHub ( 11 130 53 82 - 41% open 11.11.2021):

     git clone https://github.com/kundajelab/deeplift
    
  • PyPi ( 400 / month 4 11.11.2020):

     pip install deeplift
    
aequitas (20 450 ) - Bias and Fairness Audit Toolkit. MIT
  • GitHub ( 16 85 87 58 - 63% open 27.05.2021):

     git clone https://github.com/dssg/aequitas
    
  • PyPi ( 740 / month 6 16.12.2020):

     pip install aequitas
    
sklearn-evaluation (20 320) - Machine learning model evaluation made easy: plots,.. MIT
  • GitHub ( 6 28 38 37 - 21% open 17.10.2021):

     git clone https://github.com/edublancas/sklearn-evaluation
    
  • PyPi ( 670 / month 2 17.10.2021):

     pip install sklearn-evaluation
    
tcav (19 500) - Code for the TCAV ML interpretability project. Apache-2
  • GitHub ( 19 120 11 59 - 10% open 16.09.2021):

     git clone https://github.com/tensorflow/tcav
    
  • PyPi ( 97 / month 3 23.02.2021):

     pip install tcav
    
XAI (18 760) - XAI - An eXplainability toolbox for machine learning. MIT
  • GitHub ( 3 120 11 8 - 12% open 30.10.2021):

     git clone https://github.com/EthicalML/xai
    
  • PyPi ( 330 / month 6 30.10.2021):

     pip install xai
    
Anchor (16 680) - Code for High-Precision Model-Agnostic Explanations paper. BSD-2
  • GitHub ( 10 93 69 - 26% open 17.11.2021):

     git clone https://github.com/marcotcr/anchor
    
  • PyPi ( 3.2K / month 26.06.2020):

     pip install anchor_exp
    
LOFO (16 420) - Leave One Feature Out Importance. MIT
  • GitHub ( 3 52 6 17 - 23% open 04.10.2021):

     git clone https://github.com/aerdem4/lofo-importance
    
  • PyPi ( 200 / month 04.10.2021):

     pip install lofo-importance
    
FlashTorch (15 640 ) - Visualization toolkit for neural networks in PyTorch! Demo --. MIT
  • GitHub ( 2 76 9 32 - 31% open 27.04.2021):

     git clone https://github.com/MisaOgura/flashtorch
    
  • PyPi ( 280 / month 29.05.2020):

     pip install flashtorch
    
Show 12 hidden projects...
  • eli5 (27 2.5K ) - A library for debugging/inspecting machine learning classifiers and.. MIT
  • scikit-plot (26 2.2K ) - An intuitive library to add plotting functionality to.. MIT
  • keras-vis (25 2.9K ) - Neural network visualization toolkit for keras. MIT
  • DALEX (22 980) - moDel Agnostic Language for Exploration and eXplanation. GPL-3.0
  • Skater (21 1K ) - Python Library for Model Interpretation/Explanations. UPL-1.0
  • model-card-toolkit (18 250) - a tool that leverages rich metadata and lineage.. Apache-2
  • fairness-indicators (18 240) - Tensorflows Fairness Evaluation and Visualization.. Apache-2
  • interpret-text (15 290) - A library that incorporates state-of-the-art explainers for.. MIT
  • ExplainX.ai (15 270 ) - Explainable AI framework for data scientists. Explain & debug any.. MIT
  • Attribution Priors (12 91 ) - Tools for training explainable models using.. MIT
  • contextual-ai (12 77) - Contextual AI adds explainability to different stages of.. Apache-2
  • bias-detector (12 37) - Bias Detector is a python package for detecting bias in machine.. MIT

Vector Similarity Search (ANN)

Back to top

Libraries for Approximate Nearest Neighbor Search and Vector Indexing/Similarity Search.

 ANN Benchmarks ( 2.7K) - Benchmarks of approximate nearest neighbor libraries in Python.

Milvus (38 9.2K) - An open-source vector database for embedding similarity search and AI.. Apache-2
  • GitHub ( 180 1.4K 6.3K 4.3K - 5% open 13.01.2022):

     git clone https://github.com/milvus-io/milvus
    
  • PyPi ( 24K / month 15 31.12.2021):

     pip install pymilvus
    
  • Docker Hub ( 720K 17 31.12.2021):

     docker pull milvusdb/milvus
    
Faiss (34 16K ) - A library for efficient similarity search and clustering of dense vectors. MIT
  • GitHub ( 90 2.4K 540 1.7K - 14% open 10.01.2022):

     git clone https://github.com/facebookresearch/faiss
    
  • PyPi ( 24K / month 15 31.12.2021):

     pip install pymilvus
    
  • Conda ( 230K 20.11.2021):

     conda install -c conda-forge faiss
    
Annoy (33 9.3K) - Approximate Nearest Neighbors in C++/Python optimized for memory usage.. Apache-2
  • GitHub ( 76 960 1.9K 340 - 11% open 03.01.2022):

     git clone https://github.com/spotify/annoy
    
  • PyPi ( 780K / month 240 18.09.2020):

     pip install annoy
    
NMSLIB (30 2.7K) - Non-Metric Space Library (NMSLIB): An efficient similarity search.. Apache-2
  • GitHub ( 45 370 530 390 - 15% open 19.09.2021):

     git clone https://github.com/nmslib/nmslib
    
  • PyPi ( 92K / month 47 03.02.2021):

     pip install nmslib
    
  • Conda ( 47K 22.11.2021):

     conda install -c conda-forge nmslib
    
hnswlib (30 1.8K) - Header-only C++/python library for fast approximate nearest neighbors. Apache-2
  • GitHub ( 52 350 180 240 - 50% open 09.12.2021):

     git clone https://github.com/nmslib/hnswlib
    
  • PyPi ( 110K / month 20 28.12.2021):

     pip install hnswlib
    
PyNNDescent (28 570) - A Python nearest neighbor descent for approximate nearest neighbors. BSD-2
  • GitHub ( 18 72 1.1K 94 - 46% open 05.01.2022):

     git clone https://github.com/lmcinnes/pynndescent
    
  • PyPi ( 1.4M / month 22 15.10.2021):

     pip install pynndescent
    
  • Conda ( 430K 15.10.2021):

     conda install -c conda-forge pynndescent
    
NGT (19 840) - Nearest Neighbor Search with Neighborhood Graph and Tree for High-.. Apache-2
  • GitHub ( 12 82 86 - 11% open 12.01.2022):

     git clone https://github.com/yahoojapan/NGT
    
  • PyPi ( 15K / month 6 23.09.2020):

     pip install ngt
    
N2 (18 500 ) - TOROS N2 - lightweight approximate Nearest Neighbor library which runs.. Apache-2
  • GitHub ( 18 61 22 37 - 45% open 20.05.2021):

     git clone https://github.com/kakao/n2
    
  • PyPi ( 740 / month 3 16.10.2020):

     pip install n2
    
Show 3 hidden projects...
  • Magnitude (23 1.5K ) - A fast, efficient universal vector embedding utility package. MIT
  • NearPy (21 700 ) - Python framework for fast (approximated) nearest neighbour search in.. MIT
  • PySparNN (11 890 ) - Approximate Nearest Neighbor Search for Sparse Data in Python!. BSD-3

Probabilistics & Statistics

Back to top

Libraries providing capabilities for probabilistic programming/reasoning, bayesian inference, gaussian processes, or statistics.

PyMC3 (39 6.3K) - Probabilistic Programming in Python: Bayesian Modeling and.. Apache-2
  • GitHub ( 350 1.5K 1.5K 580 2.5K - 7% open 13.01.2022):

     git clone https://github.com/pymc-devs/pymc3
    
  • PyPi ( 240K / month 230 24.08.2021):

     pip install pymc3
    
  • Conda ( 370K 12.10.2021):

     conda install -c conda-forge pymc3
    
tensorflow-probability (37 3.6K) - Probabilistic reasoning and statistical analysis in.. Apache-2
  • GitHub ( 430 950 1.2K - 45% open 12.01.2022):

     git clone https://github.com/tensorflow/probability
    
  • PyPi ( 760K / month 300 17.11.2021):

     pip install tensorflow-probability
    
  • Conda ( 48K 22.10.2021):

     conda install -c conda-forge tensorflow-probability
    
Pyro (32 7.3K) - Deep universal probabilistic programming with Python and PyTorch. Apache-2
  • GitHub ( 120 880 600 920 - 19% open 28.12.2021):

     git clone https://github.com/pyro-ppl/pyro
    
  • PyPi ( 60K / month 49 14.12.2021):

     pip install pyro-ppl
    
GPyTorch (32 2.6K) - A highly efficient and modular implementation of Gaussian Processes.. MIT
  • GitHub ( 90 390 460 1K - 23% open 05.01.2022):

     git clone https://github.com/cornellius-gp/gpytorch
    
  • PyPi ( 130K / month 30 04.12.2021):

     pip install gpytorch
    
pgmpy (32 2K) - Python Library for learning (Structure and Parameter), inference.. MIT
  • GitHub ( 100 630 120 300 770 - 28% open 12.01.2022):

     git clone https://github.com/pgmpy/pgmpy
    
  • PyPi ( 65K / month 9 30.12.2021):

     pip install pgmpy
    
hmmlearn (31 2.4K) - Hidden Markov Models in Python, with scikit-learn like API. BSD-3
  • GitHub ( 38 680 1.2K 370 - 15% open 13.01.2022):

     git clone https://github.com/hmmlearn/hmmlearn
    
  • PyPi ( 290K / month 130 18.07.2021):

     pip install hmmlearn
    
  • Conda ( 110K 13.11.2021):

     conda install -c conda-forge hmmlearn
    
filterpy (31 2.1K ) - Python Kalman filtering and optimal estimation library. Implements.. MIT
  • GitHub ( 36 480 1.2K 200 - 23% open 04.05.2021):

     git clone https://github.com/rlabbe/filterpy
    
  • PyPi ( 630K / month 130 10.10.2018):

     pip install filterpy
    
  • Conda ( 71K 05.05.2020):

     conda install -c conda-forge filterpy
    
pomegranate (30 2.8K) - Fast, flexible and easy to use probabilistic modelling in Python. MIT
  • GitHub ( 65 510 610 670 - 8% open 20.11.2021):

     git clone https://github.com/jmschrei/pomegranate
    
  • PyPi ( 33K / month 44 19.11.2021):

     pip install pomegranate
    
  • Conda ( 78K 16.11.2021):

     conda install -c conda-forge pomegranate
    
GPflow (30 1.6K) - Gaussian processes in TensorFlow. Apache-2
  • GitHub ( 72 410 320 740 - 16% open 05.01.2022):

     git clone https://github.com/GPflow/GPflow
    
  • PyPi ( 7.2K / month 28 26.10.2021):

     pip install gpflow
    
  • Conda ( 9.8K 06.11.2018):

     conda install -c conda-forge gpflow
    
patsy (30 810) - Describing statistical models in Python using symbolic formulas. BSD-2
  • GitHub ( 16 94 46K 140 - 51% open 26.09.2021):

     git clone https://github.com/pydata/patsy
    
  • PyPi ( 4.5M / month 2.6K 26.09.2021):

     pip install patsy
    
  • Conda ( 4.1M 26.09.2021):

     conda install -c conda-forge patsy
    
SALib (29 560) - Sensitivity Analysis Library in Python. Contains Sobol, Morris, FAST, and.. MIT
  • GitHub ( 34 170 270 - 18% open 19.12.2021):

     git clone https://github.com/SALib/SALib
    
  • PyPi ( 80K / month 51 04.09.2021):

     pip install salib
    
  • Conda ( 75K 04.09.2021):

     conda install -c conda-forge salib
    
pandas-ta (27 2K) - Technical Analysis Indicators - Pandas TA is an easy to use Python.. MIT
  • GitHub ( 44 460 390 330 - 17% open 16.11.2021):

     git clone https://github.com/twopirllc/pandas-ta
    
  • PyPi ( 41K / month 10 28.07.2021):

     pip install pandas-ta
    
bambi (25 730) - BAyesian Model-Building Interface (Bambi) in Python. MIT
  • GitHub ( 21 72 20 220 - 15% open 11.01.2022):

     git clone https://github.com/bambinos/bambi
    
  • PyPi ( 2K / month 3 11.01.2022):

     pip install bambi
    
Orbit (23 860) - A Python package for Bayesian forecasting with object-oriented design.. Apache-2
  • GitHub ( 15 65 5 310 - 13% open 12.01.2022):

     git clone https://github.com/uber/orbit
    
  • PyPi ( 3K / month 1 12.01.2022):

     pip install orbit-ml
    
Baal (20 520) - Library to enable Bayesian active learning in your research or labeling.. Apache-2
  • GitHub ( 11 49 63 - 28% open 10.01.2022):

     git clone https://github.com/ElementAI/baal
    
  • PyPi ( 560 / month 1 17.12.2021):

     pip install baal
    
Show 8 hidden projects...
  • Edward (28 4.7K ) - A probabilistic programming language in TensorFlow. Deep.. Apache-2
  • pingouin (28 880) - Statistical package in Python based on Pandas. GPL-3.0
  • PyStan (25 150) - PyStan, a Python interface to Stan, a platform for statistical modeling... ISC
  • scikit-posthocs (21 220) - Multiple Pairwise Comparisons (Post Hoc) Tests in Python. MIT
  • pyhsmm (20 500 ) - Bayesian inference in HSMMs and HMMs. MIT
  • Funsor (20 190) - Functional tensors for probabilistic programming. Apache-2
  • ZhuSuan (15 2.1K ) - A probabilistic programming library for Bayesian deep learning,.. MIT
  • Lea (12) - Discrete probability distributions in Python. GPL-3.0

Adversarial Robustness

Back to top

Libraries for testing the robustness of machine learning models against attacks with adversarial/malicious examples.

ART (32 2.7K) - Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning.. MIT
  • GitHub ( 90 760 180 630 - 11% open 10.01.2022):

     git clone https://github.com/Trusted-AI/adversarial-robustness-toolbox
    
  • PyPi ( 3.8K / month 6 07.01.2022):

     pip install adversarial-robustness-toolbox
    
TextAttack (30 1.8K) - TextAttack is a Python framework for adversarial attacks, data.. MIT
  • GitHub ( 46 220 51 190 - 20% open 16.12.2021):

     git clone https://github.com/QData/TextAttack
    
  • PyPi ( 6.6K / month 3 10.11.2021):

     pip install textattack
    
CleverHans (29 5.4K) - An adversarial example library for constructing attacks,.. MIT
  • GitHub ( 130 1.3K 280 450 - 5% open 23.09.2021):

     git clone https://github.com/cleverhans-lab/cleverhans
    
  • PyPi ( 870 / month 11 24.07.2021):

     pip install cleverhans
    
Foolbox (28 2.1K ) - A Python toolbox to create adversarial examples that fool neural.. MIT
  • GitHub ( 32 370 260 330 - 18% open 05.06.2021):

     git clone https://github.com/bethgelab/foolbox
    
  • PyPi ( 3.2K / month 13 23.02.2021):

     pip install foolbox
    
AdvBox (19 1.2K ) - Advbox is a toolbox to generate adversarial examples that fool.. Apache-2
  • GitHub ( 19 240 37 - 21% open 03.05.2021):

     git clone https://github.com/advboxes/AdvBox
    
  • PyPi ( 44 / month 05.12.2018):

     pip install advbox
    
robustness (17 650) - A library for experimenting with, training and evaluating neural.. MIT
  • GitHub ( 13 120 69 68 - 20% open 30.11.2021):

     git clone https://github.com/MadryLab/robustness
    
  • PyPi ( 800 / month 2 01.12.2020):

     pip install robustness
    
Show 3 hidden projects...
  • advertorch (21 990) - A Toolbox for Adversarial Robustness Research. GPL-3.0
  • textflint (16 540) - Unified Multilingual Robustness Evaluation Toolkit for Natural.. GPL-3.0
  • Adversary (15 360 ) - Tool to generate adversarial text examples and test machine.. MIT

GPU Utilities

Back to top

Libraries that require and make use of CUDA/GPU system capabilities to optimize data handling and machine learning tasks.

CuPy (39 5.7K) - NumPy & SciPy for GPU. MIT
  • GitHub ( 290 530 24K 910 1.6K - 22% open 13.01.2022):

     git clone https://github.com/cupy/cupy
    
  • PyPi ( 82K / month 140 09.12.2021):

     pip install cupy
    
  • Conda ( 1.1M 16.12.2021):

     conda install -c conda-forge cupy
    
  • Docker Hub ( 54K 7 09.12.2021):

     docker pull cupy/cupy
    
cuDF (30 4.4K) - cuDF - GPU DataFrame Library. Apache-2
  • GitHub ( 230 580 4.3K - 16% open 13.01.2022):

     git clone https://github.com/rapidsai/cudf
    
  • PyPi ( 770 / month 3 01.06.2020):

     pip install cudf
    
PyCUDA (30 1.3K) - CUDA integration for Python, plus shiny features. MIT
  • GitHub ( 74 250 1.1K 220 - 29% open 11.01.2022):

     git clone https://github.com/inducer/pycuda
    
  • PyPi ( 33K / month 190 03.04.2021):

     pip install pycuda
    
gpustat (28 2.7K) - A simple command-line utility for querying and monitoring GPU status. MIT
  • GitHub ( 12 210 1.5K 78 - 28% open 13.08.2021):

     git clone https://github.com/wookayin/gpustat
    
  • PyPi ( 300K / month 99 02.01.2021):

     pip install gpustat
    
  • Conda ( 110K 24.11.2020):

     conda install -c conda-forge gpustat
    
cuML (27 2.5K) - cuML - RAPIDS Machine Learning Library. Apache-2
  • GitHub ( 140 370 1.9K - 33% open 13.01.2022):

     git clone https://github.com/rapidsai/cuml
    
  • PyPi ( 550 / month 1 01.06.2020):

     pip install cuml
    
Apex (26 6K) - A PyTorch Extension: Tools for easy mixed precision and distributed.. BSD-3
  • GitHub ( 88 870 850 930 - 57% open 17.12.2021):

     git clone https://github.com/NVIDIA/apex
    
  • Conda ( 72K 22.04.2021):

     conda install -c conda-forge nvidia-apex
    
ArrayFire (26 3.7K) - ArrayFire: a general purpose GPU library. BSD-3
  • GitHub ( 81 490 1.8K 1.5K - 15% open 15.10.2021):

     git clone https://github.com/arrayfire/arrayfire
    
  • PyPi ( 530 / month 5 05.03.2021):

     pip install arrayfire
    
DALI (24 3.7K) - A GPU-accelerated library containing highly optimized building blocks.. Apache-2
  • GitHub ( 68 460 1.1K - 13% open 13.01.2022):

     git clone https://github.com/NVIDIA/DALI
    
cuGraph (24 890) - cuGraph - RAPIDS Graph Analytics Library. Apache-2
  • GitHub ( 71 170 750 - 11% open 13.01.2022):

     git clone https://github.com/rapidsai/cugraph
    
  • PyPi ( 170 / month 1 01.06.2020):

     pip install cugraph
    
scikit-cuda (24 870) - Python interface to GPU-powered libraries. BSD-3
  • GitHub ( 45 170 160 220 - 22% open 13.07.2021):

     git clone https://github.com/lebedov/scikit-cuda
    
  • PyPi ( 540 / month 44 27.05.2019):

     pip install scikit-cuda
    
BlazingSQL (23 1.7K) - BlazingSQL is a lightweight, GPU accelerated, SQL engine for.. Apache-2
  • GitHub ( 47 160 720 - 18% open 30.09.2021):

     git clone https://github.com/BlazingDB/blazingsql
    
  • Conda ( 940 11.11.2019):

     conda install -c blazingsql blazingsql-protocol
    
Vulkan Kompute (19 710) - General purpose GPU compute framework for cross vendor.. Apache-2
  • GitHub ( 16 51 110 2 160 - 32% open 04.01.2022):

     git clone https://github.com/EthicalML/vulkan-kompute
    
  • PyPi ( 110 / month 15.09.2021):

     pip install kp
    
cuSignal (19 560) - GPU accelerated signal processing. Apache-2
  • GitHub ( 36 80 120 - 12% open 08.12.2021):

     git clone https://github.com/rapidsai/cusignal
    
Show 5 hidden projects...
  • GPUtil (22 810 ) - A Python module for getting the GPU status from NVIDA GPUs using.. MIT
  • py3nvml (22 200) - Python 3 Bindings for NVML library. Get NVIDIA GPU status inside your.. BSD-3
  • nvidia-ml-py3 (20 72 ) - Python 3 Bindings for the NVIDIA Management Library. BSD-3
  • SpeedTorch (15 650 ) - Library for faster pinned CPU - GPU transfer in Pytorch. MIT
  • ipyexperiments (15 140) - jupyter/ipython experiment containers for GPU and.. Apache-2

Tensorflow Utilities

Back to top

Libraries that extend TensorFlow with additional capabilities.

TF Addons (37 1.4K) - Useful extra functionality for TensorFlow 2.x maintained by.. Apache-2
  • GitHub ( 180 480 5.1K 920 - 23% open 03.01.2022):

     git clone https://github.com/tensorflow/addons
    
  • PyPi ( 6.9M / month 140 10.11.2021):

     pip install tensorflow-addons
    
tensor2tensor (35 12K) - Library of deep learning models and datasets designed to.. Apache-2
  • GitHub ( 240 2.9K 1.1K 1.2K - 45% open 12.01.2022):

     git clone https://github.com/tensorflow/tensor2tensor
    
  • PyPi ( 160K / month 93 17.06.2020):

     pip install tensor2tensor
    
TensorFlow Datasets (35 3.1K) - TFDS is a collection of datasets ready to use with.. Apache-2
  • GitHub ( 240 1.2K 1.1K - 45% open 13.01.2022):

     git clone https://github.com/tensorflow/datasets
    
  • PyPi ( 1.4M / month 140 28.07.2021):

     pip install tensorflow-datasets
    
tensorflow-hub (34 3K ) - A library for transfer learning by reusing parts of.. Apache-2
  • GitHub ( 83 1.6K 9.9K 640 - 2% open 13.01.2022):

     git clone https://github.com/tensorflow/hub
    
  • PyPi ( 2.7M / month 280 14.04.2021):

     pip install tensorflow-hub
    
  • Conda ( 60K 18.04.2021):

     conda install -c conda-forge tensorflow-hub
    
TensorFlow Transform (32 900) - Input pipeline framework. Apache-2
  • GitHub ( 27 180 660 170 - 12% open 13.01.2022):

     git clone https://github.com/tensorflow/transform
    
  • PyPi ( 8M / month 54 02.12.2021):

     pip install tensorflow-transform
    
TF Model Optimization (31 1.2K) - A toolkit to optimize ML models for deployment for.. Apache-2
  • GitHub ( 64 250 1.4K 280 - 47% open 13.01.2022):

     git clone https://github.com/tensorflow/model-optimization
    
  • PyPi ( 150K / month 13 30.09.2021):

     pip install tensorflow-model-optimization
    
Keras-Preprocessing (30 1K ) - Utilities for working with image data, text data, and.. MIT
  • GitHub ( 50 430 190 - 48% open 04.02.2021):

     git clone https://github.com/keras-team/keras-preprocessing
    
  • PyPi ( 8M / month 1.5K 14.05.2020):

     pip install keras-preprocessing
    
  • Conda ( 1.2M 15.01.2021):

     conda install -c conda-forge keras-preprocessing
    
TensorFlow I/O (29 520) - Dataset, streaming, and file system extensions.. Apache-2
  • GitHub ( 84 210 490 - 32% open 04.01.2022):

     git clone https://github.com/tensorflow/io
    
  • PyPi ( 150K / month 15 15.12.2021):

     pip install tensorflow-io
    
efficientnet (26 1.9K) - Implementation of EfficientNet model. Keras and.. Apache-2
  • GitHub ( 10 430 200K 860 120 - 51% open 16.07.2021):

     git clone https://github.com/qubvel/efficientnet
    
  • PyPi ( 70K / month 7 15.09.2020):

     pip install efficientnet
    
Neural Structured Learning (26 890) - Training neural models with structured signals. Apache-2
  • GitHub ( 31 160 160 60 - 5% open 04.01.2022):

     git clone https://github.com/tensorflow/neural-structured-learning
    
  • PyPi ( 9.6K / month 2 18.08.2020):

     pip install neural-structured-learning
    
TensorFlow Cloud (24 320) - The TensorFlow Cloud repository provides APIs that.. Apache-2
  • GitHub ( 26 64 120 80 - 67% open 04.01.2022):

     git clone https://github.com/tensorflow/cloud
    
  • PyPi ( 330K / month 1 17.06.2021):

     pip install tensorflow-cloud
    
TensorNets (20 1K ) - High level network definitions with pre-trained weights in.. MIT
  • GitHub ( 6 190 42 58 - 27% open 02.01.2021):

     git clone https://github.com/taehoonlee/tensornets
    
  • PyPi ( 76 / month 4 31.03.2020):

     pip install tensornets
    
Saliency (19 760) - Framework-agnostic implementation for state-of-the-art saliency.. Apache-2
  • GitHub ( 14 160 19 40 - 35% open 28.07.2021):

     git clone https://github.com/PAIR-code/saliency
    
  • PyPi ( 600 / month 3 03.05.2021):

     pip install saliency
    
TF Compression (17 570) - Data compression in TensorFlow. Apache-2
  • GitHub ( 10 200 76 - 3% open 09.01.2022):

     git clone https://github.com/tensorflow/compression
    
  • PyPi ( 420 / month 1 14.05.2021):

     pip install tensorflow-compression
    
Show 1 hidden projects...
  • tffm (20 770 ) - TensorFlow implementation of an arbitrary order Factorization Machine. MIT

Sklearn Utilities

Back to top

Libraries that extend scikit-learn with additional capabilities.

imbalanced-learn (36 5.7K) - A Python Package to Tackle the Curse of Imbalanced.. MIT
  • GitHub ( 61 1.2K 8.8K 500 - 12% open 11.01.2022):

     git clone https://github.com/scikit-learn-contrib/imbalanced-learn
    
  • PyPi ( 2.6M / month 240 11.01.2022):

     pip install imbalanced-learn
    
  • Conda ( 180K 11.01.2022):

     conda install -c conda-forge imbalanced-learn
    
MLxtend (35 3.8K) - A library of extension and helper modules for Pythons data.. BSD-3
  • GitHub ( 85 730 5K 400 - 24% open 04.01.2022):

     git clone https://github.com/rasbt/mlxtend
    
  • PyPi ( 1.5M / month 140 03.09.2021):

     pip install mlxtend
    
  • Conda ( 190K 03.09.2021):

     conda install -c conda-forge mlxtend
    
category_encoders (33 1.8K ) - A library of sklearn compatible categorical variable.. BSD-3
  • GitHub ( 48 340 2.9K 240 - 32% open 16.11.2021):

     git clone https://github.com/scikit-learn-contrib/category_encoders
    
  • PyPi ( 860K / month 23 14.10.2018):

     pip install category_encoders
    
  • Conda ( 140K 13.10.2021):

     conda install -c conda-forge category_encoders
    
fancyimpute (25 1K) - Multivariate imputation and matrix completion algorithms.. Apache-2
  • GitHub ( 12 160 1.1K 110 - 0% open 21.10.2021):

     git clone https://github.com/iskandr/fancyimpute
    
  • PyPi ( 9.6K / month 28 21.10.2021):

     pip install fancyimpute
    
scikit-lego (25 680) - Extra blocks for scikit-learn pipelines. MIT
  • GitHub ( 48 77 39 240 - 13% open 21.12.2021):

     git clone https://github.com/koaning/scikit-lego
    
  • PyPi ( 22K / month 6 09.12.2021):

     pip install scikit-lego
    
  • Conda ( 17K 09.12.2021):

     conda install -c conda-forge scikit-lego
    
sklearn-contrib-lightning (24 1.5K) - Large-scale linear classification, regression and.. BSD-3
  • GitHub ( 17 190 100 97 91 - 54% open 09.01.2022):

     git clone https://github.com/scikit-learn-contrib/lightning
    
  • PyPi ( 1.7K / month 6 15.06.2021):

     pip install sklearn-contrib-lightning
    
  • Conda ( 160K 13.11.2021):

     conda install -c conda-forge sklearn-contrib-lightning
    
scikit-opt (23 2.8K) - Genetic Algorithm, Particle Swarm Optimization, Simulated.. MIT
  • GitHub ( 14 680 59 130 - 24% open 10.01.2022):

     git clone https://github.com/guofei9987/scikit-opt
    
  • PyPi ( 1.1K / month 4 28.06.2021):

     pip install scikit-opt
    
iterative-stratification (21 620) - scikit-learn cross validators for iterative.. BSD-3
  • GitHub ( 6 57 180 19 - 21% open 11.11.2021):

     git clone https://github.com/trent-b/iterative-stratification
    
  • PyPi ( 89K / month 8 03.10.2021):

     pip install iterative-stratification
    
combo (21 560) - (AAAI 20) A Python Toolbox for Machine Learning Model Combination. BSD-2 xgboost
  • GitHub ( 1 95 440 12 - 75% open 02.10.2021):

     git clone https://github.com/yzhao062/combo
    
  • PyPi ( 53K / month 1 23.12.2020):

     pip install combo
    
DESlib (18 370) - A Python library for dynamic classifier and ensemble selection. BSD-3
  • GitHub ( 13 73 22 140 - 9% open 10.10.2021):

     git clone https://github.com/scikit-learn-contrib/DESlib
    
  • PyPi ( 990 / month 2 08.02.2021):

     pip install deslib
    
scikit-tda (18 320) - Topological Data Analysis for Python. MIT
  • GitHub ( 3 39 24 16 - 75% open 03.08.2021):

     git clone https://github.com/scikit-tda/scikit-tda
    
  • PyPi ( 3.7K / month 03.08.2021):

     pip install scikit-tda
    
Show 6 hidden projects...
  • sklearn-crfsuite (25 390 ) - scikit-learn inspired API for CRFsuite. MIT
  • scikit-multilearn (23 710 ) - A scikit-learn based module for multi-label et. al... BSD-2
  • skope-rules (20 430 ) - machine learning with logical rules in Python. BSD-1-Clause
  • celer (19 130) - Fast solver for L1-type problems: Lasso, sparse Logisitic regression,.. BSD-3
  • skggm (17 190 ) - Scikit-learn compatible estimation of general graphical models. MIT
  • dabl (15 110) - Data Analysis Baseline Library. BSD-3

Pytorch Utilities

Back to top

Libraries that extend Pytorch with additional capabilities.

PML (32 4K) - The easiest way to use deep metric learning in your application. Modular,.. MIT
  • GitHub ( 23 500 180 320 - 12% open 12.01.2022):

     git clone https://github.com/KevinMusgrave/pytorch-metric-learning
    
  • PyPi ( 97K / month 8 28.12.2021):

     pip install pytorch-metric-learning
    
  • Conda ( 5.4K 28.12.2021):

     conda install -c metric-learning pytorch-metric-learning
    
pytorch-optimizer (29 2.3K) - torch-optimizer -- collection of optimizers for.. Apache-2
  • GitHub ( 25 210 430 44 - 36% open 11.11.2021):

     git clone https://github.com/jettify/pytorch-optimizer
    
  • PyPi ( 52K / month 23 31.10.2021):

     pip install torch_optimizer
    
lightning-flash (28 1.3K) - Your PyTorch AI Factory - Flash enables you to easily.. Apache-2
  • GitHub ( 60 130 41 430 - 19% open 12.01.2022):

     git clone https://github.com/PyTorchLightning/lightning-flash
    
  • PyPi ( 2.3K / month 2 13.12.2021):

     pip install lightning-flash
    
EfficientNet-PyTorch (27 6.7K ) - A PyTorch implementation of EfficientNet and.. Apache-2
  • GitHub ( 24 1.3K 1.1M 260 - 49% open 15.04.2021):

     git clone https://github.com/lukemelas/EfficientNet-PyTorch
    
  • PyPi ( 370K / month 30 15.04.2021):

     pip install efficientnet-pytorch
    
accelerate (27 2.1K) - A simple way to train and use PyTorch models with multi-.. Apache-2
  • GitHub ( 30 120 280 150 - 33% open 11.01.2022):

     git clone https://github.com/huggingface/accelerate
    
  • PyPi ( 37K / month 4 27.09.2021):

     pip install accelerate
    
pytorch-summary (26 3.4K ) - Model summary in PyTorch similar to `model.summary()`.. MIT
  • GitHub ( 11 380 4.1K 160 - 75% open 10.05.2021):

     git clone https://github.com/sksq96/pytorch-summary
    
  • PyPi ( 55K / month 71 26.09.2018):

     pip install torchsummary
    
torchdiffeq (25 3.9K) - Differentiable ODE solvers with full GPU support and.. MIT
  • GitHub ( 20 670 180 160 - 17% open 22.09.2021):

     git clone https://github.com/rtqichen/torchdiffeq
    
  • PyPi ( 33K / month 6 02.06.2021):

     pip install torchdiffeq
    
Torchmeta (25 1.5K) - A collection of extensions and data-loaders for few-shot learning.. MIT
  • GitHub ( 12 190 78 120 - 27% open 20.09.2021):

     git clone https://github.com/tristandeleu/pytorch-meta
    
  • PyPi ( 1.2K / month 20.09.2021):

     pip install torchmeta
    
TabNet (25 1.5K) - PyTorch implementation of TabNet paper :.. MIT
  • GitHub ( 19 300 200 - 18% open 27.12.2021):

     git clone https://github.com/dreamquark-ai/tabnet
    
  • PyPi ( 33K / month 7 02.02.2021):

     pip install pytorch-tabnet
    
SRU (24 2K ) - Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755). MIT
  • GitHub ( 21 300 17 120 - 44% open 19.05.2021):

     git clone https://github.com/asappresearch/sru
    
  • PyPi ( 2.6K / month 3 17.06.2021):

     pip install sru
    
torch-scatter (24 850) - PyTorch Extension Library of Optimized Scatter Operations. MIT
  • GitHub ( 18 93 240 - 9% open 13.11.2021):

     git clone https://github.com/rusty1s/pytorch_scatter
    
  • PyPi ( 28K / month 40 22.10.2021):

     pip install torch-scatter
    
PyTorch Sparse (23 550) - PyTorch Extension Library of Optimized Autograd Sparse.. MIT
  • GitHub ( 17 70 190 - 31% open 13.11.2021):

     git clone https://github.com/rusty1s/pytorch_sparse
    
  • PyPi ( 17K / month 32 08.09.2021):

     pip install torch-sparse
    
EfficientNets (22 1.4K) - Pretrained EfficientNet, EfficientNet-Lite, MixNet,.. Apache-2
  • GitHub ( 5 190 88 51 - 1% open 08.07.2021):

     git clone https://github.com/rwightman/gen-efficientnet-pytorch
    
  • PyPi ( 5.9K / month 1 08.07.2021):

     pip install geffnet
    
Pytorch Toolbelt (22 1.2K) - PyTorch extensions for fast R&D prototyping and Kaggle.. MIT
  • GitHub ( 7 87 23 - 21% open 29.12.2021):

     git clone https://github.com/BloodAxe/pytorch-toolbelt
    
  • PyPi ( 6.7K / month 6 12.08.2021):

     pip install pytorch_toolbelt
    
reformer-pytorch (21 1.7K) - Reformer, the efficient Transformer, in Pytorch. MIT
  • GitHub ( 10 220 120 - 11% open 06.11.2021):

     git clone https://github.com/lucidrains/reformer-pytorch
    
  • PyPi ( 5.9K / month 06.11.2021):

     pip install reformer-pytorch
    
Higher (21 1.3K) - higher is a pytorch library allowing users to obtain higher.. Apache-2
  • GitHub ( 9 93 100 95 - 47% open 26.10.2021):

     git clone https://github.com/facebookresearch/higher
    
  • PyPi ( 7.5K / month 2 14.07.2020):

     pip install higher
    
tinygrad (19 5.1K) - You like pytorch? You like micrograd? You love tinygrad!. MIT
  • GitHub ( 52 570 1 110 - 19% open 06.01.2022):

     git clone https://github.com/geohot/tinygrad
    
Performer Pytorch (19 770) - An implementation of Performer, a linear attention-based.. MIT
  • GitHub ( 6 100 36 70 - 41% open 07.11.2021):

     git clone https://github.com/lucidrains/performer-pytorch
    
  • PyPi ( 890 / month 4 07.11.2021):

     pip install performer-pytorch
    
Tensor Sensor (19 620) - The goal of this library is to generate more helpful.. MIT
  • GitHub ( 3 33 7 23 - 34% open 13.12.2021):

     git clone https://github.com/parrt/tensor-sensor
    
  • PyPi ( 1.7K / month 11.12.2021):

     pip install tensor-sensor
    
Pywick (17 360) - High-level batteries-included neural network training library for.. MIT
  • GitHub ( 4 38 5 14 - 14% open 22.10.2021):

     git clone https://github.com/achaiah/pywick
    
  • PyPi ( 50 / month 22.10.2021):

     pip install pywick
    
torchsde (16 890) - Differentiable SDE solvers with GPU support and efficient.. Apache-2
  • GitHub ( 5 99 9 43 - 16% open 26.07.2021):

     git clone https://github.com/google-research/torchsde
    
Tez (16 740) - Tez is a super-simple and lightweight Trainer for PyTorch. It also.. Apache-2
  • GitHub ( 1 110 18 28 - 64% open 28.12.2021):

     git clone https://github.com/abhishekkrthakur/tez
    
  • PyPi ( 980 / month 2 28.12.2021):

     pip install tez
    
Torch-Struct (14 1K) - Fast, general, and tested differentiable structured prediction.. MIT
  • GitHub ( 14 80 52 - 46% open 04.11.2021):

     git clone https://github.com/harvardnlp/pytorch-struct
    
madgrad (14 750) - MADGRAD Optimization Method. MIT
  • GitHub ( 1 52 20 9 - 22% open 20.08.2021):

     git clone https://github.com/facebookresearch/madgrad
    
  • PyPi ( 3.6K / month 01.04.2021):

     pip install madgrad
    
Show 7 hidden projects...

Database Clients

Back to top

Libraries for connecting to, operating, and querying databases.

 best-of-python - DB Clients ( 1.9K) - Collection of database clients for python.


Others

Back to top

scipy (49 9.1K) - Ecosystem of open-source software for mathematics, science, and engineering. BSD-3
  • GitHub ( 1.2K 4K 330K 450K 8.2K - 22% open 12.01.2022):

     git clone https://github.com/scipy/scipy
    
  • PyPi ( 29M / month 56K 21.12.2021):

     pip install scipy
    
  • Conda ( 20M 25.11.2021):

     conda install -c conda-forge scipy
    
SymPy (45 8.8K) - A computer algebra system written in pure Python. BSD-3
  • GitHub ( 1.1K 3.5K 440K 38K 12K - 35% open 13.01.2022):

     git clone https://github.com/sympy/sympy
    
  • PyPi ( 1.8M / month 4K 08.10.2021):

     pip install sympy
    
  • Conda ( 1.8M 06.11.2021):

     conda install -c conda-forge sympy
    
Streamlit (38 17K) - Streamlit The fastest way to build data apps in Python. Apache-2
  • GitHub ( 130 1.6K 200 2.2K - 24% open 13.01.2022):

     git clone https://github.com/streamlit/streamlit
    
  • PyPi ( 640K / month 280 28.12.2021):

     pip install streamlit
    
Gradio (34 4.7K) - Wrap UIs around any model, share with anyone. Apache-2
  • GitHub ( 40 290 490 260 - 16% open 10.01.2022):

     git clone https://github.com/gradio-app/gradio
    
  • PyPi ( 72K / month 12 04.01.2022):

     pip install gradio
    
carla (33 7.1K) - Open-source simulator for autonomous driving research. MIT
  • GitHub ( 140 2K 120 3.7K - 13% open 19.11.2021):

     git clone https://github.com/carla-simulator/carla
    
  • PyPi ( 2.2K / month 3 17.11.2021):

     pip install carla
    
Datasette (33 5.7K) - An open source multi-tool for exploring and publishing data. Apache-2
  • GitHub ( 60 380 34 570 1.2K - 27% open 23.12.2021):

     git clone https://github.com/simonw/datasette
    
  • PyPi ( 140K / month 140 19.12.2021):

     pip install datasette
    
PyOD (33 5.2K) - (JMLR 19) A Python Toolbox for Scalable Outlier Detection (Anomaly.. BSD-2
  • GitHub ( 33 1K 1.1K 230 - 50% open 04.01.2022):

     git clone https://github.com/yzhao062/pyod
    
  • PyPi ( 440K / month 28 04.01.2022):

     pip install pyod
    
DeepChem (33 3.4K) - Democratizing Deep-Learning for Drug Discovery, Quantum Chemistry,.. MIT
  • GitHub ( 180 1.2K 66 1.4K - 30% open 12.01.2022):

     git clone https://github.com/deepchem/deepchem
    
  • PyPi ( 4.6K / month 4 12.01.2022):

     pip install deepchem
    
PaddleHub (32 7.4K) - Awesome pre-trained models toolkit based on.. Apache-2
  • GitHub ( 53 1.5K 560 640 1K - 37% open 28.12.2021):

     git clone https://github.com/PaddlePaddle/PaddleHub
    
  • PyPi ( 7.9K / month 4 28.12.2021):

     pip install paddlehub
    
Autograd (32 5.6K ) - Efficiently computes derivatives of numpy code. MIT
  • GitHub ( 51 780 2.8K 380 - 42% open 03.03.2021):

     git clone https://github.com/HIPS/autograd
    
  • PyPi ( 1.1M / month 270 25.07.2019):

     pip install autograd
    
  • Conda ( 200K 25.07.2019):

     conda install -c conda-forge autograd
    
hdbscan (31 2.1K) - A high performance implementation of HDBSCAN clustering. BSD-3
  • GitHub ( 76 370 1.2K 410 - 62% open 12.01.2022):

     git clone https://github.com/scikit-learn-contrib/hdbscan
    
  • PyPi ( 300K / month 140 03.02.2021):

     pip install hdbscan
    
  • Conda ( 1M 14.02.2021):

     conda install -c conda-forge hdbscan
    
Pythran (31 1.7K) - Ahead of Time compiler for numeric kernels. BSD-3
  • GitHub ( 64 170 93 740 - 15% open 11.01.2022):

     git clone https://github.com/serge-sans-paille/pythran
    
  • PyPi ( 250K / month 14 14.12.2021):

     pip install pythran
    
  • Conda ( 210K 14.12.2021):

     conda install -c conda-forge pythran
    
tensorly (31 1.2K) - TensorLy: Tensor Learning in Python. BSD-2
  • GitHub ( 49 220 200 180 - 25% open 22.12.2021):

     git clone https://github.com/tensorly/tensorly
    
  • PyPi ( 5.1K / month 30 08.11.2021):

     pip install tensorly
    
  • Conda ( 200K 09.12.2021):

     conda install -c conda-forge tensorly
    
PennyLane (30 1.1K) - PennyLane is a cross-platform Python library for differentiable.. Apache-2
  • GitHub ( 86 330 59 600 - 25% open 13.01.2022):

     git clone https://github.com/PennyLaneAI/PennyLane
    
  • PyPi ( 8.6K / month 28 14.12.2021):

     pip install pennylane
    
agate (30 1.1K) - A Python data analysis library that is optimized for humans instead of.. MIT
  • GitHub ( 49 130 720 680 - 7% open 15.07.2021):

     git clone https://github.com/wireservice/agate
    
  • PyPi ( 680K / month 130 15.07.2021):

     pip install agate
    
  • Conda ( 75K 16.07.2021):

     conda install -c conda-forge agate
    
pyopencl (30 860) - OpenCL integration for Python, plus shiny features. MIT
  • GitHub ( 90 220 610 300 - 22% open 30.12.2021):

     git clone https://github.com/inducer/pyopencl
    
  • PyPi ( 22K / month 180 30.12.2021):

     pip install pyopencl
    
  • Conda ( 550K 30.12.2021):

     conda install -c conda-forge pyopencl
    
causalml (29 2.6K) - Uplift modeling and causal inference with machine learning.. Apache-2
  • GitHub ( 32 390 33 240 - 19% open 10.01.2022):

     git clone https://github.com/uber/causalml
    
  • PyPi ( 44K / month 1 02.08.2021):

     pip install causalml
    
TabPy (28 1.2K) - Execute Python code on the fly and display results in Tableau visualizations:. MIT
  • GitHub ( 44 440 79 290 - 5% open 13.01.2022):

     git clone https://github.com/tableau/TabPy
    
  • PyPi ( 18K / month 2 20.08.2021):

     pip install tabpy
    
pyjanitor (28 800) - Clean APIs for data cleaning. Python implementation of R package Janitor. MIT
  • GitHub ( 96 130 140 430 - 21% open 13.01.2022):

     git clone https://github.com/pyjanitor-devs/pyjanitor
    
  • PyPi ( 14K / month 10 21.11.2021):

     pip install pyjanitor
    
  • Conda ( 110K 22.11.2021):

     conda install -c conda-forge pyjanitor
    
Trax (27 6.7K) - Trax Deep Learning with Clear Code and Speed. Apache-2
  • GitHub ( 74 670 40 200 - 40% open 23.12.2021):

     git clone https://github.com/google/trax
    
  • PyPi ( 4K / month 26.10.2021):

     pip install trax
    
metric-learn (27 1.2K) - Metric learning algorithms in Python. MIT
  • GitHub ( 21 220 180 170 - 30% open 17.11.2021):

     git clone https://github.com/scikit-learn-contrib/metric-learn
    
  • PyPi ( 8.2K / month 11 02.07.2020):

     pip install metric-learn
    
pycm (27 1.2K) - Multi-class confusion matrix library in Python. MIT
  • GitHub ( 16 100 120 180 - 10% open 27.10.2021):

     git clone https://github.com/sepandhaghighi/pycm
    
  • PyPi ( 32K / month 12 27.10.2021):

     pip install pycm
    
kmodes (27 950) - Python implementations of the k-modes and k-prototypes clustering.. MIT
  • GitHub ( 20 360 940 140 - 13% open 15.12.2021):

     git clone https://github.com/nicodv/kmodes
    
  • PyPi ( 240K / month 24 08.10.2021):

     pip install kmodes
    
pyclustering (27 910 ) - pyclustring is a Python, C++ data mining library. BSD-3
  • GitHub ( 26 210 380 260 650 - 8% open 12.02.2021):

     git clone https://github.com/annoviko/pyclustering
    
  • PyPi ( 42K / month 28 25.11.2020):

     pip install pyclustering
    
  • Conda ( 33K 13.09.2021):

     conda install -c conda-forge pyclustering
    
Mars (26 2.3K) - Mars is a tensor-based unified framework for large-scale data.. Apache-2
  • GitHub ( 39 290 940 - 16% open 12.01.2022):

     git clone https://github.com/mars-project/mars
    
  • PyPi ( 2.2K / month 1 16.12.2021):

     pip install pymars
    
PySwarms (26 870 ) - A research toolkit for particle swarm optimization in Python. MIT
  • GitHub ( 43 280 150 200 - 8% open 23.06.2021):

     git clone https://github.com/ljvmiranda921/pyswarms
    
  • PyPi ( 30K / month 6 03.01.2021):

     pip install pyswarms
    
River (25 3K) - Online machine learning in Python. BSD-3
  • GitHub ( 70 320 66 340 - 1% open 05.01.2022):

     git clone https://github.com/online-ml/river
    
alibi-detect (25 1.1K) - Algorithms for outlier, adversarial and drift detection. Apache-2
  • GitHub ( 15 110 48 200 - 40% open 13.01.2022):

     git clone https://github.com/SeldonIO/alibi-detect
    
  • PyPi ( 11K / month 5 09.12.2021):

     pip install alibi-detect
    
AugLy (24 4.2K) - A data augmentations library for audio, image, text, and video. MIT
  • GitHub ( 14 220 25 56 - 25% open 28.12.2021):

     git clone https://github.com/facebookresearch/AugLy
    
  • PyPi ( 1.7K / month 3 17.12.2021):

     pip install augly
    
modAL (24 1.5K ) - A modular active learning framework for Python. MIT
  • GitHub ( 14 230 20 110 120 - 54% open 07.01.2021):

     git clone https://github.com/modAL-python/modAL
    
  • PyPi ( 2.7K / month 7 07.01.2021):

     pip install modAL
    
AstroML (23 790 ) - Machine learning, statistics, and data mining for astronomy and.. BSD-2
  • GitHub ( 30 270 140 - 38% open 07.04.2021):

     git clone https://github.com/astroML/astroML
    
  • PyPi ( 1.5K / month 33 04.01.2022):

     pip install astroML
    
  • Conda ( 27K 16.02.2020):

     conda install -c conda-forge astroml
    
Prince (23 750) - Python factor analysis library (PCA, CA, MCA, MFA, FAMD). MIT
  • GitHub ( 12 130 170 100 - 33% open 28.12.2021):

     git clone https://github.com/MaxHalford/prince
    
  • PyPi ( 13K / month 5 06.10.2020):

     pip install prince
    
findspark (23 430 ) - Find pyspark to make it importable. BSD-3
  • GitHub ( 14 66 2.2K 21 - 52% open 14.06.2021):

     git clone https://github.com/minrk/findspark
    
  • PyPi ( 1.9M / month 140 08.06.2020):

     pip install findspark
    
  • Conda ( 600K 06.07.2018):

     conda install -c conda-forge findspark
    
StreamAlert (22 2.6K) - StreamAlert is a serverless, realtime data analysis framework.. Apache-2
  • GitHub ( 33 320 340 - 24% open 04.11.2021):

     git clone https://github.com/airbnb/streamalert
    
gplearn (22 1.1K) - Genetic Programming in Python, with a scikit-learn inspired API. BSD-3
  • GitHub ( 11 190 220 180 - 26% open 18.10.2021):

     git clone https://github.com/trevorstephens/gplearn
    
  • PyPi ( 2.7K / month 10 01.06.2019):

     pip install gplearn
    
avalanche (22 700) - Avalanche: an End-to-End Library for Continual Learning. MIT
  • GitHub ( 48 110 450 - 15% open 11.01.2022):

     git clone https://github.com/ContinualAI/avalanche
    
impyute (21 300) - Data imputations library to preprocess datasets with missing data. MIT
  • GitHub ( 11 43 120 64 - 42% open 06.11.2021):

     git clone https://github.com/eltonlaw/impyute
    
  • PyPi ( 1.6K / month 3 29.04.2019):

     pip install impyute
    
opyrator (20 2.5K ) - Turns your machine learning code into microservices with web API,.. MIT
  • GitHub ( 4 110 31 24 - 8% open 06.05.2021):

     git clone https://github.com/ml-tooling/opyrator
    
  • PyPi ( 270 / month 04.05.2021):

     pip install opyrator
    
scikit-rebate (20 340 ) - A scikit-learn-compatible Python implementation of.. MIT
  • GitHub ( 13 62 32 - 40% open 15.02.2021):

     git clone https://github.com/EpistasisLab/scikit-rebate
    
  • PyPi ( 3.8K / month 40 20.03.2021):

     pip install skrebate
    
SUOD (20 300) - (MLSys 21) An Acceleration System for Large-scare Unsupervised Heterogeneous.. BSD-2
  • GitHub ( 1 39 400 8 - 75% open 02.10.2021):

     git clone https://github.com/yzhao062/SUOD
    
  • PyPi ( 39K / month 01.10.2021):

     pip install suod
    
BioPandas (19 400) - Working with molecular structures in pandas DataFrames. BSD-3
  • GitHub ( 8 89 39 - 38% open 04.01.2022):

     git clone https://github.com/rasbt/biopandas
    
  • PyPi ( 2.1K / month 9 24.09.2021):

     pip install biopandas
    
  • Conda ( 94K 31.08.2021):

     conda install -c conda-forge biopandas
    
pykale (19 320) - Knowledge-Aware machine LEarning (KALE): accessible machine learning.. MIT
  • GitHub ( 14 38 74 - 13% open 22.12.2021):

     git clone https://github.com/pykale/pykale
    
  • PyPi ( 35 / month 13.10.2021):

     pip install pykale
    
baikal (18 590 ) - A graph-based functional API for building complex scikit-learn.. BSD-3
  • GitHub ( 2 29 3 20 - 30% open 11.04.2021):

     git clone https://github.com/alegonz/baikal
    
  • PyPi ( 380 / month 1 15.11.2020):

     pip install baikal
    
  • Conda ( 1.2M 04.11.2021):

     conda install -c conda-forge cython-blis
    
apricot (18 400) - apricot implements submodular optimization for the purpose of selecting.. MIT
  • GitHub ( 4 40 10 22 24 - 25% open 18.11.2021):

     git clone https://github.com/jmschrei/apricot
    
  • PyPi ( 380 / month 3 28.09.2020):

     pip install apricot-select
    
traingenerator (13 1.1K ) - A web app to generate template code for machine learning. MIT
  • GitHub ( 3 160 16 - 81% open 29.04.2021):

     git clone https://github.com/jrieke/traingenerator
    
Show 12 hidden projects...
  • datalad (31 270) - Keep code, data, containers under control with git and git-annex. MIT
  • Cython BLIS (29 180) - Fast matrix-multiplication as a self-contained Python library no.. BSD-3
  • pysc2 (28 7.4K ) - StarCraft II Learning Environment. Apache-2
  • minisom (23 990) - MiniSom is a minimalistic implementation of the Self Organizing.. CC-BY-3.0
  • cleanlab (22 2.6K) - The standard package for machine learning with noisy labels,.. AGPL-3.0
  • mlens (21 710 ) - ML-Ensemble high performance ensemble learning. MIT
  • vecstack (21 640 ) - Python package for stacking (machine learning technique). MIT
  • rrcf (19 350 ) - Implementation of the Robust Random Cut Forest algorithm for anomaly.. MIT
  • pandas-ml (18 280 ) - pandas, scikit-learn, xgboost and seaborn integration. BSD-3
  • Feature Engine (17 11) - Feature engineering package with sklearn like functionality. BSD-3
  • dstack (12 180) - An open-source tool to rapidly develop data applications with Python. Apache-2
  • nylon (11 76) - An intelligent, flexible grammar of machine learning. MIT

Related Resources

Contribution

Contributions are encouraged and always welcome! If you like to add or update projects, choose one of the following ways:

  • Open an issue by selecting one of the provided categories from the issue page and fill in the requested information.
  • Modify the projects.yaml with your additions or changes, and submit a pull request. This can also be done directly via the Github UI.

If you like to contribute to or share suggestions regarding the project metadata collection or markdown generation, please refer to the best-of-generator repository. If you like to create your own best-of list, we recommend to follow this guide.

For more information on how to add or update projects, please read the contribution guidelines. By participating in this project, you agree to abide by its Code of Conduct.

License

CC0


Get A Weekly Email With Trending Projects For These Topics
No Spam. Unsubscribe easily at any time.
Python (1,147,256
Machine Learning (31,898
Deep Learning (23,829
Tensorflow (12,840
Pytorch (11,655
Data Science (9,253
Nlp (8,431
Keras (5,781
Data Visualization (4,533
Awesome (3,873
Awesome List (3,558
Scikit Learn (2,001
Python Library (1,288
Best Of (11
Related Projects