Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Faster Whisper | 8,711 | 22 | a month ago | 12 | November 26, 2023 | 140 | mit | Python | ||
Faster Whisper transcription with CTranslate2 | ||||||||||
Qbot | 4,799 | | | | 6 months ago | | | 51 | mit | Jupyter Notebook |
[🔥updating ...] AI automated quantitative trading bot: an AI-powered quantitative investment research platform. 📃 Online docs: https://ufund-me.github.io/Qbot ✨ qbot-mini: https://github.com/Charmve/iQuant | ||||||||||
Autogptq | 3,637 | | | | a month ago | | | 174 | mit | Python |
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm. | ||||||||||
Nlp Architect | 2,928 | | | | 2 years ago | 10 | April 12, 2020 | 14 | apache-2.0 | Python |
A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks | ||||||||||
Pocketflow | 2,553 | | | | 3 years ago | | | 73 | other | Python |
An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications. | ||||||||||
Ctranslate2 | 2,437 | 23 | 4 months ago | 103 | December 05, 2023 | 110 | mit | C++ | ||
Fast inference engine for Transformer models | ||||||||||
Mixtral Offloading | 1,943 | | | | 4 months ago | | | 12 | mit | Python |
Run Mixtral-8x7B models in Colab or consumer desktops | ||||||||||
Vector Quantize Pytorch | 1,627 | 25 | 4 months ago | 160 | December 06, 2023 | 27 | mit | Python | ||
Vector Quantization, in PyTorch | ||||||||||
Awesome Model Quantization | 1,449 | | | | 4 months ago | | | | | |
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research and is continuously improved. PRs adding works (papers, repositories) that the repo is missing are welcome. | ||||||||||
Model Optimization | 1,445 | 3 | 27 | 4 months ago | 30 | May 26, 2023 | 207 | apache-2.0 | Python | |
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning. |