I Bert

[ICML'21] I-BERT: Integer-only BERT Quantization
Alternatives To I Bert
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Chinese Llama Alpaca15,877
4 months ago8apache-2.0Python
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Autogptq3,637
25 days ago174mitPython
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Nlp Architect2,928
a year ago10April 12, 202014apache-2.0Python
A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks
Deepsparse2,72933 months ago141December 07, 202328otherPython
Sparsity-aware deep learning inference runtime for CPUs
Nncf72563 months ago16November 16, 202346apache-2.0Python
Neural Network Compression Framework for enhanced OpenVINO™ inference
Complete Life Cycle Of A Data Science Project499
4 months ago4mit
Complete-Life-Cycle-of-a-Data-Science-Project
Squeezellm486
3 months ago5mitPython
SqueezeLLM: Dense-and-Sparse Quantization
Sparsezoo34773 months ago26December 04, 20236apache-2.0Python
Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes
Fastt5280
2 years ago14April 05, 202213apache-2.0Python
⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.
Compress Fasttext15316 months ago12October 14, 20232mitJupyter Notebook
Tools for shrinking fastText models (in gensim format)
Alternatives To I Bert
Select To Compare


Alternative Project Comparisons
Popular Quantization Projects
Popular Natural Language Processing Projects
Popular Machine Learning Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Python
Natural Language Processing
Neural
Translation
Quantization
Model Compression