Awesome Open Source
Search results for deep learning quantization
68 search results found
Faster Whisper
⭐
8,711
Faster Whisper transcription with CTranslate2
Qbot
⭐
4,799
[🔥updating ...] AI automated quantitative trading bot; AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ qbot-mini: https://github.com/Charmve/iQuant
Autogptq
⭐
3,637
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
Nlp Architect
⭐
2,928
A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks
Pocketflow
⭐
2,553
An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.
Ctranslate2
⭐
2,437
Fast inference engine for Transformer models
Mixtral Offloading
⭐
1,943
Run Mixtral-8x7B models in Colab or consumer desktops
Vector Quantize Pytorch
⭐
1,627
Vector Quantization, in Pytorch
Awesome Model Quantization
⭐
1,449
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and we are continuously improving the project. PRs for works (papers, repositories) that the repo has missed are welcome.
Model Optimization
⭐
1,445
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
Intel Extension For Pytorch
⭐
1,161
A Python package that extends the official PyTorch to easily gain performance on Intel platforms
Training_extensions
⭐
1,119
Train, Evaluate, Optimize, Deploy Computer Vision Models via OpenVINO™
Brevitas
⭐
1,015
Brevitas: neural network quantization in PyTorch
Ppq
⭐
957
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
Rwkv.cpp
⭐
956
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Nncf
⭐
725
Neural Network Compression Framework for enhanced OpenVINO™ inference
Awesome Emdl
⭐
723
Embedded and mobile deep learning research resources
Tinyengine
⭐
614
[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning; [NeurIPS 2022] MCUNetV3: On-Device Training Under 256KB Memory
Deep Compression Alexnet
⭐
599
Deep Compression on AlexNet
Deephash
⭐
537
An Open-Source Package for Deep Learning to Hash (DeepHash)
Qkeras
⭐
514
QKeras: a quantization deep learning library for Tensorflow Keras
Complete Life Cycle Of A Data Science Project
⭐
499
Complete-Life-Cycle-of-a-Data-Science-Project
Awesome Deep Neural Network Compression
⭐
475
Summary, Code for Deep Neural Network Quantization
Onnx2tf
⭐
461
Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massive Transpose extrapolation problem in onnx-tensorflow (onnx-tf). I don't need a Star, but give me a pull request.
Minigpt4.cpp
⭐
435
Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)
Tinychatengine
⭐
407
TinyChatEngine: On-Device LLM Inference Library
Awesome Ml Model Compression
⭐
378
Awesome machine learning model compression research papers, tools, and learning material.
Bmxnet
⭐
344
(New version is out: https://github.com/hpi-xnor/BMXNet-v2) BMXNet: An Open-Source Binary Neural Network Implementation Based on MXNet
Awesome Model Compression And Acceleration
⭐
329
A list of awesome papers on deep model compression and acceleration
Deephash Papers
⭐
319
Must-read papers on deep learning to hash (DeepHash)
Sparsebit
⭐
291
A model compression and acceleration toolbox based on pytorch.
Fastt5
⭐
280
⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.
Model_optimization
⭐
245
Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers advanced quantization and compression tools for deploying state-of-the-art neural networks.
Blueoil
⭐
243
Bring Deep Learning to small devices
Jejunet
⭐
236
Real-Time Video Segmentation on Mobile Devices with DeepLab V3+, MobileNet V2. Worked on the project in 🏝 Jeju island
Dfq
⭐
230
PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.
Deep Compression Pytorch
⭐
182
PyTorch implementation of 'Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding' by Song Han, Huizi Mao, William J. Dally
Network Speed And Compression
⭐
176
Network acceleration methods
Awesome Ai Infrastructures
⭐
171
Infrastructures™ for Machine Learning Training/Inference in Production.
Great Deep Learning Tutorials
⭐
153
A Great Collection of Deep Learning Tutorials and Repositories
Terngrad
⭐
152
Ternary Gradients to Reduce Communication in Distributed Deep Learning (TensorFlow)
Tf2deepfloorplan
⭐
107
TF2 Deep FloorPlan Recognition using a Multi-task Network with Room-boundary-Guided Attention. Enable tensorboard, quantization, flask, tflite, docker, github actions and google colab.
Frostnet
⭐
86
FrostNet: Towards Quantization-Aware Network Architecture Search
Efficient Deep Learning
⭐
82
Papers related to Efficient Deep Neural Networks
Hailo_model_zoo
⭐
82
The Hailo Model Zoo includes pre-trained models and a full building and evaluation environment
Qonnx
⭐
81
QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX
Discrete Key Value Bottleneck Pytorch
⭐
81
Implementation of Discrete Key / Value Bottleneck, in Pytorch
Permute Quantize Finetune
⭐
79
Using ideas from product quantization for state-of-the-art neural network compression.
Tf2
⭐
74
An Open Source Deep Learning Inference Engine Based on FPGA
Jacinto Ai Devkit
⭐
74
Training & Quantization of embedded friendly Deep Learning / Machine Learning / Computer Vision models
Facial Landmark Detection Hrnet
⭐
72
A TensorFlow implementation of HRNet for facial landmark detection.
Sota Backbones
⭐
64
A collection of SOTA Image Classification Models in PyTorch
Ssql Eccv2022
⭐
64
PyTorch implementation of SSQL (Accepted to ECCV2022 oral presentation)
Cvpr17 Dvsq
⭐
62
The implementation of CVPR-17 paper "Deep Visual-Semantic Quantization for Efficient Image Retrieval"
Gpq
⭐
59
Generalized Product Quantization Network For Semi-supervised Image Retrieval - CVPR 2020
F8net
⭐
52
[ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization
Awesome Deeplearning Tutorials
⭐
47
Links to useful online tutorials etc. about deep learning
Ai8x Synthesis
⭐
46
Quantization and Synthesis (Device Specific Code Generation) for ADI's MAX78000 and MAX78002 AI Devices
Tidy
⭐
43
Offline semantic Text-to-Image and Image-to-Image search on Android powered by quantized state-of-the-art vision-language pretrained CLIP model and ONNX Runtime inference engine
Mobilenet_v1_stm32_cmsis_nn
⭐
40
Mobilenet v1 trained on Imagenet for STM32 using extended CMSIS-NN with INT-Q quantization support
Lsq Net
⭐
37
Unofficial implementation of LSQ-Net, a neural network quantization framework
Aaai17 Cdq
⭐
31
The implementation of AAAI-17 paper "Collective Deep Quantization for Efficient Cross-modal Retrieval"
Edge Tpu Tiny Yolo
⭐
21
Run Tiny YOLO-v3 on Google's Edge TPU USB Accelerator.
Model Compression Acceleration
⭐
20
Paper list on model compression and acceleration
Compress Net Notes
⭐
20
Neuralzip
⭐
19
An out-of-the-box PyTorch Scaffold for Neural Network Quantization-Aware-Training (QAT) Research. Website: https://github.com/zhutmost/neuralzip
Dnnac
⭐
18
All about acceleration and compression of Deep Neural Networks
Jnqd
⭐
14
Learning-based Just-noticeable-quantization-distortion Model for perceptual video coding
Binary Neural Networks
⭐
14
Exploring "Binary Neural Networks" (https://arxiv.org/abs/1602.02830) in Theano. A set of experiments that use binarised weights and/or activations to reduce computational load of convolutional neural networks.
Awesome Approximate Dnn
⭐
13
Curated content for DNN approximation, acceleration ... with a focus on hardware accelerator and deployment
Coarsehash
⭐
13
Benchmark datasets used in ICRA 2020 paper: Fast, Compact and Highly Scalable Visual Place Recognition through Sequence-based Matching of Overloaded Representations
Dtq
⭐
13
PyTorch implementation of "Deep Transferring Quantization" (ECCV2020)
Papers I Read
⭐
12
Summaries and notes on recent Deep Learning literature
Qsnns
⭐
9
Quantization-aware training with spiking neural networks
Quantized Deep Neural Network On Jetson Agx Xavier
⭐
9
How to create, train and quantize a network, then integrate it into pre/post image processing and generate CUDA C++ code targeting Jetson AGX Xavier
Quantized_meanfield
⭐
9
This repository provides the source code used in the paper: A Mean Field Theory of Quantized Deep Networks: The Quantization-Depth Trade-Off
Awd Lstm Tensorflow
⭐
9
AWD-LSTM from "Regularizing and Optimizing LSTM Language Models" with training-aware quantization support for TensorFlow.
Paper Collection Of Efficient Ml
⭐
9
paper collection
Q Asr
⭐
9
Integer-only Zero-shot Quantization for Efficient Speech Recognition
Pocket Cnn
⭐
9
CNN-to-FPGA-framework for small CNN, written in VHDL and Python
Llms
⭐
7
Comprehensive LLMs repo, where I cover both theoretical and practical aspects of LLMs.
Hsi Toolbox
⭐
6
Hyperspectral CNN compression and band selection
Myconvnet
⭐
6
Deep learning using TensorFlow low-level APIs
Co Design
⭐
6
Software/Hardware Co-design for Deep Learning.
Daqn
⭐
6
An implementation of our paper “DEEP ADVERSARIAL QUANTIZATION NETWORK FOR CROSS-MODAL RETRIEVAL”
Da2lite
⭐
6
DA2Lite is an automated model compression toolkit for PyTorch.
Neural Network Quatization And Compression Papers
⭐
5
This repository gives most of the papers which are published in the domain of neural network compression and quantization
Deep Llr Quantization
⭐
5
Source code for the "Deep Log-Likelihood Ratio Compression" paper submitted to EUSIPCO 2019
Pwlq
⭐
5
Code for our paper at ECCV 2020: Post-Training Piecewise Linear Quantization for Deep Neural Networks
Related Searches
Python Deep Learning (13,095)
Jupyter Notebook Deep Learning (10,328)
Deep Learning Neural Network (5,801)
Deep Learning Pytorch (4,652)
Deep Learning Tensorflow (4,441)
Deep Learning Convolutional Neural Networks (4,142)
Deep Learning Computer Vision (3,652)
Deep Learning Artificial Intelligence (2,898)
Deep Learning Keras (2,519)
Deep Learning Dataset (2,320)
Copyright 2018-2024 Awesome Open Source. All rights reserved.