Marlin

An FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups at medium batch sizes of up to 16-32 tokens.
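The ~4x figure follows from weight-only quantization: at low batch sizes the GEMM is memory-bound, so streaming 4-bit weights instead of 16-bit weights cuts memory traffic roughly 4x. As a hedged illustration of what an FP16xINT4 multiply computes, here is a minimal unfused PyTorch reference. This is not Marlin's API or kernel; the function name, the nibble-packing layout, and the per-channel scale/zero-point scheme are assumptions made for this sketch (Marlin itself fuses dequantization into the GEMM and never materializes the FP16 weight matrix).

```python
import torch

def fp16_int4_matmul_reference(a, packed_w, scales, zeros):
    """Unfused FP16xINT4 matmul reference: dequantize, then matmul.

    a:        (m, k) FP16 activations
    packed_w: (k // 2, n) uint8, two 4-bit weights packed per byte
              (packing layout is an assumption for this sketch)
    scales:   (n,) FP16 per-output-channel scales (assumed scheme)
    zeros:    (n,) FP16 per-output-channel zero points (assumed scheme)
    """
    # Unpack two INT4 values from each byte: low nibble, then high nibble.
    low = (packed_w & 0x0F).to(torch.float16)
    high = (packed_w >> 4).to(torch.float16)
    w_int4 = torch.stack((low, high), dim=1).reshape(-1, packed_w.shape[1])  # (k, n)
    # Dequantize: real_weight = scale * (quantized - zero_point).
    w_fp16 = scales * (w_int4 - zeros)
    # Plain FP16 GEMM. A fused kernel does the unpack-and-scale step above
    # in registers inside the GEMM, so weights cross memory at 4 bits each.
    return a @ w_fp16

# Usage sketch with random data. FP16 matmul may require a GPU on
# older PyTorch builds, hence the device check.
m, k, n = 16, 128, 256
dev = "cuda" if torch.cuda.is_available() else "cpu"
a = torch.randn(m, k, dtype=torch.float16, device=dev)
packed = torch.randint(0, 256, (k // 2, n), dtype=torch.uint8, device=dev)
scales = torch.full((n,), 0.01, dtype=torch.float16, device=dev)
zeros = torch.full((n,), 8.0, dtype=torch.float16, device=dev)
out = fp16_int4_matmul_reference(a, packed, scales, zeros)  # (m, n)
```

Because the low-batch regime is bandwidth-bound, the 4x reduction in weight bytes read translates almost directly into speedup; once the multiply becomes compute-bound the advantage shrinks, which matches the batch-size limit of 16-32 tokens stated above.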
Alternatives To Marlin
| Project Name | Stars | Most Recent Commit | Open Issues | License | Language | Description |
|---|---|---|---|---|---|---|
| Marlin | 160 | 3 months ago | 4 | apache-2.0 | Python | FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups at medium batch sizes of up to 16-32 tokens |
| Tensorquant | 44 | 4 years ago | | apache-2.0 | Python | |
| Yolov3_lite | 32 | 4 years ago | 8 | | C++ | YOLOv3 model compression and acceleration (quantization, sparsity), C++ version |
| Wlq | 20 | 5 years ago | | other | C++ | Caffe implementation of single-level quantization |
| Tensorflow_model_quantization | 8 | 3 years ago | | | Python | A tutorial of model quantization using TensorFlow |