Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
Whisper.cpp	27,404		1	5 months ago	1	December 12, 2022	465	mit	C
Port of OpenAI's Whisper model in C/C++
Deepspeech	24,127	29	14	4 months ago	100	December 19, 2020	137	mpl-2.0	C++
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Leon	13,937			6 months ago			94	mit	TypeScript
🧠 Leon is your open-source personal assistant.
Kaldi	13,453		3	5 months ago	3	April 20, 2022	234	other	Shell
kaldi-asr/kaldi is the official location of the Kaldi project.
Nemo	9,041	2	8	5 months ago	70	October 25, 2023	109	apache-2.0	Python
NeMo: a toolkit for conversational AI
Faster Whisper	8,711		22	2 months ago	12	November 26, 2023	140	mit	Python
Faster Whisper transcription with CTranslate2
Speech_recognition	7,801	544	277	5 months ago	56	December 06, 2023	314	bsd-3-clause	Python
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Whisperx	7,510			5 months ago			341	bsd-4-clause	Python
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Asrt_speechrecognition	7,253			5 months ago	1	October 23, 2020	101	gpl-3.0	Python
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Speechbrain	7,166			5 months ago			149	apache-2.0	Python
A PyTorch-based Speech Toolkit

Alternatives To Whisper.cpp

Select To Compare

Whisper.cpp ⭐ 27,404

Port of OpenAI's Whisper model in C/C++

dependent packages 1total releases 1most recent commit 5 months ago

Deepspeech ⭐ 24,127

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

dependent packages 14total releases 100most recent commit 4 months ago

Leon ⭐ 13,937

🧠 Leon is your open-source personal assistant.

most recent commit 6 months ago

Kaldi ⭐ 13,453

kaldi-asr/kaldi is the official location of the Kaldi project.

dependent packages 3total releases 3most recent commit 5 months ago

Nemo ⭐ 9,041

NeMo: a toolkit for conversational AI

dependent packages 8total releases 70most recent commit 5 months ago

Faster Whisper ⭐ 8,711

Faster Whisper transcription with CTranslate2

dependent packages 22total releases 12most recent commit 2 months ago

Speech_recognition ⭐ 7,801

Speech recognition module for Python, supporting several engines and APIs, online and offline.

dependent packages 277total releases 56most recent commit 5 months ago

Whisperx ⭐ 7,510

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

most recent commit 5 months ago

Asrt_speechrecognition ⭐ 7,253

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

total releases 1most recent commit 5 months ago

Speechbrain ⭐ 7,166

A PyTorch-based Speech Toolkit

most recent commit 5 months ago

Suggest An Alternative To whisper.cpp

Alternative Project Comparisons

Whisper.cpp vs Deepspeech

Whisper.cpp vs Leon

Whisper.cpp vs Kaldi

Whisper.cpp vs Nemo

Whisper.cpp vs Faster Whisper

Whisper.cpp vs Speech_recognition

Whisper.cpp vs Whisperx

Whisper.cpp vs Asrt_speechrecognition

Whisper.cpp vs Speechbrain

Popular Speech Recognition Projects

Transformers ⭐ 127,491

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

dependent packages 2,484total releases 125latest release November 15, 2023most recent commit 6 days ago

Deeplearningexamples ⭐ 12,073

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

most recent commit 5 months ago

Deep Learning Drizzle ⭐ 10,767

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

most recent commit a year ago

Paddlespeech ⭐ 10,407

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

dependent packages 4total releases 9latest release May 27, 2022most recent commit 5 days ago

Espnet ⭐ 7,563

End-to-End Speech Processing Toolkit

dependent packages 5total releases 33latest release October 25, 2023most recent commit 5 months ago

Popular Speech To Text Projects

Pyvideotrans ⭐ 3,054

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言，并添加配音

most recent commit 5 months ago

Nlp Models Tensorflow ⭐ 1,329

Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0

most recent commit 4 years ago

Dc_tts ⭐ 1,148

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

most recent commit a year ago

Botium Speech Processing ⭐ 938

Botium Speech Processing

most recent commit a year ago

Voicy ⭐ 865

@voicybot Telegram bot main repository

most recent commit 7 months ago

Popular Machine Learning Categories

Natural Language Processing

Neural Network

Neural

Computer Vision

Convolutional Neural Networks

Opencv