Project Name	Stars	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
Espnet	7,563	5	3 months ago	33	October 25, 2023	270	apache-2.0	Python
End-to-End Speech Processing Toolkit
Speechbrain	7,166		3 months ago			149	apache-2.0	Python
A PyTorch-based Speech Toolkit
Silero Models	4,088	4	6 months ago	4	June 12, 2022	8	other	Jupyter Notebook
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Wenet	3,512		3 months ago	13	August 29, 2023	55	apache-2.0	Python
Production First and Production Ready End-to-End Speech Recognition Toolkit
Pytorch Kaldi	2,138		2 years ago			24		Python
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Whisper Timestamped	1,217	3	3 months ago	3	December 08, 2023	15	agpl-3.0	Python
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Espresso	930		9 months ago			7	other	Python
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Conformer	809		4 months ago			19	apache-2.0	Python
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Sincnet	764		3 years ago			22	mit	Python
SincNet is a neural architecture for efficiently processing raw audio samples.
Speech Transformer	714		a year ago			5		Python
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Alternatives To Listen Attend Spell

Select To Compare

Espnet ⭐ 7,563

End-to-End Speech Processing Toolkit

dependent packages 5total releases 33most recent commit 3 months ago

Speechbrain ⭐ 7,166

A PyTorch-based Speech Toolkit

most recent commit 3 months ago

Silero Models ⭐ 4,088

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

dependent packages 4total releases 4most recent commit 6 months ago

Wenet ⭐ 3,512

Production First and Production Ready End-to-End Speech Recognition Toolkit

total releases 13most recent commit 3 months ago

Pytorch Kaldi ⭐ 2,138

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

most recent commit 2 years ago

Whisper Timestamped ⭐ 1,217

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

dependent packages 3total releases 3most recent commit 3 months ago

Espresso ⭐ 930

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

most recent commit 9 months ago

Conformer ⭐ 809

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

most recent commit 4 months ago

Sincnet ⭐ 764

SincNet is a neural architecture for efficiently processing raw audio samples.

most recent commit 3 years ago

Speech Transformer ⭐ 714

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

most recent commit a year ago

Suggest An Alternative To Listen-Attend-Spell

Alternative Project Comparisons

Listen Attend Spell vs Espnet

Listen Attend Spell vs Speechbrain

Listen Attend Spell vs Silero Models

Listen Attend Spell vs Wenet

Listen Attend Spell vs Pytorch Kaldi

Listen Attend Spell vs Whisper Timestamped

Listen Attend Spell vs Espresso

Listen Attend Spell vs Conformer

Listen Attend Spell vs Sincnet

Listen Attend Spell vs Speech Transformer

Popular Asr Projects

Kaldi ⭐ 13,453

kaldi-asr/kaldi is the official location of the Kaldi project.

dependent packages 3total releases 3latest release April 20, 2022most recent commit 3 months ago

Paddlespeech ⭐ 10,011

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

dependent packages 4total releases 9latest release May 27, 2022most recent commit a month ago

Nemo ⭐ 9,041

NeMo: a toolkit for conversational AI

dependent packages 8total releases 70latest release October 25, 2023most recent commit 3 months ago

Whisperx ⭐ 7,510

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

most recent commit 3 months ago

Vosk Api ⭐ 6,633

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

dependent packages 40total releases 37latest release December 14, 2022most recent commit 3 months ago

Popular Pytorch Projects

Transformers ⭐ 124,049

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

dependent packages 2,484total releases 125latest release November 15, 2023most recent commit 13 days ago

Stable Diffusion Webui ⭐ 118,856

Stable Diffusion web UI

total releases 2latest release January 17, 2022most recent commit 3 months ago

Pytorch ⭐ 74,794

Tensors and Dynamic neural networks in Python with strong GPU acceleration

dependent packages 8,272total releases 39latest release November 15, 2023most recent commit 3 months ago

Keras ⭐ 60,854

Deep Learning for humans

dependent packages 697total releases 87latest release December 06, 2023most recent commit 12 days ago

Real Time Voice Cloning ⭐ 49,550

Clone a voice in 5 seconds to generate arbitrary speech in real-time

most recent commit 3 months ago

Popular Machine Learning Categories

Natural Language Processing

Neural Network

Neural

Computer Vision

Convolutional Neural Networks

Opencv