Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Espnet | 7,563 | 5 | 3 months ago | 33 | October 25, 2023 | 270 | apache-2.0 | Python | ||
End-to-End Speech Processing Toolkit | ||||||||||
Speechbrain | 7,166 | 3 months ago | 149 | apache-2.0 | Python | |||||
A PyTorch-based Speech Toolkit | ||||||||||
Silero Models | 4,088 | 4 | 6 months ago | 4 | June 12, 2022 | 8 | other | Jupyter Notebook | ||
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple | ||||||||||
Wenet | 3,512 | 3 months ago | 13 | August 29, 2023 | 55 | apache-2.0 | Python | |||
Production First and Production Ready End-to-End Speech Recognition Toolkit | ||||||||||
Pytorch Kaldi | 2,138 | 2 years ago | 24 | Python | ||||||
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit. | ||||||||||
Whisper Timestamped | 1,217 | 3 | 3 months ago | 3 | December 08, 2023 | 15 | agpl-3.0 | Python | ||
Multilingual Automatic Speech Recognition with word-level timestamps and confidence | ||||||||||
Espresso | 930 | 9 months ago | 7 | other | Python | |||||
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit | ||||||||||
Conformer | 809 | 4 months ago | 19 | apache-2.0 | Python | |||||
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020) | ||||||||||
Sincnet | 764 | 3 years ago | 22 | mit | Python | |||||
SincNet is a neural architecture for efficiently processing raw audio samples. | ||||||||||
Speech Transformer | 714 | a year ago | 5 | Python | ||||||
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese. |