Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Transformers | 124,049 | 64 | 2,484 | a month ago | 125 | November 15, 2023 | 946 | apache-2.0 | Python | |
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. | ||||||||||
Deeplearningexamples | 12,073 | 4 months ago | 295 | Jupyter Notebook | ||||||
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure. | ||||||||||
Espnet | 7,563 | 5 | 4 months ago | 33 | October 25, 2023 | 270 | apache-2.0 | Python | ||
End-to-End Speech Processing Toolkit | ||||||||||
Speechbrain | 7,166 | 4 months ago | 149 | apache-2.0 | Python | |||||
A PyTorch-based Speech Toolkit | ||||||||||
Silero Models | 4,088 | 4 | 7 months ago | 4 | June 12, 2022 | 8 | other | Jupyter Notebook | ||
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple | ||||||||||
Wenet | 3,754 | 2 days ago | 13 | August 29, 2023 | 55 | apache-2.0 | Python | |||
Production First and Production Ready End-to-End Speech Recognition Toolkit | ||||||||||
Ml Road | 2,742 | 5 months ago | 3 | mit | Python | |||||
Machine Learning Resources, Practice and Research | ||||||||||
Funasr | 2,315 | 2 | 4 months ago | 42 | November 28, 2023 | 156 | other | Python | ||
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. | ||||||||||
Pytorch Kaldi | 2,138 | 2 years ago | 24 | Python | ||||||
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit. | ||||||||||
Whisper Timestamped | 1,217 | 3 | 4 months ago | 3 | December 08, 2023 | 15 | agpl-3.0 | Python | ||
Multilingual Automatic Speech Recognition with word-level timestamps and confidence |