Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Speechbrain | 7,166 | 2 months ago | 149 | apache-2.0 | Python | |||||
A PyTorch-based Speech Toolkit | ||||||||||
Pyannote Audio | 4,460 | 1 | 13 | 2 months ago | 24 | December 01, 2023 | 95 | mit | Jupyter Notebook | |
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding | ||||||||||
Deepvoice3_pytorch | 1,906 | 3 months ago | 43 | other | Python | |||||
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models | ||||||||||
Wavenet_vocoder | 1,617 | 3 years ago | 14 | other | Python | |||||
WaveNet vocoder | ||||||||||
Whisper Timestamped | 1,217 | 3 | 2 months ago | 3 | December 08, 2023 | 15 | agpl-3.0 | Python | ||
Multilingual Automatic Speech Recognition with word-level timestamps and confidence | ||||||||||
Sincnet | 764 | 3 years ago | 22 | mit | Python | |||||
SincNet is a neural architecture for efficiently processing raw audio samples. | ||||||||||
Fullsubnet | 443 | 7 months ago | 32 | mit | Python | |||||
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement." | ||||||||||
Ims Toucan | 426 | 2 months ago | 29 | apache-2.0 | Python | |||||
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality. | ||||||||||
Nnmnkwii | 375 | 15 | 1 | a year ago | 26 | January 04, 2022 | 6 | other | Python | |
Library to build speech synthesis systems designed for easy and fast prototyping. | ||||||||||
Unispeech | 328 | 10 months ago | 12 | other | Python | |||||
UniSpeech - Large Scale Self-Supervised Learning for Speech |