| coqui-ai/TTS |
25,894 |
|
0 |
19 |
over 2 years ago |
90 |
December 01, 2023 |
101 |
mpl-2.0 |
Python |
| 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production |
| PaddlePaddle/PaddleSpeech |
12,635 |
|
0 |
4 |
12 days ago |
9 |
May 27, 2022 |
437 |
apache-2.0 |
Python |
| Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award. |
| NVIDIA/NeMo |
9,041 |
|
2 |
8 |
over 2 years ago |
70 |
October 25, 2023 |
109 |
apache-2.0 |
Python |
| NeMo: a toolkit for conversational AI |
| espnet/espnet |
7,563 |
|
0 |
5 |
over 2 years ago |
33 |
October 25, 2023 |
270 |
apache-2.0 |
Python |
| End-to-End Speech Processing Toolkit |
| netease-youdao/EmotiVoice |
5,739 |
|
0 |
0 |
over 2 years ago |
0 |
|
73 |
apache-2.0 |
Python |
| EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine |
| jaywalnut310/vits |
5,589 |
|
0 |
0 |
over 2 years ago |
0 |
|
142 |
mit |
Python |
| VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech |
| snakers4/silero-models |
4,088 |
|
0 |
4 |
over 2 years ago |
4 |
June 12, 2022 |
8 |
other |
Jupyter Notebook |
| Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple |
| TensorSpeech/TensorFlowTTS |
3,558 |
|
0 |
1 |
over 2 years ago |
8 |
August 21, 2021 |
8 |
apache-2.0 |
Python |
| :stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages) |
| yl4579/StyleTTS2 |
3,464 |
|
0 |
0 |
over 2 years ago |
0 |
|
31 |
mit |
Python |
| StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models |
| zzw922cn/awesome-speech-recognition-speech-synthesis-papers |
3,126 |
|
0 |
0 |
over 2 years ago |
0 |
|
2 |
mit |
|
| Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC) |