Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Real Time Voice Cloning | 49,550 | | | | 3 months ago | | | 187 | other | Python |
Clone a voice in 5 seconds to generate arbitrary speech in real-time | ||||||||||
Tts | 28,328 | 19 | a month ago | 90 | December 01, 2023 | 101 | mpl-2.0 | Python | ||
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production | ||||||||||
Nemo | 9,041 | 2 | 8 | 3 months ago | 70 | October 25, 2023 | 109 | apache-2.0 | Python | |
NeMo: a toolkit for conversational AI | ||||||||||
Tts | 8,144 | | | | 5 months ago | | | 21 | mpl-2.0 | Jupyter Notebook |
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) | ||||||||||
Espnet | 7,563 | 5 | 3 months ago | 33 | October 25, 2023 | 270 | apache-2.0 | Python | ||
End-to-End Speech Processing Toolkit | ||||||||||
Emotivoice | 5,739 | | | | 3 months ago | | | 73 | apache-2.0 | Python |
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine | ||||||||||
Vits | 5,589 | | | | 5 months ago | | | 142 | mit | Python |
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech | ||||||||||
Styletts2 | 3,464 | | | | 3 months ago | | | 31 | mit | Python |
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models | ||||||||||
Stable Diffusion | 1,501 | | | | 3 months ago | | | 11 | gpl-3.0 | Jupyter Notebook |
Stable Diffusion, SDXL, LoRA Training, DreamBooth Training, Automatic1111 Web UI, DeepFake, Deep Fakes, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, NoteBooks, ControlNet, TTS, Voice Cloning, AI, AI News, ML, ML News, News, Tech, Tech News, Kohya LoRA, Kandinsky 2, DeepFloyd IF, Midjourney | ||||||||||
Hifi Gan | 1,376 | | | | 9 months ago | | | 82 | mit | Python |
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis |