Awesome Open Source
Search results for tts speech synthesis
speech-synthesis
tts
165 search results found
Tts
⭐
32,406
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Paddlespeech
⭐
10,878
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award. [Security hardening in progress; interaction paused, please wait patiently]
Nemo
⭐
9,041
NeMo: a toolkit for conversational AI
Espnet
⭐
7,563
End-to-End Speech Processing Toolkit
Emotivoice
⭐
5,739
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Vits
⭐
5,589
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Silero Models
⭐
4,088
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Styletts2
⭐
3,464
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Diffsinger
⭐
3,123
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
Tacotron
⭐
2,845
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Lingvo
⭐
2,776
Lingvo
Piper
⭐
2,586
A fast, local neural text to speech system
Edge Tts
⭐
2,532
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Whisperspeech
⭐
2,419
An Open Source text-to-speech system built by inverting Whisper.
Deepvoice3_pytorch
⭐
1,906
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Wavernn
⭐
1,761
WaveRNN Vocoder + TTS
Parallelwavegan
⭐
1,427
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
Hifi Gan
⭐
1,376
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Open Speech Corpora
⭐
830
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Natspeech
⭐
814
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
Yourtts
⭐
741
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
Voicefixer
⭐
735
General Speech Restoration
Fastspeech
⭐
723
An implementation of FastSpeech based on PyTorch.
Irene Voice Assistant
⭐
644
Irina: a Russian voice assistant that works offline. Supports skills via plugins.
Diffwave
⭐
628
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Transformer Tts
⭐
599
A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
Thorsten Voice
⭐
475
Thorsten-Voice: A free-to-use, offline, high-quality German TTS voice should be available for every project without any licensing struggles.
Aspeak
⭐
459
A simple text-to-speech client for Azure TTS API.
Parakeet
⭐
459
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)
Ai Waifu Vtuber
⭐
457
AI Vtuber for Streaming on Youtube/Twitch
Gantts
⭐
441
PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
Ims Toucan
⭐
426
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Tiktok Voice
⭐
394
Simple Python script to interact with the TikTok TTS API
Kan Tts
⭐
377
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-
Dl For Emo Tts
⭐
350
💻 🤖 A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech 🔈
Styletts
⭐
310
Official Implementation of StyleTTS
Text2video
⭐
294
ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".
Speech Recognition Uk
⭐
262
Speech Recognition for Ukrainian
Voicefixer_main
⭐
244
General Speech Restoration
Wavegrad
⭐
239
A fast, high-quality neural vocoder.
Speech_dataset
⭐
229
The dataset of Speech Recognition
Dsnote
⭐
225
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
Msedgetts
⭐
213
A simple Azure Speech Service module that uses the Microsoft Edge Read Aloud API
Portaspeech
⭐
211
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Glow Tts
⭐
199
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
Ttslearn
⭐
197
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Durian Pytorch
⭐
182
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
Nisqa
⭐
174
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
Cotatron
⭐
163
Official code for Cotatron @ INTERSPEECH 2020
Ueazspeech
⭐
162
This plugin integrates Azure Speech Cognitive Services in Unreal Engine.
Ukrainian Tts
⭐
161
Ukrainian TTS (text-to-speech) using ESPNET
Tensorvox
⭐
160
Desktop application for neural speech synthesis written in C++
Comprehensive Transformer Tts
⭐
146
A Non-Autoregressive Transformer based TTS, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS.
Nix Tts
⭐
138
🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation
M.i.t.s.u.h.a.
⭐
134
World's First Multilingual Inexpensive Therapeutic Sophisticated Ultra-responsive Holographic Agent. In simple terms, an AI you can talk to and it'll talk back with a body using VTube Studio.
Summertts
⭐
132
SummerTTS is a standalone, C++-based Chinese and English speech synthesis (TTS) project that runs locally without a network connection, has almost no dependencies, and can be used for Chinese TTS with a one-click build.
Whisper Vits Japanese
⭐
128
VITS Japanese with Whisper as data processor (you can train your VITS even if you only have audio)
Easy Speech
⭐
127
Cross browser Speech Synthesis also known as Text to speech or TTS; no dependencies; uses Web Speech API
Spokestack Python
⭐
124
Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.
Comospeech
⭐
112
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
Cross Speaker Emotion Transfer
⭐
104
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
Stylespeech
⭐
103
Official implementation of Meta-StyleSpeech and StyleSpeech
Editts
⭐
100
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)
Stylespeech
⭐
100
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
Msmc Tts
⭐
100
Official implementation of Multi-Stage Multi-Codebook (MSMC) TTS
Vits2
⭐
98
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
Manim Voiceover
⭐
96
Manim plugin for all things voiceover
Awesome Singing Voice Synthesis And Singing Voice Conversion
⭐
94
A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting works (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR...etc).
Vits Mandarin Biaobei
⭐
91
Application of VITS to Mandarin TTS
Voicesmith
⭐
87
[WIP] VoiceSmith makes training text to speech models easy.
Univnet
⭐
86
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
Diffsinger
⭐
85
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
Pytorch Dc Tts
⭐
85
Text to Speech with PyTorch (English and Mongolian)
Istftnet Pytorch
⭐
76
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
Rvc Tts Webui
⭐
75
Text-to-Speech Gradio webui using RVC and edge-tts
Unsuperior Ai Waifu
⭐
73
AI waifu that can run on your phone or PC
Hiftnet
⭐
72
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform
Multi Singer
⭐
71
PyTorch Implementation of Multi-Singer (ACM-MM'21)
Comprehensive E2e Tts
⭐
71
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
Cnn_vocoder
⭐
69
A fast cnn-based vocoder
Persian Tts Coqui
⭐
65
Persian/Farsi text-to-speech (TTS) training using Coqui TTS
Wavegrad2
⭐
63
Unofficial Pytorch Implementation of WaveGrad2
Adaspeech
⭐
62
AdaSpeech: Adaptive Text to Speech for Custom Voice
Diffgan Tts
⭐
52
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Sova Tts Engine
⭐
50
Tacotron2 based engine for the SOVA-TTS project
Unitytts
⭐
49
Text to Speech in Unity.
Mb Istft Vits2
⭐
49
Application of MB-iSTFT-VITS components to vits2_pytorch
Lvcnet
⭐
47
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
Verbify Tts
⭐
44
Simple and free Text-to-Speech (TTS) engine that reads any text on your screen aloud with high-quality voices powered by AI models.
Fre Gan Pytorch
⭐
43
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Ukrainian Tts Datasets
⭐
42
🇺🇦 Open Source Ukrainian Text-to-Speech datasets
Lightspeech
⭐
42
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Expressive Fastspeech2
⭐
41
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Tacotron2
⭐
40
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Univnet Pytorch
⭐
39
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation
Aivoice
⭐
39
Deep CNN networks for Speech Synthesis
Elevenlabs Dotnet
⭐
36
A Non-Official ElevenLabs RESTful API Client for dotnet
Tfgan
⭐
36
TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Daft Exprt
⭐
36
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Wavegrad2
⭐
35
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis