Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python speech synthesis
python
x
speech-synthesis
x
241 search results found
Istftnet Pytorch
⭐
76
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
Styletts Vc
⭐
76
Official Implementation of StyleTTS-VC
Ssl_speech_restoration
⭐
75
SelfRemaster: SSL Speech Restoration
Emotionalconversionstargan
⭐
75
This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".
Rvc Tts Webui
⭐
75
Text-to-Speech Gradio webui using RVC and edge-tts
Hiftnet
⭐
72
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform
Tdmelodic
⭐
71
A Japanese accent dictionary generator
Multi Singer
⭐
71
PyTorch Implementation of Multi-Singer (ACM-MM'21)
Comprehensive E2e Tts
⭐
71
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
Cnn_vocoder
⭐
69
A fast cnn-based vocoder
Speech_ai
⭐
68
Speech to speech bot built with Python
Nlp Guide
⭐
61
Natural Language Processing (NLP). Covering topics such as Tokenization, Part Of Speech tagging (POS), Machine translation, Named Entity Recognition (NER), Classification, and Sentiment analysis.
Gst Tacotron
⭐
57
Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09017.pdf)
Hifigan
⭐
56
An 16kHz implementation of HiFi-GAN for soft-vc.
Fastvocoder
⭐
56
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
Jejueo
⭐
56
Jejueo Datasets for Machine Translation and Speech Synthesis
Nanoflow
⭐
53
PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity."
Diffgan Tts
⭐
52
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Neon Tts Plugin Coqui
⭐
52
Coqui AI TTS plugin
Few Shot Transformer Tts
⭐
51
Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.
Vae_tacotron2
⭐
51
VAE Tacotron 2, an alternative of GST Tacotron
Tacotron
⭐
50
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Sova Tts Engine
⭐
50
Tacotron2 based engine for the SOVA-TTS project
Cs224n Gpu That Talks
⭐
50
Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)
Nvda Ibmtts Driver
⭐
49
This project is aimed at developing and maintaining the NVDA IBMTTS driver. IBMTTS is a synthesizer similar to Eloquence. Please send your ideas and contributions here!
Mb Istft Vits2
⭐
49
Application of MB-iSTFT-VITS components to vits2_pytorch
Msspeech
⭐
48
not official API for Microsoft speech synthesis from Microsoft Edge web browser read aloud
Voicekit Examples
⭐
46
Examples on how to use Tinkoff Voicekit
Voice Privacy Challenge 2022
⭐
45
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
Verbify Tts
⭐
44
Simple and free Text-to-Speech (TTS) engine that reads for you any text on your screen with high-quality voices powered by AI models.
Zerospeech
⭐
44
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
Fre Gan Pytorch
⭐
43
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Source Filter Vae
⭐
42
Learning and controlling the source-filter representation of speech with a variational autoencoder
Ukrainian Tts Datasets
⭐
42
🇺🇦 Open Source Ukrainian Text-to-Speech datasets
Lightspeech
⭐
42
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Qppwg
⭐
41
Quasi-Periodic Parallel WaveGAN Pytorch implementation
Expressive Fastspeech2
⭐
41
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Tacotron2
⭐
40
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Univnet Pytorch
⭐
39
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation
Aivoice
⭐
39
Deep CNN networks for Speech Synthesis
Sova Tts Tps
⭐
38
NLP-preprocessor for the SOVA-TTS project
Tts Arabic Pytorch
⭐
37
TTS models for Arabic (Tacotron2, FastPitch)
Daft Exprt
⭐
36
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Tfgan
⭐
36
TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Wavegrad2
⭐
35
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Turkicasr
⭐
35
A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.
Avocodo Pytorch
⭐
34
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
Ubisoft Laforge Daft Exprt
⭐
34
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Unicats Ctx Txt2vec
⭐
34
[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS
Vectorquantizedcpc
⭐
33
Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion
Lexconvert
⭐
32
Convert phoneme codes and lexicon formats for English speech synths
Voice Assistant Chatgpt
⭐
31
Voice Assistant based on Whisper ASR and ChatGPT API
Articulatory
⭐
30
Deep Articulatory Synthesis and Inversion
Tts Tortoise Gradio
⭐
29
A Gradio setup for Tortoise TTS.
Sc Cnn
⭐
29
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
Talknet2 Pytorch
⭐
28
TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.
Tensorflow_wavenet_vocoder
⭐
27
wavenet vocoder using tensorflow
Voice_chatbot
⭐
26
Chatbot in russian with speech recognition using PocketSphinx and speech synthesis using RHVoice. The AttentionSeq2Seq model is used. Imlemented using Python3+TensorFlow+Keras.
Dialog
⭐
26
A PyTorch Implementation of japanese chatbot using BERT and Transformer's decoder
Vaenar Tts
⭐
25
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
Istft Avocodo Pytorch
⭐
25
Ultrafast GAN based Vocoder for Text to Speech
Audiomae Pytorch
⭐
25
Unofficial PyTorch implementation of Masked Autoencoders that Listen
Robust_fine_grained_prosody_control
⭐
24
PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis
Extensibletts Pytorch
⭐
24
An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery
Fcl Taco2
⭐
24
Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021
Speech Training Recorder
⭐
24
Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesis.
Realtimesingingsynthesizer
⭐
23
Live Coding Singing Synthesizer. Python sinsy-NG wrapper.
Vcvits
⭐
23
Non Parallel Voice Conversion based on VITS
Comprehensive Tacotron2
⭐
23
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.
Voice Conversion
⭐
22
an tutorial implement of voice conversion using pytorch
Turkish Text To Speech
⭐
22
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
Voice100
⭐
22
Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.
Fastpitchformant
⭐
21
PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
Best Rq Pytorch
⭐
20
Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.
Gradint
⭐
19
Graduated Interval Recall program
Echo Xi
⭐
19
Speech to text to speech using Elevenlabs
Sonata
⭐
19
A cross-platform engine for neural TTS models.
Singlevc
⭐
18
Any-to-one voice conversion using the data augment strategy: pitch shifted and duration remained.
Speechnet
⭐
18
Automatic Speech Recognition
Tacotron
⭐
18
tacotron for research on Chinese speech synthesis and Taiwanese speech synthesis from Chinese input text sequence with different granularities
Zero Shot Tts
⭐
18
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Styletts2
⭐
18
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
Transformer Tts
⭐
18
A Tensorflow Implementation like "Neural Speech Synthesis with Transformer Network" Port From OpenSeq2Seq
Wavthruvec_pytorch
⭐
17
An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"
Neural Lexicon Reader
⭐
17
Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge
Openai_tts
⭐
17
OpenAI TTS custom component for HA
Phones
⭐
17
A collection of utilities for handling IPA phones.
Vocal Tube Model
⭐
16
a very simple vocal tract model, few tube model. generate vowel sound by it
Average_prosody
⭐
16
Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Workshop
Fast Seamlessm4t Onnx
⭐
16
ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation
Mediumvc
⭐
16
Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features
Jam Ai
⭐
15
Jam-AI a personal AI voice-controlled assistant using technologies such as Speech Recognition, NLP, TTS, and stuffs
Autovocoder
⭐
15
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
Ai Npcs That Can Control Their Actions Along With Dialogue
⭐
14
AI NPCs that can control their actions along with dialogue. For instance, if I ask an NPC to tell me its favorite magic spell, it not only tells me the spell but also performs it!
Idiaptts
⭐
14
A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis
Waveglow
⭐
14
A tensorflow implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis
Gnuspeech_trm
⭐
13
Standalone version of the Gnuspeech Tube Resonance Model (TRM)
Qpnet
⭐
12
Quasi-Periodic WaveNet Pytorch implementation
Espnet_tts_frontend
⭐
12
Text frontend for ESPnet tts recipes
Tacotron_pytorch
⭐
12
Tacotron implementation of pytorch
Related Searches
Python Django (28,897)
Python Machine Learning (20,195)
Python Flask (17,643)
Python Pytorch (15,280)
Python Dataset (15,278)
Python Docker (14,113)
Python Tensorflow (13,737)
Python Command Line (13,351)
Python Deep Learning (13,092)
Python Jupyter Notebook (12,976)
101-200 of 241 search results
< Previous
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.