Awesome Open Source

Search results for python speech synthesis

241 search results found

Istftnet Pytorch ⭐ 76

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

Styletts Vc ⭐ 76

Official Implementation of StyleTTS-VC

Ssl_speech_restoration ⭐ 75

SelfRemaster: SSL Speech Restoration

Emotionalconversionstargan ⭐ 75

This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".

Rvc Tts Webui ⭐ 75

Text-to-Speech Gradio webui using RVC and edge-tts

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform

Tdmelodic ⭐ 71

A Japanese accent dictionary generator

Multi Singer ⭐ 71

PyTorch Implementation of Multi-Singer (ACM-MM'21)

Comprehensive E2e Tts ⭐ 71

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

Cnn_vocoder ⭐ 69

A fast CNN-based vocoder

Speech_ai ⭐ 68

Speech to speech bot built with Python

Nlp Guide ⭐ 61

Natural Language Processing (NLP). Covering topics such as Tokenization, Part Of Speech tagging (POS), Machine translation, Named Entity Recognition (NER), Classification, and Sentiment analysis.

Gst Tacotron ⭐ 57

Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09017.pdf)

A 16kHz implementation of HiFi-GAN for soft-vc.

Fastvocoder ⭐ 56

Includes Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN; NHV may be added in the future.

Jejueo Datasets for Machine Translation and Speech Synthesis

Nanoflow ⭐ 53

PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity."

Diffgan Tts ⭐ 52

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Neon Tts Plugin Coqui ⭐ 52

Coqui AI TTS plugin

Few Shot Transformer Tts ⭐ 51

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

Vae_tacotron2 ⭐ 51

VAE Tacotron 2, an alternative of GST Tacotron

Tacotron ⭐ 50

A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis

Sova Tts Engine ⭐ 50

Tacotron2 based engine for the SOVA-TTS project

Cs224n Gpu That Talks ⭐ 50

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Nvda Ibmtts Driver ⭐ 49

This project is aimed at developing and maintaining the NVDA IBMTTS driver. IBMTTS is a synthesizer similar to Eloquence. Please send your ideas and contributions here!

Mb Istft Vits2 ⭐ 49

Application of MB-iSTFT-VITS components to vits2_pytorch

Msspeech ⭐ 48

Unofficial API for Microsoft speech synthesis, based on the Read Aloud feature of the Microsoft Edge web browser

Voicekit Examples ⭐ 46

Examples on how to use Tinkoff Voicekit

Voice Privacy Challenge 2022 ⭐ 45

Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software

Verbify Tts ⭐ 44

Simple and free Text-to-Speech (TTS) engine that reads any text on your screen aloud with high-quality voices powered by AI models.

Zerospeech ⭐ 44

VQ-VAE for Acoustic Unit Discovery and Voice Conversion

Fre Gan Pytorch ⭐ 43

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Source Filter Vae ⭐ 42

Learning and controlling the source-filter representation of speech with a variational autoencoder

Ukrainian Tts Datasets ⭐ 42

🇺🇦 Open Source Ukrainian Text-to-Speech datasets

Lightspeech ⭐ 42

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Quasi-Periodic Parallel WaveGAN Pytorch implementation

Expressive Fastspeech2 ⭐ 41

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Tacotron2 ⭐ 40

A PyTorch implementation of Tacotron2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".

Univnet Pytorch ⭐ 39

UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation

Deep CNN networks for Speech Synthesis

Sova Tts Tps ⭐ 38

NLP-preprocessor for the SOVA-TTS project

Tts Arabic Pytorch ⭐ 37

TTS models for Arabic (Tacotron2, FastPitch)

Daft Exprt ⭐ 36

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

Wavegrad2 ⭐ 35

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

Turkicasr ⭐ 35

A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.

Avocodo Pytorch ⭐ 34

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Ubisoft Laforge Daft Exprt ⭐ 34

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Unicats Ctx Txt2vec ⭐ 34

[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS

Vectorquantizedcpc ⭐ 33

Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion

Lexconvert ⭐ 32

Convert phoneme codes and lexicon formats for English speech synths

Voice Assistant Chatgpt ⭐ 31

Voice Assistant based on Whisper ASR and ChatGPT API

Articulatory ⭐ 30

Deep Articulatory Synthesis and Inversion

Tts Tortoise Gradio ⭐ 29

A Gradio setup for Tortoise TTS.

SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems

Talknet2 Pytorch ⭐ 28

TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.

Tensorflow_wavenet_vocoder ⭐ 27

WaveNet vocoder using TensorFlow

Voice_chatbot ⭐ 26

Chatbot in Russian with speech recognition using PocketSphinx and speech synthesis using RHVoice. The AttentionSeq2Seq model is used. Implemented using Python3+TensorFlow+Keras.

A PyTorch implementation of a Japanese chatbot using BERT and a Transformer decoder

Vaenar Tts ⭐ 25

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Istft Avocodo Pytorch ⭐ 25

Ultrafast GAN based Vocoder for Text to Speech

Audiomae Pytorch ⭐ 25

Unofficial PyTorch implementation of Masked Autoencoders that Listen

Robust_fine_grained_prosody_control ⭐ 24

PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis

Extensibletts Pytorch ⭐ 24

An extensible speech synthesis system built with PyTorch; the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery

Fcl Taco2 ⭐ 24

Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021

Speech Training Recorder ⭐ 24

Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesis.

Realtimesingingsynthesizer ⭐ 23

Live Coding Singing Synthesizer. Python sinsy-NG wrapper.

Non Parallel Voice Conversion based on VITS

Comprehensive Tacotron2 ⭐ 23

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Voice Conversion ⭐ 22

A tutorial implementation of voice conversion using PyTorch

Turkish Text To Speech ⭐ 22

Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan

Voice100 ⭐ 22

Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.

Fastpitchformant ⭐ 21

PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis

Best Rq Pytorch ⭐ 20

Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.

Graduated Interval Recall program

Speech to text to speech using Elevenlabs

A cross-platform engine for neural TTS models.

Singlevc ⭐ 18

Any-to-one voice conversion using a data augmentation strategy: pitch is shifted while duration is kept unchanged.

Speechnet ⭐ 18

Automatic Speech Recognition

Tacotron ⭐ 18

Tacotron for research on Chinese and Taiwanese speech synthesis from Chinese input text sequences with different granularities

Zero Shot Tts ⭐ 18

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Styletts2 ⭐ 18

🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning

Transformer Tts ⭐ 18

A TensorFlow implementation in the style of "Neural Speech Synthesis with Transformer Network", ported from OpenSeq2Seq

Wavthruvec_pytorch ⭐ 17

An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"

Neural Lexicon Reader ⭐ 17

Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge

Openai_tts ⭐ 17

OpenAI TTS custom component for HA

A collection of utilities for handling IPA phones.

Vocal Tube Model ⭐ 16

A very simple vocal tract model (a few-tube model) that generates vowel sounds.

Average_prosody ⭐ 16

Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Workshop

Fast Seamlessm4t Onnx ⭐ 16

ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation

Mediumvc ⭐ 16

Any-to-any voice conversion using synthetic speaker-specific speech as intermediate features

Jam-AI: a personal voice-controlled AI assistant using technologies such as speech recognition, NLP, and TTS

Autovocoder ⭐ 15

Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing

Ai Npcs That Can Control Their Actions Along With Dialogue ⭐ 14

AI NPCs that can control their actions along with dialogue. For instance, if I ask an NPC to tell me its favorite magic spell, it not only tells me the spell but also performs it!

Idiaptts ⭐ 14

A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis

Waveglow ⭐ 14

A TensorFlow implementation of WaveGlow: A Flow-based Generative Network for Speech Synthesis

Gnuspeech_trm ⭐ 13

Standalone version of the Gnuspeech Tube Resonance Model (TRM)

Quasi-Periodic WaveNet Pytorch implementation

Espnet_tts_frontend ⭐ 12

Text frontend for ESPnet tts recipes

Tacotron_pytorch ⭐ 12

Tacotron implementation in PyTorch

101-200 of 241 search results

Copyright 2018-2024 Awesome Open Source. All rights reserved.