Awesome Open Source

Search results for deep learning text to speech

71 search results found

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

NeMo: a toolkit for conversational AI

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Emotivoice ⭐ 5,739

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Styletts2 ⭐ 3,464

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Awesome Prompt Engineering ⭐ 2,780

This repository contains hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM, etc.

Openseq2seq ⭐ 1,393

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Hifi Gan ⭐ 1,376

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Merlin ⭐ 1,189

This is now the official location of the Merlin project.

Transformertts ⭐ 977

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

Tts Generation Webui ⭐ 970

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet)

Voice Cloning App ⭐ 879

A Python/Pytorch app for easily synthesising human voices

Diffwave ⭐ 628

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Transformer Tts ⭐ 599

A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"

Forwardtacotron ⭐ 487

⏩ Generating speech in a single forward pass without any attention!

Ims Toucan ⭐ 426

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Voicebox Pytorch ⭐ 401

Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch

Hms Ml Demo ⭐ 333

HMS ML Demo provides an example of integrating Huawei ML Kit service into applications. This example demonstrates how to integrate services provided by ML Kit, such as face detection, text recognition, image segmentation, asr, and tts.

Styletts ⭐ 310

Official Implementation of StyleTTS

Vits2_pytorch ⭐ 286

unofficial vits2-TTS implementation in pytorch

Matcha Tts ⭐ 276

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Wavegrad ⭐ 239

A fast, high-quality neural vocoder.

Speech_dataset ⭐ 229

The dataset of Speech Recognition

Glow Tts ⭐ 199

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Ttslearn ⭐ 197

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Willow Inference Server ⭐ 190

Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS

Spear Tts Pytorch ⭐ 178

Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Viettts ⭐ 151

Vietnamese Text to Speech library

Comprehensive Transformer Tts ⭐ 146

A Non-Autoregressive Transformer based TTS, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS.

Bvae Tts ⭐ 145

Official implementation of BVAE-TTS

Neural Hmm ⭐ 143

Neural HMMs are all you need (for high-quality attention-free TTS)

Long-Inference, High Quality Synthetic Speaker (AI avatar/ AI presenter)

Mongolian Nlp ⭐ 126

Useful resources for Mongolian NLP

Spokestack Python ⭐ 124

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.

Machinelearning_deeplearning ⭐ 113

Shared material on Machine Learning and Deep Learning.

Pflowtts_pytorch ⭐ 107

Unofficial implementation of NVIDIA P-Flow TTS paper

Msmc Tts ⭐ 100

Official Implementation of Multi-Stage Multi-Codebook (MSMC) TTS

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

Whisper Auto Transcribe ⭐ 91

Auto transcribe tool based on whisper

Tacotron Pytorch ⭐ 90

A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model

Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)

Pytorch Dc Tts ⭐ 85

Text to Speech with PyTorch (English and Mongolian)

Tts Recipes ⭐ 84

🐸TTS recipes for different datasets

WIP Library Text To Speech From Suno AI's Bark in C/C++ for fast inference

Nonparaseq2seqvc_code ⭐ 77

Implementation code of non-parallel sequence-to-sequence VC

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform

Comprehensive E2e Tts ⭐ 71

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

Persian Tts Coqui ⭐ 65

Persian/Farsi text-to-speech (TTS) training using Coqui TTS

Wavegrad2 ⭐ 63

Unofficial Pytorch Implementation of WaveGrad2

Nlp Pretrained Model ⭐ 63

A collection of Natural language processing pre-trained models.

Insightsolver Colab ⭐ 62

InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning, machine learning, and related models.

🔎 A currency detection app for the visually impaired that automatically recognizes an Indian currency note photographed by the camera and announces it through computer/phone audio, using a TensorFlow model in the background 🔎

Friend.ly ⭐ 41

A social media platform with a friend recommendation engine based on personality trait extraction

Text Normalization Demo ⭐ 40

Demonstration of the results in "Text Normalization using Memory Augmented Neural Networks", Authors: Subhojeet Pramanik, Aman Hussain

Jetson Voice ⭐ 39

ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT

Tts Arabic Pytorch ⭐ 37

TTS models for Arabic (Tacotron2, FastPitch)

Bark Speaker Directory ⭐ 32

Site for sharing Bark voices

Attacking Speaker Recognition with Deep Generative Models

Comprehensive Tacotron2 ⭐ 23

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Deep Convolution Text to Speech

Text2speech ⭐ 21

Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023

Styletts2 ⭐ 18

🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning

Fast Seamlessm4t Onnx ⭐ 16

ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation

WIP Tensorflow implementation of https://github.com/mozilla/TTS

Multi Speaker Neural Vocoder ⭐ 12

Bachelor's thesis carried out at Universitat Politecnica de Catalunya in partial fulfilment of the requirements for the degree in Telecommunications Technologies and Services Engineering

Jen 1 Composer Pytorch ⭐ 11

Unofficial implementation of JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation (https://arxiv.org/abs/2310.19180)

Eye Handicapped Service ⭐ 9

[ X:AI Conference ] Guide-dog (안내見) service for the visually impaired

Deep Learning Tts Template ⭐ 8

This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).

Deep_throat ⭐ 8

speech synthesis program

Dc_tts Transfer Learning ⭐ 7

Transfer learning exploration of dc_tts text-to-speech model

Deepdubpy ⭐ 6

A complete end-to-end Deep Learning system to generate high-quality, human-like speech in English for Korean Drama (WIP)

Transformer Text To Speech ⭐ 6

Pytorch implementation of Transformer-TTS for converting text into speech.

Data_driven_ai_voice_cloning ⭐ 5

This repository contains the code for the main part of my master's thesis at Politecnico di Torino in Data Science & Engineering

Related Searches

Python Deep Learning (19,753)

Jupyter Notebook Deep Learning (10,328)

Deep Learning Pytorch (6,246)

Deep Learning Neural Network (5,868)

Deep Learning Tensorflow (4,441)

Deep Learning Convolutional Neural Networks (4,142)

Network Deep Learning (3,532)

Deep Learning Computer Vision (3,365)

Deep Learning Artificial Intelligence (3,135)

Deep Learning Natural Language Processing (2,897)
