Awesome Open Source
Search results for deep learning text to speech
deep-learning
text-to-speech
71 search results found
Tts
⭐
28,328
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Nemo
⭐
9,041
NeMo: a toolkit for conversational AI
Tts
⭐
8,144
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Emotivoice
⭐
5,739
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Vits
⭐
5,589
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Styletts2
⭐
3,464
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Awesome Prompt Engineering
⭐
2,780
This repository contains hand-curated resources for Prompt Engineering, with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM, etc.
Openseq2seq
⭐
1,393
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Hifi Gan
⭐
1,376
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Merlin
⭐
1,189
This is now the official location of the Merlin project.
Transformertts
⭐
977
🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
Tts Generation Webui
⭐
970
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet)
Voice Cloning App
⭐
879
A Python/Pytorch app for easily synthesising human voices
Diffwave
⭐
628
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Transformer Tts
⭐
599
A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
Forwardtacotron
⭐
487
⏩ Generating speech in a single forward pass without any attention!
Ims Toucan
⭐
426
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Voicebox Pytorch
⭐
401
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
Hms Ml Demo
⭐
333
HMS ML Demo provides an example of integrating the Huawei ML Kit service into applications. It demonstrates how to integrate services provided by ML Kit, such as face detection, text recognition, image segmentation, ASR, and TTS.
Styletts
⭐
310
Official Implementation of StyleTTS
Vits2_pytorch
⭐
286
Unofficial VITS2 TTS implementation in PyTorch
Matcha Tts
⭐
276
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Wavegrad
⭐
239
A fast, high-quality neural vocoder.
Speech_dataset
⭐
229
The dataset of Speech Recognition
Glow Tts
⭐
199
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
Ttslearn
⭐
197
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Willow Inference Server
⭐
190
Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS
Spear Tts Pytorch
⭐
178
Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch
Nisqa
⭐
174
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
Viettts
⭐
151
Vietnamese Text to Speech library
Comprehensive Transformer Tts
⭐
146
A Non-Autoregressive Transformer based TTS, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS.
Bvae Tts
⭐
145
Official implementation of BVAE-TTS
Neural Hmm
⭐
143
Neural HMMs are all you need (for high-quality attention-free TTS)
Lihq
⭐
133
Long-Inference, High Quality Synthetic Speaker (AI avatar/ AI presenter)
Mongolian Nlp
⭐
126
Useful resources for Mongolian NLP
Spokestack Python
⭐
124
Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.
Machinelearning_deeplearning
⭐
113
Shared material on Machine Learning and Deep Learning.
Pflowtts_pytorch
⭐
107
Unofficial implementation of NVIDIA P-Flow TTS paper
Msmc Tts
⭐
100
Official implementation of Multi-Stage Multi-Codebook (MSMC) TTS
Vits2
⭐
98
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
Whisper Auto Transcribe
⭐
91
Auto transcribe tool based on whisper
Tacotron Pytorch
⭐
90
A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
Univnet
⭐
86
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
Pytorch Dc Tts
⭐
85
Text to Speech with PyTorch (English and Mongolian)
Tts Recipes
⭐
84
🐸TTS recipes for different datasets
Bark
⭐
80
WIP Library Text To Speech From Suno AI's Bark in C/C++ for fast inference
Nonparaseq2seqvc_code
⭐
77
Implementation code of non-parallel sequence-to-sequence VC
Hiftnet
⭐
72
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform
Comprehensive E2e Tts
⭐
71
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS
Persian Tts Coqui
⭐
65
Persian/Farsi text-to-speech (TTS) training using Coqui TTS
Wavegrad2
⭐
63
Unofficial Pytorch Implementation of WaveGrad2
Nlp Pretrained Model
⭐
63
A collection of Natural language processing pre-trained models.
Insightsolver Colab
⭐
62
InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning, machine learning, and related models.
Noteify
⭐
47
🔎 A currency detection app for the visually impaired that automatically recognizes an Indian currency note photographed by the camera and announces it through the computer or phone audio, using a TensorFlow model in the background 🔎
Friend.ly
⭐
41
A social media platform with a friend recommendation engine based on personality trait extraction
Text Normalization Demo
⭐
40
Demonstration of the results in "Text Normalization using Memory Augmented Neural Networks", Authors: Subhojeet Pramanik, Aman Hussain
Jetson Voice
⭐
39
ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT
Tts Arabic Pytorch
⭐
37
TTS models for Arabic (Tacotron2, FastPitch)
Bark Speaker Directory
⭐
32
Site for sharing Bark voices
Asrgen
⭐
28
Attacking Speaker Recognition with Deep Generative Models
Comprehensive Tacotron2
⭐
23
PyTorch implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS, plus several techniques to improve the robustness and efficiency of the model.
Dctts2
⭐
22
Deep Convolution Text to Speech
Text2speech
⭐
21
Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023
Styletts2
⭐
18
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
Fast Seamlessm4t Onnx
⭐
16
ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation
Tts_tf
⭐
12
WIP Tensorflow implementation of https://github.com/mozilla/TTS
Multi Speaker Neural Vocoder
⭐
12
Bachelor's thesis carried out at Universitat Politècnica de Catalunya in partial fulfilment of the requirements for the degree in Telecommunications Technologies and Services Engineering
Jen 1 Composer Pytorch
⭐
11
Unofficial implementation of JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation (https://arxiv.org/abs/2310.19180)
Eye Handicapped Service
⭐
9
[X:AI Conference] Guide-dog (안내見) service for the visually impaired
Deep Learning Tts Template
⭐
8
This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).
Deep_throat
⭐
8
speech synthesis program
Dc_tts Transfer Learning
⭐
7
Transfer learning exploration of dc_tts text-to-speech model
Deepdubpy
⭐
6
A complete end-to-end deep learning system to generate high-quality, human-like speech in English for Korean dramas (WIP)
Transformer Text To Speech
⭐
6
Pytorch implementation of Transformer-TTS for converting text into speech.
Data_driven_ai_voice_cloning
⭐
5
This repository contains the code for the main part of my master's thesis at Politecnico di Torino in Data Science & Engineering
Copyright 2018-2024 Awesome Open Source. All rights reserved.