Awesome Open Source
Search results for speech synthesis
472 search results found
Hifigan
⭐
56
A 16 kHz implementation of HiFi-GAN for soft-vc.
Web Speech Angular
⭐
56
A Web Application that implements Speech Recognition and Speech Synthesis using Web APIs, Angular, TypeScript, RxJS, and Angular Material
Fastvocoder
⭐
56
Includes Basis-MelGAN, MelGAN, HifiGAN, and Multiband-HifiGAN; NHV may be added in the future.
Discordearsbot
⭐
56
A speech-to-text framework and bot for Discord. Take control of your Discord server using speech and voice commands. Can also be useful for hearing impaired and deaf people.
Quasar Speech Api
⭐
54
🎤 🔉 A single-page application (SPA) built with Quasar Framework 1.0 and the Speech API that captures audio and converts it to text, or takes text as input and has the application speak it aloud.
Ai_webui
⭐
53
AI-WEBUI: A universal web interface for AI creation; an easy-to-use AI tool for processing images, audio, and video.
Nanoflow
⭐
53
PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity."
Echogarden
⭐
53
Integrated speech toolset designed to be accessible to end-users. Fully open-source.
Neon Tts Plugin Coqui
⭐
52
Coqui AI TTS plugin
Diffgan Tts
⭐
52
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Vae_tacotron2
⭐
51
VAE Tacotron 2, an alternative to GST Tacotron
Few Shot Transformer Tts
⭐
51
Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.
Sova Tts Engine
⭐
50
Tacotron2 based engine for the SOVA-TTS project
Cs224n Gpu That Talks
⭐
50
Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)
Tacotron
⭐
50
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Javascript30 Challenge
⭐
49
30 Day Vanilla JS Challenge. The idea comes from Wesbos; this repository documents the process of working through these challenges.
Nvda Ibmtts Driver
⭐
49
This project is aimed at developing and maintaining the NVDA IBMTTS driver. IBMTTS is a synthesizer similar to Eloquence. Please send your ideas and contributions here!
Mb Istft Vits2
⭐
49
Application of MB-iSTFT-VITS components to vits2_pytorch
Unitytts
⭐
49
Text to Speech in Unity.
Spokestack Android
⭐
49
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Msspeech
⭐
48
Unofficial API for Microsoft speech synthesis, as used by the Read Aloud feature of the Microsoft Edge web browser
Lvcnet
⭐
47
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
React Native Spokestack
⭐
46
Spokestack: give your React Native app a voice interface!
Voicekit Examples
⭐
46
Examples of how to use Tinkoff VoiceKit
Voice Privacy Challenge 2022
⭐
45
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
Zerospeech
⭐
44
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
Verbify Tts
⭐
44
Simple and free Text-to-Speech (TTS) engine that reads any text on your screen aloud, with high-quality voices powered by AI models.
Chatgpt Voice Control
⭐
43
Voice control for ChatGPT. Talk to ChatGPT and hear ChatGPT's responses in a natural voice.
Fre Gan Pytorch
⭐
43
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Lightspeech
⭐
42
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Source Filter Vae
⭐
42
Learning and controlling the source-filter representation of speech with a variational autoencoder
Ukrainian Tts Datasets
⭐
42
🇺🇦 Open Source Ukrainian Text-to-Speech datasets
Amazonspeechtranslator
⭐
42
End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Qppwg
⭐
41
Quasi-Periodic Parallel WaveGAN PyTorch implementation
Talkie
⭐
41
Text-to-speech browser extension button. Select text on any web page, and have the computer read it out loud for you by simply clicking the Talkie button.
Expressive Fastspeech2
⭐
41
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Tacotron2
⭐
40
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Text To Speech Api
⭐
40
Play.ht's Text to Speech API
Heartbeat Tutorials
⭐
40
Code for tutorials I have written for Heartbeat
Univnet Pytorch
⭐
39
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation
Aivoice
⭐
39
Deep CNN networks for Speech Synthesis
Sova Tts Tps
⭐
38
NLP-preprocessor for the SOVA-TTS project
Tts Arabic Pytorch
⭐
37
TTS models for Arabic (Tacotron2, FastPitch)
Klatt Syn
⭐
37
Klatt formant synthesizer
Nova Nodejs
⭐
37
NOVA is a customizable voice assistant made with Node.js.
Daft Exprt
⭐
36
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Tfgan
⭐
36
TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Elevenlabs Dotnet
⭐
36
An unofficial ElevenLabs RESTful API client for dotnet
Turkicasr
⭐
35
A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.
Qtspeech
⭐
35
QtSpeech is a cross-platform library based on Qt that provides a common API for accessing system TTS (Text-to-Speech) engines on platforms such as Windows (SAPI), Mac (SpeechSynthesis), and Linux (Festival). Licensed under the LGPL, so it can be used in both open-source and commercial products.
Wavegrad2
⭐
35
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Web Speech Cognitive Services
⭐
35
Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Ubisoft Laforge Daft Exprt
⭐
34
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Spoken Word
⭐
34
Spoken Word
Avocodo Pytorch
⭐
34
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
Unicats Ctx Txt2vec
⭐
34
[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS
Tts React
⭐
33
Convert text to speech using React.
Vectorquantizedcpc
⭐
33
Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion
Lexconvert
⭐
32
Convert phoneme codes and lexicon formats for English speech synths
Wiki2ssml
⭐
31
Wiki2SSML provides the WikiVoice markup language used for fine-tuning synthesised voice.
Voice Assistant Chatgpt
⭐
31
Voice Assistant based on Whisper ASR and ChatGPT API
Liee Diff Svc Ai
⭐
31
Voice model "LIEE" for DIFF-SVC by julieraptor
Articulatory
⭐
30
Deep Articulatory Synthesis and Inversion
Synthetic Voice Detection Vocoder Artifacts
⭐
30
Dataset and detection code for the paper "AI-Synthesized Voice Detection Using Neural Vocoder Artifacts", accepted at the CVPR Workshop on Media Forensics 2023.
Tts Tortoise Gradio
⭐
29
A Gradio setup for Tortoise TTS.
Speech
⭐
29
A library for using Web Speech API with Angular
Tinycog
⭐
29
Small Robot, Toy Robot platform
Sc Cnn
⭐
29
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
Elevenlabs
⭐
28
ElevenLabs Artificial Voice Synthesis Client
Hmm For Emo Tts
⭐
28
💻 A repository with comprehensive instructions for using the Festvox toolkit for generating Emotional speech 🔈 from text
Jsut Lab
⭐
28
HTS-style full-context labels for JSUT v1.1
Tiktok Tts
⭐
28
Provides a simple way to generate text-to-speech audio files using TikTok's text-to-speech (TTS) API in Node.js.
S2scyclegan
⭐
28
Attempt at speech2speech using CycleGAN
Talknet2 Pytorch
⭐
28
TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.
Tensorflow_wavenet_vocoder
⭐
27
WaveNet vocoder using TensorFlow
Robust Voice Style Transfer
⭐
27
Demo for ICASSP 2022
Langue
⭐
27
A modern platform for conlanging. Currently in the planning stage.
Izabela
⭐
26
Your speech assistant. Communicate with text-to-speech in games, on voice chat, on stream or simply on your speakers!
Voice_chatbot
⭐
26
Russian-language chatbot with speech recognition using PocketSphinx and speech synthesis using RHVoice. It uses the AttentionSeq2Seq model and is implemented with Python 3, TensorFlow, and Keras.
Spokestack Ios
⭐
26
Spokestack: give your iOS app a voice interface!
Dialog
⭐
26
A PyTorch implementation of a Japanese chatbot using BERT and a Transformer decoder
Glottdnn
⭐
26
GlottDNN vocoder and tools for training DNN excitation models
Tf Flowavenet
⭐
25
TensorFlow implementation of "FloWaveNet: A Generative Flow for Raw Audio"
Khronos
⭐
25
The open source intelligent personal assistant
Pythaitts
⭐
25
Open Source Thai Text-to-speech library in Python
Vaenar Tts
⭐
25
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
Daisy Openai Chat
⭐
25
Python platform for working with LLMs
Audiomae Pytorch
⭐
25
Unofficial PyTorch implementation of Masked Autoencoders that Listen
Istft Avocodo Pytorch
⭐
25
Ultrafast GAN based Vocoder for Text to Speech
Speech Training Recorder
⭐
24
Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesis.
Fcl Taco2
⭐
24
Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021
Phrasebook
⭐
24
⛩ 100% free Japanese Phrasebook app, built for travel and offline usage. Add it to your Home screen and access 670+ essential phrases in 19 topics. Requires no Internet connection and offers speech synthesis, so you know how to pronounce Japanese phrases correctly.
Robust_fine_grained_prosody_control
⭐
24
PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis
Extensibletts Pytorch
⭐
24
An extensible speech synthesis system built with PyTorch; the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery
Asterisk Voicekit Modules
⭐
23
Non-blocking Asterisk modules for accessing VoiceKit services for speech recognition and speech synthesis.
Vcvits
⭐
23
Non Parallel Voice Conversion based on VITS
Comprehensive Tacotron2
⭐
23
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS, along with several techniques to improve the robustness and efficiency of the model.
Realtimesingingsynthesizer
⭐
23
Live Coding Singing Synthesizer. Python sinsy-NG wrapper.
Speechsynthesisrecorder
⭐
22
Get audio output from a window.speechSynthesis.speak() call as an ArrayBuffer, AudioBuffer, Blob, MediaSource, MediaStream, ReadableStream, or other object or data type
Voice100
⭐
22
Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.
201-300 of 472 search results
Copyright 2018-2024 Awesome Open Source. All rights reserved.