Awesome Open Source

Programming Languages

Search results for speech synthesis

speech-synthesis x

472 search results found

An 16kHz implementation of HiFi-GAN for soft-vc.

Web Speech Angular ⭐ 56

A Web Application that implements Speech Recognition and Speech Synthesis using Web APIs, Angular, TypeScript, RxJS, and Angular Material

Fastvocoder ⭐ 56

Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.

Discordearsbot ⭐ 56

A speech-to-text framework and bot for Discord. Take control of your Discord server using speech and voice commands. Can also be useful for hearing impaired and deaf people.

Quasar Speech Api ⭐ 54

🎤 🔉 Projeto de um SPA desenvolvido com Quasar Framework 1.0 + Speech API para capturar áudio e transformar em texto, ou utilizar um texto como base para a aplicação emitir um áudio.

Ai_webui ⭐ 53

AI-WEBUI: A universal web interface for AI creation, 一款好用的图像、音频、视频AI处理工具

Nanoflow ⭐ 53

PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity."

Echogarden ⭐ 53

Integrated speech toolset designed to be accessible to end-users. Fully open-source.

Neon Tts Plugin Coqui ⭐ 52

Coqui AI TTS plugin

Diffgan Tts ⭐ 52

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Vae_tacotron2 ⭐ 51

VAE Tacotron 2, an alternative of GST Tacotron

Few Shot Transformer Tts ⭐ 51

Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.

Sova Tts Engine ⭐ 50

Tacotron2 based engine for the SOVA-TTS project

Cs224n Gpu That Talks ⭐ 50

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Tacotron ⭐ 50

A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis

Javascript30 Challenge ⭐ 49

30 Day Vanilla JS Challenge, This idea comes from Wesbos. here to document the process of these challenges.

Nvda Ibmtts Driver ⭐ 49

This project is aimed at developing and maintaining the NVDA IBMTTS driver. IBMTTS is a synthesizer similar to Eloquence. Please send your ideas and contributions here!

Mb Istft Vits2 ⭐ 49

Application of MB-iSTFT-VITS components to vits2_pytorch

Unitytts ⭐ 49

Text to Speech in Unity.

Spokestack Android ⭐ 49

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Msspeech ⭐ 48

not official API for Microsoft speech synthesis from Microsoft Edge web browser read aloud

LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation

React Native Spokestack ⭐ 46

Spokestack: give your React Native app a voice interface!

Voicekit Examples ⭐ 46

Examples on how to use Tinkoff Voicekit

Voice Privacy Challenge 2022 ⭐ 45

Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software

Zerospeech ⭐ 44

VQ-VAE for Acoustic Unit Discovery and Voice Conversion

Verbify Tts ⭐ 44

Simple and free Text-to-Speech (TTS) engine that reads for you any text on your screen with high-quality voices powered by AI models.

Chatgpt Voice Control ⭐ 43

Voice control for ChatGPT. Talk to ChatGPT and hear ChatGPT's responses in a natural voice.

Fre Gan Pytorch ⭐ 43

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Lightspeech ⭐ 42

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Source Filter Vae ⭐ 42

Learning and controlling the source-filter representation of speech with a variational autoencoder

Ukrainian Tts Datasets ⭐ 42

🇺🇦 Open Source Ukrainian Text-to-Speech datasets

Amazonspeechtranslator ⭐ 42

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Quasi-Periodic Parallel WaveGAN Pytorch implementation

Text-to-speech browser extension button. Select text on any web page, and have the computer read it out loud for you by simply clicking the Talkie button.

Expressive Fastspeech2 ⭐ 41

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Tacotron2 ⭐ 40

A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".

Text To Speech Api ⭐ 40

Play.ht's Text to Speech API

Heartbeat Tutorials ⭐ 40

Code for tutorials I have written for Heartbeat

Univnet Pytorch ⭐ 39

UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation

Deep CNN networks for Speech Synthesis

Sova Tts Tps ⭐ 38

NLP-preprocessor for the SOVA-TTS project

Tts Arabic Pytorch ⭐ 37

TTS models for Arabic (Tacotron2, FastPitch)

Klatt Syn ⭐ 37

Klatt formant synthesizer

Nova Nodejs ⭐ 37

NOVA is a customizable voice assistant made with Node.js.

Daft Exprt ⭐ 36

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

Elevenlabs Dotnet ⭐ 36

A Non-Official ElevenLabs RESTful API Client for dotnet

Turkicasr ⭐ 35

A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.

Qtspeech ⭐ 35

QtSpeech is cross-platform library based on Qt to provide common cross-platform API to access and use system TTS (Text-to-Speech) engines on platforms as Windows (SAPI), Mac (SpeechSynthesis) and Linux (Festival). Licensed as LGPL, so can be used on OpenSource and Commercial products

Wavegrad2 ⭐ 35

PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis

Web Speech Cognitive Services ⭐ 35

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Ubisoft Laforge Daft Exprt ⭐ 34

PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis

Spoken Word ⭐ 34

Avocodo Pytorch ⭐ 34

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Unicats Ctx Txt2vec ⭐ 34

[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS

Tts React ⭐ 33

Convert text to speech using React.

Vectorquantizedcpc ⭐ 33

Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion

Lexconvert ⭐ 32

Convert phoneme codes and lexicon formats for English speech synths

Wiki2ssml ⭐ 31

Wiki2SSML provides the WikiVoice markup language used for fine-tuning synthesised voice.

Voice Assistant Chatgpt ⭐ 31

Voice Assistant based on Whisper ASR and ChatGPT API

Liee Diff Svc Ai ⭐ 31

Voice model "LIEE" for DIFF-SVC by julieraptor

Articulatory ⭐ 30

Deep Articulatory Synthesis and Inversion

Synthetic Voice Detection Vocoder Artifacts ⭐ 30

This repository is related to our Dataset and Detection code from the paper: AI-Synthesized Voice Detection Using Neural Vocoder Artifacts accepted in CVPR Workshop on Media Forensic 2023.

Tts Tortoise Gradio ⭐ 29

A Gradio setup for Tortoise TTS.

A library for using Web Speech API with Angular

Small Robot, Toy Robot platform

SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems

Elevenlabs ⭐ 28

ElevenLabs Artificial Voice Synthesis Client

Hmm For Emo Tts ⭐ 28

💻 A repository with comprehensive instructions for using the Festvox toolkit for generating Emotional speech 🔈 from text

Jsut Lab ⭐ 28

HTS-style full-context labels for JSUT v1.1

Tiktok Tts ⭐ 28

Provides a simple way to generate text-to-speech audio files using TikTok's text-to-speech (TTS) API in Node.js.

S2scyclegan ⭐ 28

Attempt at speech2speech using CycleGAN

Talknet2 Pytorch ⭐ 28

TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.

Tensorflow_wavenet_vocoder ⭐ 27

wavenet vocoder using tensorflow

Robust Voice Style Transfer ⭐ 27

Demo for 2022 ICASSP

A modern platform for conlanging. Currently in the planning stage.

Your speech assistant. Communicate with text-to-speech in games, on voice chat, on stream or simply on your speakers!

Voice_chatbot ⭐ 26

Chatbot in russian with speech recognition using PocketSphinx and speech synthesis using RHVoice. The AttentionSeq2Seq model is used. Imlemented using Python3+TensorFlow+Keras.

Spokestack Ios ⭐ 26

Spokestack: give your iOS app a voice interface!

A PyTorch Implementation of japanese chatbot using BERT and Transformer's decoder

Glottdnn ⭐ 26

GlottDNN vocoder and tools for training DNN excitation models

Tf Flowavenet ⭐ 25

Tensorflow implementation of "FloWaveNet: A Generative Flow for Raw Audio"

The open source intelligent personal assistant

Pythaitts ⭐ 25

Open Source Thai Text-to-speech library in Python

Vaenar Tts ⭐ 25

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Daisy Openai Chat ⭐ 25

Python platform for working with LLMs

Audiomae Pytorch ⭐ 25

Unofficial PyTorch implementation of Masked Autoencoders that Listen

Istft Avocodo Pytorch ⭐ 25

Ultrafast GAN based Vocoder for Text to Speech

Speech Training Recorder ⭐ 24

Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesis.

Fcl Taco2 ⭐ 24

Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021

Phrasebook ⭐ 24

⛩ 100% free Japanese Phrasebook app, built for travel and offline usage. Add it to your Home screen and access 670+ essential phrases in 19 topics. Requires no Internet connection and offers speech synthesis, so you know how to pronounce Japanese phrases correctly.

Robust_fine_grained_prosody_control ⭐ 24

PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis

Extensibletts Pytorch ⭐ 24

An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery

Asterisk Voicekit Modules ⭐ 23

Non-blocking Asterisk modules for accessing VoiceKit services for speech recognition and speech synthesis.

Non Parallel Voice Conversion based on VITS

Comprehensive Tacotron2 ⭐ 23

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Realtimesingingsynthesizer ⭐ 23

Live Coding Singing Synthesizer. Python sinsy-NG wrapper.

Speechsynthesisrecorder ⭐ 22

Get audio output from window.speechSynthesis.speak() call as ArrayBuffer, AudioBuffer, Blob, MediaSource, MediaStream, ReadableStream, other object or data types

Voice100 ⭐ 22

Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.

201-300 of 472 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.