Awesome Open Source

Programming Languages

Search results for speech recognition tts

speech-recognition x

81 search results found

Paddlespeech ⭐ 10,011

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

NeMo: a toolkit for conversational AI

Espnet ⭐ 7,563

End-to-End Speech Processing Toolkit

Silero Models ⭐ 4,088

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Awesome Speech Recognition Speech Synthesis Papers ⭐ 2,869

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Lingvo ⭐ 2,776

Open Speech Corpora ⭐ 830

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

an open-source implementation of sequence-to-sequence based speech processing engine

Irene Voice Assistant ⭐ 644

Ирина - русский голосовой ассистент для работы оффлайн. Поддерживает скиллы через плагины.

Alan Sdk Reactnative ⭐ 560

In-App assistant SDK to build a multimodal conversational UX for applications created with React Native (iOS, Android)

Tts Voice Wizard ⭐ 467

Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)

Ai Waifu Vtuber ⭐ 457

AI Vtuber for Streaming on Youtube/Twitch

Alan Sdk Pcf ⭐ 426

Build a voice assistant for any application created with Microsoft Power Apps

Deep learning for audio processing

Amica is an open source interface for interactive communication with 3D characters with voice synthesis and speech recognition.

Parrots ⭐ 318

Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese. 中文语音识别、文字转语音，基于语音库实现，易扩展。

Android Speech ⭐ 317

Android speech recognition and text to speech made easy

Langhelper ⭐ 292

Striving to create a great Application with full functions of learning languages by ChatGPT, TTS, STT and other awesome AI models, supports talking, speaking assessment, memorizing words with contexts, Listening test, so on.

Speech Recognition Uk ⭐ 262

Speech Recognition for Ukrainian

Livewhisper ⭐ 261

A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.

Jarvis Chatgpt ⭐ 242

A Conversational Assistant equipped with synthetic voices including J.A.R.V.I.S's. Powered by OpenAI and IBM Watson APIs and a Tacotron model for voice generation.

Gpt Voice Conversation Chatbot ⭐ 232

Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.

Speech_dataset ⭐ 229

The dataset of Speech Recognition

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

Lobe Tts ⭐ 182

🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser

Ueazspeech ⭐ 162

This plugin integrates Azure Speech Cognitive Services in Unreal Engine.

Interspeech2019 Tutorial ⭐ 160

INTERSPEECH 2019 Tutorial Materials

M.i.t.s.u.h.a. ⭐ 134

World's First Multilingual Inexpensive Therapeutic Sophisticated Ultra-responsive Holographic Agent. In simple terms, an AI you can talk to and it'll talk back with a body using VTube Studio.

Mongolian Nlp ⭐ 126

Useful resources for Mongolian NLP

Spokestack Python ⭐ 124

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.

Whispering Ui ⭐ 108

Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)

Awesome Russian Speech ⭐ 97

Russian speech technology links

Talk2gpt ⭐ 92

GPT-3 client for Windows and Unix with memories management that supports both text and speech in any language. Includes a free text2image

Simple Obs Stt ⭐ 91

Speech-to-text and keyboard input captions for OBS.

GPT-3 client for Windows and Unix with memories management that supports both text and speech in any language.

Mongolian Speech Recognition ⭐ 86

Mongolian speech recognition with PyTorch

Unsuperior Ai Waifu ⭐ 73

AI waifu that can run on your phone or PC

voice services stack from audio hardware through hotword, ASR, NLU, AI routing and TTS bound by messaging protocol over MQTT

Android Tts Stt ⭐ 55

One line solution for Android Text to speech(TTS) & Speech to Text(STT) translation problem

Spokestack Android ⭐ 49

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Gpt_chatbot ⭐ 47

This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses Pinecone to store long term information and retrieves it to create context. API keys for OpenAI and Pinecone required. Tested on Windows

React Native Spokestack ⭐ 46

Spokestack: give your React Native app a voice interface!

Py Nltools ⭐ 42

A collection of basic python modules for spoken natural language processing

Timething ⭐ 41

Timething is a library for aligning text transcripts with their audio recordings.

Yandex Speech ⭐ 38

node.js module for Yandex speech systems (ASR & TTS)

Companion for OSC and Communication

Termux Deepspeech ⭐ 33

Open source offline speech recognition for Android using Mozilla's DeepSpeech in Termux

Baiduasrandtts ⭐ 32

Using Baidu API. ASR: Automatic Speech Recognition;TTS: Text To Speech; 百度语音识别、语音合成API使用。

Program for speech recognition using the Google Speech API, voice commands, control your computer.

Daisy Openai Chat ⭐ 25

Python platform for working with LLMs

Gptspeaker ⭐ 25

The ChatGPT Voice Assistant uses a Raspberry Pi (or desktop) to enable spoken conversation with OpenAI large language models. This implementation listens to speech, processes the conversation through the OpenAI service, and responds back. Like Apple Siri, Amazon Alex, Google Nest Home, Mi XiaoAi etc.

Speech Training Recorder ⭐ 24

Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesis.

Voice100 ⭐ 22

Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.

Python Voice Assistant ⭐ 21

A Python based Voice Assistant like Siri

Opensource Voice Tools ⭐ 21

A repo listing known open source voice tools, ordered by where they sit in the voice stack

Tianbot Rosecho (Tianecho)，中文语音人机交互模块，支持ROS即插即用

Speech to text to speech using Elevenlabs

Nala_assistant ⭐ 18

🔊😊 A fastapi voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.

100% free, local & offline voice assistant with speech recognition

Tts_data_maker ⭐ 18

Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to speech .

Baidu_speech ⭐ 17

Home automation program controlled by your voice.

Whisper_android ⭐ 12

Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android

Voicetospeech ⭐ 11

Live speech recognition to synthesized speech with hundreds of voices, TTS, language auto-translation, and socket support in-browser.

13.3 Hours Chinese Mandarin Synthesis Corpus Female Emotional ⭐ 11

Chinese Mandarin Synthesis Corpus-Female/Emotional

Rw Deepspeech Api ⭐ 8

An end to end deep speech REST API containing speech to text and text speech services for Kinyarwanda.

Kaggle Ai ⭐ 8

Categorize AI problems and record through kaggle, Google's data science website

Android Sesli Haber ⭐ 8

DEPRECATED - This application is created by a group of student who finished Learn Android in 32 Days course.

Simplespeechloop ⭐ 8

A very basic demonstration connecting speech recognition and text-to-speech

Be_nlp_speech_resources ⭐ 8

Links to Belarusian NLP and Speech resources

React Native Spokestack Tray ⭐ 8

React Native component for adding Spokestack to a React Native app

Personal Intelligent Assistant (Persia), a simple "bot", or "assistant" which responds to several commands, using TTS and speech recognition, and also uses computer vision for face detection.

Speech Adapters ⭐ 6

Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech understanding

Python Virtual Assistant ⭐ 6

A simple python based virtual voice assistant that can take and execute commands

Real Time Translator ⭐ 6

A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Speech (gTTS) to play out the translation.

Speech2scpi ⭐ 6

Speech recognition and tts for your SCPI enabled oscilloscope

Alfred Project ⭐ 5

GPT-Powered AI Companion: Real-Time Whisper Transcription + CoquiTTS Human-like Text-to-Speech Responses

Small pyhon flask container allowing us to convert Text to Speech and Speech to Text

Capetangjs ⭐ 5

A JavaScript library for text to speech vice versa using Web Speech API

Cordova Plugin Speech ⭐ 5

cordova-plugin-speech

Meuxvtuber ⭐ 5

Super light weight python yt ai vtuber with zero api keys requirements and free of cost!

Personal Assistant :Implemention of OpenAI GPT3.5 Cross Platform

Tor Speech ⭐ 5

🔉 Yandex & Google + Tor

Related Searches

Python Speech Recognition (876)

Python Tts (595)

1-81 of 81 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.