Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for speech recognition tts
speech-recognition
x
tts
x
81 search results found
Paddlespeech
⭐
10,011
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Nemo
⭐
9,041
NeMo: a toolkit for conversational AI
Espnet
⭐
7,563
End-to-End Speech Processing Toolkit
Silero Models
⭐
4,088
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Awesome Speech Recognition Speech Synthesis Papers
⭐
2,869
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Lingvo
⭐
2,776
Lingvo
Open Speech Corpora
⭐
830
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Athena
⭐
821
an open-source implementation of sequence-to-sequence based speech processing engine
Irene Voice Assistant
⭐
644
Ирина - русский голосовой ассистент для работы оффлайн. Поддерживает скиллы через плагины.
Alan Sdk Reactnative
⭐
560
In-App assistant SDK to build a multimodal conversational UX for applications created with React Native (iOS, Android)
Tts Voice Wizard
⭐
467
Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)
Ai Waifu Vtuber
⭐
457
AI Vtuber for Streaming on Youtube/Twitch
Alan Sdk Pcf
⭐
426
Build a voice assistant for any application created with Microsoft Power Apps
Dla
⭐
421
Deep learning for audio processing
Amica
⭐
325
Amica is an open source interface for interactive communication with 3D characters with voice synthesis and speech recognition.
Parrots
⭐
318
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese. 中文语音识别、文字转语音,基于语音库实现,易扩展。
Android Speech
⭐
317
Android speech recognition and text to speech made easy
Langhelper
⭐
292
Striving to create a great Application with full functions of learning languages by ChatGPT, TTS, STT and other awesome AI models, supports talking, speaking assessment, memorizing words with contexts, Listening test, so on.
Speech Recognition Uk
⭐
262
Speech Recognition for Ukrainian
Livewhisper
⭐
261
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
Jarvis Chatgpt
⭐
242
A Conversational Assistant equipped with synthetic voices including J.A.R.V.I.S's. Powered by OpenAI and IBM Watson APIs and a Tacotron model for voice generation.
Gpt Voice Conversation Chatbot
⭐
232
Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.
Speech_dataset
⭐
229
The dataset of Speech Recognition
Dsnote
⭐
225
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
Lobe Tts
⭐
182
🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser
Ueazspeech
⭐
162
This plugin integrates Azure Speech Cognitive Services in Unreal Engine.
Interspeech2019 Tutorial
⭐
160
INTERSPEECH 2019 Tutorial Materials
M.i.t.s.u.h.a.
⭐
134
World's First Multilingual Inexpensive Therapeutic Sophisticated Ultra-responsive Holographic Agent. In simple terms, an AI you can talk to and it'll talk back with a body using VTube Studio.
Mongolian Nlp
⭐
126
Useful resources for Mongolian NLP
Spokestack Python
⭐
124
Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.
Whispering Ui
⭐
108
Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)
Awesome Russian Speech
⭐
97
Russian speech technology links
Talk2gpt
⭐
92
GPT-3 client for Windows and Unix with memories management that supports both text and speech in any language. Includes a free text2image
Simple Obs Stt
⭐
91
Speech-to-text and keyboard input captions for OBS.
Gptalk
⭐
88
GPT-3 client for Windows and Unix with memories management that supports both text and speech in any language.
Mongolian Speech Recognition
⭐
86
Mongolian speech recognition with PyTorch
Unsuperior Ai Waifu
⭐
73
AI waifu that can run on your phone or PC
Hermod
⭐
59
voice services stack from audio hardware through hotword, ASR, NLU, AI routing and TTS bound by messaging protocol over MQTT
Android Tts Stt
⭐
55
One line solution for Android Text to speech(TTS) & Speech to Text(STT) translation problem
Spokestack Android
⭐
49
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Gpt_chatbot
⭐
47
This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses Pinecone to store long term information and retrieves it to create context. API keys for OpenAI and Pinecone required. Tested on Windows
React Native Spokestack
⭐
46
Spokestack: give your React Native app a voice interface!
Py Nltools
⭐
42
A collection of basic python modules for spoken natural language processing
Timething
⭐
41
Timething is a library for aligning text transcripts with their audio recordings.
Yandex Speech
⭐
38
node.js module for Yandex speech systems (ASR & TTS)
Hoscy
⭐
33
Companion for OSC and Communication
Termux Deepspeech
⭐
33
Open source offline speech recognition for Android using Mozilla's DeepSpeech in Termux
Baiduasrandtts
⭐
32
Using Baidu API. ASR: Automatic Speech Recognition;TTS: Text To Speech; 百度语音识别、语音合成API使用。
Mspeech
⭐
26
Program for speech recognition using the Google Speech API, voice commands, control your computer.
Daisy Openai Chat
⭐
25
Python platform for working with LLMs
Gptspeaker
⭐
25
The ChatGPT Voice Assistant uses a Raspberry Pi (or desktop) to enable spoken conversation with OpenAI large language models. This implementation listens to speech, processes the conversation through the OpenAI service, and responds back. Like Apple Siri, Amazon Alex, Google Nest Home, Mi XiaoAi etc.
Speech Training Recorder
⭐
24
Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesis.
Voice100
⭐
22
Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.
Python Voice Assistant
⭐
21
A Python based Voice Assistant like Siri
Opensource Voice Tools
⭐
21
A repo listing known open source voice tools, ordered by where they sit in the voice stack
Rosecho
⭐
20
Tianbot Rosecho (Tianecho),中文语音人机交互模块,支持ROS即插即用
Echo Xi
⭐
19
Speech to text to speech using Elevenlabs
Nala_assistant
⭐
18
🔊😊 A fastapi voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.
Alts
⭐
18
100% free, local & offline voice assistant with speech recognition
Tts_data_maker
⭐
18
Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to speech .
Baidu_speech
⭐
17
K.a.i
⭐
15
Home automation program controlled by your voice.
Whisper_android
⭐
12
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
Voicetospeech
⭐
11
Live speech recognition to synthesized speech with hundreds of voices, TTS, language auto-translation, and socket support in-browser.
13.3 Hours Chinese Mandarin Synthesis Corpus Female Emotional
⭐
11
Chinese Mandarin Synthesis Corpus-Female/Emotional
Rw Deepspeech Api
⭐
8
An end to end deep speech REST API containing speech to text and text speech services for Kinyarwanda.
Kaggle Ai
⭐
8
Categorize AI problems and record through kaggle, Google's data science website
Android Sesli Haber
⭐
8
DEPRECATED - This application is created by a group of student who finished Learn Android in 32 Days course.
Simplespeechloop
⭐
8
A very basic demonstration connecting speech recognition and text-to-speech
Be_nlp_speech_resources
⭐
8
Links to Belarusian NLP and Speech resources
React Native Spokestack Tray
⭐
8
React Native component for adding Spokestack to a React Native app
Persia
⭐
6
Personal Intelligent Assistant (Persia), a simple "bot", or "assistant" which responds to several commands, using TTS and speech recognition, and also uses computer vision for face detection.
Speech Adapters
⭐
6
Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech understanding
Python Virtual Assistant
⭐
6
A simple python based virtual voice assistant that can take and execute commands
Real Time Translator
⭐
6
A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Speech (gTTS) to play out the translation.
Speech2scpi
⭐
6
Speech recognition and tts for your SCPI enabled oscilloscope
Alfred Project
⭐
5
GPT-Powered AI Companion: Real-Time Whisper Transcription + CoquiTTS Human-like Text-to-Speech Responses
Tts Stt
⭐
5
Small pyhon flask container allowing us to convert Text to Speech and Speech to Text
Capetangjs
⭐
5
A JavaScript library for text to speech vice versa using Web Speech API
Cordova Plugin Speech
⭐
5
cordova-plugin-speech
Meuxvtuber
⭐
5
Super light weight python yt ai vtuber with zero api keys requirements and free of cost!
Aigen
⭐
5
Personal Assistant :Implemention of OpenAI GPT3.5 Cross Platform
Tor Speech
⭐
5
🔉 Yandex & Google + Tor
Related Searches
Python Speech Recognition (876)
Python Tts (595)
1-81 of 81 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.