Awesome Open Source
Search results for python voice recognition
106 search results found
Speechbrain
⭐
7,166
A PyTorch-based Speech Toolkit
Vosk Api
⭐
6,633
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Silero Vad
⭐
2,339
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Python Ai Assistant
⭐
812
Python AI assistant 🧠
Mycroft Precise
⭐
749
A lightweight, simple-to-use, RNN wake word listener
Rhino
⭐
576
On-device Speech-to-Intent engine powered by deep learning
Speech To Text Benchmark
⭐
570
speech to text benchmark framework
Voiceprintrecognition Pytorch
⭐
540
This project uses a variety of advanced voiceprint recognition models, including EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and more models may be supported in the future. It also supports the MelSpectrogram and Spectrogram data preprocessing methods.
Cheetah
⭐
537
On-device streaming speech-to-text engine powered by deep learning
Wunjo.wladradchenko.ru
⭐
509
Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.
Whisperlive
⭐
505
A nearly-live implementation of OpenAI's Whisper.
Picovoice
⭐
449
On-device voice assistant platform powered by deep learning
Leopard
⭐
390
On-device speech-to-text engine powered by deep learning
Voicebook
⭐
325
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
Caster
⭐
317
Dragonfly-Based Voice Programming and Accessibility Toolkit
Gesture Controlled Virtual Mouse
⭐
301
Virtually controlling computer using hand-gestures and voice commands. Using MediaPipe, OpenCV Python.
Vosk
⭐
287
VOSK Speech Recognition Toolkit
Voiceprintrecognition Tensorflow
⭐
273
Voiceprint recognition implemented with TensorFlow
Livewhisper
⭐
261
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
Gpt Voice Conversation Chatbot
⭐
232
Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.
Customizable Gpt Chatbot
⭐
186
A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs, documents, websites, and YouTube videos. Leveraging OpenAI's GPT-3.5, Pinecone, FAISS, and Celery for seamless integration and performance.
Speaker Recognition Py3
⭐
179
Speech recognition based on MFCC and GMM (Gaussian mixture models)
Voiceprintrecognition Paddlepaddle
⭐
173
This project uses a variety of advanced voiceprint recognition models, including EcapaTdnn, ResNetSE, ERes2Net, and CAM++ ...
Pitranslate
⭐
140
Raspberry Pi Translation Tool
Spokestack Python
⭐
124
Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.
At16k
⭐
123
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
Myprosody
⭐
97
A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to those of native speech.
Chatgpt Voice Assistant
⭐
87
A chatbot that uses speech to text for input, sends the text to OpenAI's ChatGPT text generation model and speaks the response using text to speech.
Voiceprintrecognition Keras
⭐
84
Voiceprint recognition model implemented with Keras
Pyautosrt
⭐
77
PySimpleGUI based DESKTOP APP to AUTO GENERATE SUBTITLE FILE (using free Google Speech Recognition API) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
Streamassist
⭐
72
Home Assistant custom component that allows you to turn almost any camera and almost any speaker into a local voice assistant
Whisper_dictation
⭐
67
Fast! Offline, privacy-focused, hands-free voice typing, 2-way AI voice chat, with images, voice control, in under 4 GiB of VRAM.
Kobold_assistant
⭐
62
Like ChatGPT's voice chat, but entirely offline/private, using local models such as Llama 2 and Whisper
Asr_benchmark
⭐
60
Program to benchmark various speech recognition APIs
Chatgpt_wechat
⭐
48
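Connects an unverified WeChat official account to ChatGPT, with newly added voice chat (English conversation). Based on Flask; implements a personal WeChat official account [no...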
Gpt_chatbot
⭐
47
This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses Pinecone to store long term information and retrieves it to create context. API keys for OpenAI and Pinecone required. Tested on Windows
Python Assistant
⭐
47
Python Assistant (PA) is a voice command based assistant service written in Python 3.9+. It can recognize human speech or voice, talk to user and execute basic commands.
Asrecognition
⭐
47
ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
Autosrt
⭐
41
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Google Speech Recognition API) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
Banglaspeech2text
⭐
38
BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned on the latest whisper speech to text model for optimal performance.
Assistant
⭐
37
An intelligent AI assistant that can do anything!
Wavencoder
⭐
36
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
Chatgpt Voice Chatbot Telegram
⭐
36
ChatGPT Voice Chatbot Telegram is a Python and Flask-based GitHub repository that enables users to communicate with an AI chatbot using voice-to-text and text-to-voice technologies powered by OpenAI. The repository provides a flexible and customizable solution for building advanced voice-enabled chatbots using natural language processing.
Octopus
⭐
34
On-device Speech-to-Index engine powered by deep learning
Voce Browser
⭐
33
Voice Controlled Chromium Web Browser
Nala
⭐
30
🦁 Nala is an agile open-source voice assistant framework (20+ actions).
M.i.l.e.s
⭐
30
M.I.L.E.S is a voice assistant powered by GPT-4-Turbo. Miles can change his own system prompt, change his own model, play Spotify, use Spotify controls, change the system volume and Spotify volume, use a calculator, search for the weather anywhere on Earth, get the date and time, store permanent memories, and speak to you naturally. macOS only.
Youtranslate
⭐
28
Takes a YouTube video, clones the voice, and re-creates that video in a different language
Freespeech Vr
⭐
25
speaker-independent voice recognition with dynamic language learning
Autosub
⭐
22
GUI utility to transcribe/translate from video/audio/subtitles to subtitles
Spotify Voice Control
⭐
22
Voice control for spotify through the terminal
Dtw_digital_voice_recognition
⭐
21
Speech recognition of the digits 0-9 based on DTW and MFCC features. DTW, MFCC, speech recognition, Chinese and English data, endpoint detection, D Voice Recognition.
Alexis
⭐
20
Alexis is an Open Source command line Robot butler!
Lie_to_me
⭐
18
Lie detection using facial and voice recognition
Realtalk
⭐
16
First Place Winner at Delta Hacks 5
Autosrt
⭐
16
Offline srt producer gui with whisper.cpp
Dvoice
⭐
16
Dvoice is a speech recognition tool for dialects and under-represented languages.
Greenkey Discovery Sdk
⭐
15
Speed up business workflows through custom 'voice skills' and text (NLP) interpreters
Pyvosklivesubtitle
⭐
15
PySimpleGUI based DESKTOP APP that can RECOGNIZE any live stream in the 23 languages supported by VOSK, then TRANSLATE it (using unofficial online Google Translate API) and display it as LIVE CAPTION / LIVE SUBTITLE
Movie2comic
⭐
15
A tool to convert movies into comics using keyframe extraction, voice recognition, and style transfer techniques.
Voice_gender_detection
⭐
14
♂️♀️ Detect a person's gender from a voice file. Achieves 90.7% +/- 1.3% accuracy.
Power Ki
⭐
14
POWER-KI programming language for Intelligent Applications (IA)
All In
⭐
13
Poker and BlackJack project (french speech recognition + statistics)
Speech To Intent Benchmark
⭐
12
benchmark for Speech-to-Intent engines
Olami Api Quickstart Python Samples
⭐
12
OLAMI API Quickstart Python Samples
Voicebot
⭐
11
A simple telegram bot to recognize lengthy voice files to text and vice versa with multiple language support.
Python Sphinx Listen
⭐
11
Simple Python interface for voice recognition using cmuSphinx and gstreamer.
Audioset_models
⭐
11
📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
Practice Python
⭐
10
Coursera Courses and practice in Python
Zara Desktopassistant
⭐
10
A personal voice desktop assistant with designed GUI. Created and Tested on LINUX.
Happearth
⭐
10
Software prototype of a home energy monitor. Visit it at https://happearth.herokuapp.com
Etos Keywordspotting
⭐
10
PyTorch implementations of neural network models for keyword spotting
Voice Assistant
⭐
10
A step by step tutorial on building a voice-based assistant using python.
Okdocker
⭐
9
Voice recognition inside docker container
Cnn Type Models 4 Noise Voice Recognition
⭐
9
Real-time Noise-voice recognition task with CNN-type models
Sigh
⭐
9
background voice detection program that listens for a wake word and activates transcription mode
Gmm_digital_voice_recognition
⭐
8
Speech recognition of the digits 0-9 based on GMM and MFCC features. GMM, MFCC, speech recognition, Chinese data, sklear Voice Recognition.
Multi Language Rtvc
⭐
8
User-friendly Multi-Language Voice Cloning Application
Ttoggle_it
⭐
8
Short News 📰 | API 🍏 | Live news is taken from the newsapi organization.
Eva
⭐
8
Open source voice-enabled personal assistant
Whisper_autosrt
⭐
8
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using faster_whisper module which is a reimplementation of OpenAI Whisper module) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
Speaker Identification
⭐
7
Speaker Identification using Neural Net.
Ai_voice_assistant_tg_bot
⭐
7
Telegram bot that can receive text and respond with ChatGPT, and also parse voice messages and respond in the same language.
Desktop Assistant
⭐
7
During the lockdown period I used my Python skills to make a Desktop Assistant. It is similar to Google Assistant, which we use on our phones, and it works from the user's voice commands, so you can control your system by voice. It can: 1. Open and close any application on the system. 2. Search for anything on Google or YouTube. 3. Speak the time and date. 4. Send email through voice commands. 5. Play or stop music on the system. 6. Solve algebraic and mathematical problems. 7. Restart, Sleep or Sh
Glados Voice Assistant
⭐
7
GLaDOS Terminal-based AI Assistant
Zakas
⭐
6
🤖 A desktop Siri-like voice manager bot, to automate your daily routine.
Syntaviz
⭐
6
A visualization interface for analyzing a (very large) corpus of natural-language queries.
Rpizero_relay
⭐
6
Raspberry Pi Zero voice controlled relay switch
Python Virtual Assistant
⭐
6
A simple python based virtual voice assistant that can take and execute commands
Novelai Voice Chat
⭐
6
Voice chat with your AI companion using local Whisper and NovelAI
Automatic_speaker_recognition
⭐
6
A repo for the USTH Digital Signal Processing 2020 Group 3 project. It's quite obvious in the title.
Android Autosrt V2
⭐
6
ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/video files using 2 ACTIVITIES
Alexatutorial
⭐
6
This is a tutorial for creating a basic Alexa skill. (No Alexa Required)
Diy Voice Assistant
⭐
6
Configurations and scripts for building your own voice assistant based on Rhasspy and Node-RED
Voice_assistant
⭐
5
Basic voice assistant with voice commands. Feel free to read the README.txt for instructions on how to add your own commands
Transcribevoicefile2text
⭐
5
Transcribe Voice File to Text
Voice Command Assistant
⭐
5
Powerful assistant performing powerful automated tasks from user’s voice inputs. Developed using machine learning and speech synthesis Python frameworks.
Vosk_autosrt
⭐
5
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Vosk Speech Recognition API) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
Ameli Ai
⭐
5
Ameli, a cross platform personal voice assistant for Windows/Linux/MacOS/Android/iOS
Olami Python Voice Kit
⭐
5
OLAMI Voice Kit - Python sample codes for the voice-based assistant robot
1-100 of 106 search results
Copyright 2018-2024 Awesome Open Source. All rights reserved.