Awesome Open Source
Search results for artificial intelligence speech recognition
artificial-intelligence
speech-recognition
46 search results found
Leon
⭐
13,937
🧠 Leon is your open-source personal assistant.
Nemo
⭐
9,041
NeMo: a toolkit for conversational AI
Asrt_speechrecognition
⭐
7,253
A Deep-Learning-Based Chinese Speech Recognition System
Openvino
⭐
5,316
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Alan Sdk Web
⭐
2,377
Actionable AI SDK for Web to enable text and voice conversations with actions (JavaScript, React, Angular, Vue, Ember, Electron)
Alan Sdk Ios
⭐
1,909
Actionable AI SDK for iOS to enable text and voice conversations with actions (Swift, Objective-C)
Alan Sdk Flutter
⭐
1,742
Conversational AI SDK for Flutter to build AI-powered voice assistants for Flutter applications (iOS and Android)
Alan Sdk Android
⭐
1,732
Conversational AI SDK for Android to build AI-powered voice assistants for Android applications (Java, Kotlin)
Alan Sdk Ionic
⭐
1,515
In-App assistant SDK to build a multimodal conversational UX for applications created with Ionic (React, Angular, Vue)
Ios_ml
⭐
1,406
List of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.
Dragonfire
⭐
1,294
The open-source virtual assistant for Ubuntu-based Linux distributions
Alan Sdk Cordova
⭐
1,070
In-App assistant SDK to build a multimodal conversational UX for Apache Cordova applications
Quillman
⭐
880
A chat app that transcribes audio in real-time, streams back a response from a language model, and synthesizes this response as natural-sounding speech.
Lhotse
⭐
794
Tools for handling speech data in machine learning projects.
Sincnet
⭐
764
SincNet is a neural architecture for efficiently processing raw audio samples.
Alan Sdk Reactnative
⭐
560
In-App assistant SDK to build a multimodal conversational UX for applications created with React Native (iOS, Android)
Storytoolkitai
⭐
504
An editing tool that uses AI to transcribe, understand content and search for anything in your footage, integrated with ChatGPT and other AI models
Whishper
⭐
443
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
Alan Sdk Pcf
⭐
426
Build a voice assistant for any application created with Microsoft Power Apps
Amica
⭐
325
Amica is an open source interface for interactive communication with 3D characters with voice synthesis and speech recognition.
Edenai Apis
⭐
313
Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
Bmlist
⭐
297
A List of Big Models
Langhelper
⭐
292
Aims to be a full-featured language-learning application built on ChatGPT, TTS, STT, and other AI models; supports conversation, speaking assessment, memorizing words in context, listening tests, and so on.
Nonautoreggenprogress
⭐
290
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
Livewhisper
⭐
261
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
Jarvis Chatgpt
⭐
242
A Conversational Assistant equipped with synthetic voices including J.A.R.V.I.S's. Powered by OpenAI and IBM Watson APIs and a Tacotron model for voice generation.
Gpt Voice Conversation Chatbot
⭐
232
Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.
Voicestreamai
⭐
222
Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS
Whisper_dart
⭐
220
Speech recognition in Dart; supports all audio formats, server side and client side, and all languages; CPU only.
Ai Audio Datasets
⭐
199
This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.
Voice_activity_detection
⭐
171
Voice Activity Detection based on Deep Learning & TensorFlow
Ollama Voice Mac
⭐
165
Mac compatible Ollama Voice
Rnnt Speech Recognition
⭐
152
End-to-end speech recognition using RNN Transducers in Tensorflow 2.0
Zzz Retired__openstt
⭐
146
RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Synthalingua
⭐
144
Synthalingua - Real Time Translation
M.i.t.s.u.h.a.
⭐
134
World's First Multilingual Inexpensive Therapeutic Sophisticated Ultra-responsive Holographic Agent. In simple terms, an AI you can talk to and it'll talk back with a body using VTube Studio.
Persephone
⭐
131
A tool for automatic phoneme transcription
Awesome Ai Services
⭐
127
An overview of the AI-as-a-service landscape
Cep
⭐
120
CEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.
Whispering Ui
⭐
108
Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)
Zeta
⭐
106
Build high-performance AI models with modular building blocks
Laibot Client
⭐
92
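Open-source AI: build voice dialogue robots, smart speakers, and more on open-source software and hardware. Human-machine conversation, natural interaction: Laibot (来宝) has unlimited possibilities.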
Talk2gpt
⭐
92
GPT-3 client for Windows and Unix with memory management that supports both text and speech in any language. Includes a free text2image
Chrome Web Speech Api
⭐
90
Chrome Web Speech API
Gptalk
⭐
88
GPT-3 client for Windows and Unix with memory management that supports both text and speech in any language.
Emeltal
⭐
83
Local ML voice chat using high-end models.
Deepgram Js Sdk
⭐
81
Official JavaScript SDK for Deepgram's automated speech recognition APIs.
Unsuperior Ai Waifu
⭐
73
AI waifu that can run on your phone or PC
Awesome Openai Whisper
⭐
72
A curated list of awesome OpenAI Whisper resources
Speech_ai
⭐
68
Speech to speech bot built with Python
Python Deep Learning Projects
⭐
67
Codebase for my book "Python DeepLearning Projects" | Learn applied deep learning for various use-cases on NLP, CV and ASR using TensorFlow and Keras.
Whisper_dictation
⭐
67
Fast! Offline, privacy-focused, hands-free voice typing, 2-way AI voice chat, with images, voice control, in under 4 GiB of VRAM.
Ai Study
⭐
63
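A comprehensive collection of AI learning materials, covering machine learning basics (ML), deep learning basics (DL), computer vision (CV), and natural language processing (NLP)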
Max Speech To Text Converter
⭐
60
Converts spoken words into text form.
Ai With Python Series
⭐
53
A Python Series of tutorials aimed at learning Artificial Intelligence concepts. This series of tutorials start from the basics of Python and builds on top of it. We will cover three full-fledged case studies to practice AI Implementation of Python with real data and solve real-world problems.
Ai_webui
⭐
53
AI-WEBUI: A universal web interface for AI creation; an easy-to-use AI processing tool for images, audio, and video
Keras Sincnet
⭐
49
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Gpt_chatbot
⭐
47
This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to respond with a voice. It uses Pinecone to store long term information and retrieves it to create context. API keys for OpenAI and Pinecone required. Tested on Windows
Waifu Otw
⭐
44
❤️ Waifu On The Web: A web-based Artificial Intelligence with Natural Language Processing and a Live2D model
React.ai
⭐
39
Recognizes your speech, and a trained AI bot responds (e.g. customer service, personal assistant), using machine learning APIs (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, and Redux
Vocalforge
⭐
39
Your one-stop solution for voice dataset creation
Nova Nodejs
⭐
37
NOVA is a customizable voice assistant made with Node.js.
Salutejs
⭐
35
SmartApp Framework for building skills for the Salute (Салют) family of virtual assistants in JavaScript
Deep Learning And Paper
⭐
33
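[For discussion and study purposes only] Machine intelligence: related books and classic papers, including AutoML, sentiment classification, speech recognition, voiceprint recognition,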
Nodejs Whisper
⭐
31
Introducing NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++ by ggerganov.
Aniemore
⭐
28
Emotions recognition from audio and text files (only russian language)
Multi Hotword_spotting
⭐
28
Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?
Daisy Openai Chat
⭐
25
Python platform for working with LLMs
Gptspeaker
⭐
25
The ChatGPT Voice Assistant uses a Raspberry Pi (or desktop) to enable spoken conversation with OpenAI large language models. This implementation listens to speech, processes the conversation through the OpenAI service, and responds back, similar to Apple Siri, Amazon Alexa, Google Nest Home, Mi XiaoAi, etc.
Maix Speechrecognizer
⭐
24
Speech Recognition or Wake Word detection demo, developed using the Maixduino framework and PlatformIO, to run on the K210 MCU on Sipeed's Maix dev board
Speech Emotion Recognition
⭐
23
Predicting various emotions in human speech signals by detecting the speech components affected by emotion.
Llmchat
⭐
21
A Discord chatbot that supports popular LLMs for text generation and ultra-realistic voices for voice chat.
Python Voice Assistant
⭐
21
A Python based Voice Assistant like Siri
Robocop
⭐
20
Artificially Intelligent Machine with Computer Vision, Natural Language Processing, AI, Sense and Feelings.
Notewhispers
⭐
17
Voice memos recorded from the microphone, transcribed offline to text and converted to Joplin notes
Cif Coldec
⭐
16
[ICASSP 2022] IMPROVING END-TO-END CONTEXTUAL SPEECH RECOGNITION WITH FINE-GRAINED CONTEXTUAL KNOWLEDGE SELECTION
Tchatbot
⭐
15
A ChatBot framework to create customizable all purpose Chatbots using NLP, Tensorflow, Speech Recognition
Jam Ai
⭐
15
Jam-AI: a personal AI voice-controlled assistant using technologies such as speech recognition, NLP, TTS, and more
Zac The Ai Assistant
⭐
15
ZAC: Your robotic virtual assistant - Enhancing human-machine interaction and automation through voice recognition and web scraping.
Speech Command
⭐
15
Speech Command Recognizer using tensorflowjs
Chatbot With Voice
⭐
14
Jarvis like chatbot with voice
Ai Npcs That Can Control Their Actions Along With Dialogue
⭐
14
AI NPCs that can control their actions along with dialogue. For instance, if I ask an NPC to tell me its favorite magic spell, it not only tells me the spell but also performs it!
Olami Api Quickstart Python Samples
⭐
12
OLAMI API Quickstart Python Samples
Medico
⭐
12
AI-powered medical terms detection tool.
Favorite Research Papers
⭐
12
Listing my favorite research papers 📝 from different fields as I read them.
Marvin Virtualassistent
⭐
11
A dynamic virtual assistant made with Python; you can easily add more voice commands without writing any code
Speechtotextsamples
⭐
11
Sample code showing how to use the Azure Speech to Text service from Python 🗣
Olami Android Client Sdk
⭐
11
OLAMI API Android client library and sample codes
Olami Api Quickstart Nodejs Samples
⭐
10
OLAMI API Quickstart Node.js Samples
Dialogflow Xamarin Client
⭐
10
Xamarin SDK for Dialogflow
Ai Assistant
⭐
9
A python AI assistant. Uses SpeechRecognition module.
Olami Api Quickstart Curl Samples
⭐
9
OLAMI API Quickstart cURL Samples (in bash)
Conformer
⭐
9
An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras
Memento App
⭐
9
Android App which serves as an AI assistant for human memory
Say It
⭐
9
A mobile web application that helps you convert spoken words to sharable/editable text 🎊
Nodejs Ai Live Face Recognition Voice Controlled
⭐
9
A voice-controlled AI (natural language processing) with live face recognition and person detection, plus an integrated user system (the AI recognizes the logged-in user) and database.
Lok Lib
⭐
8
A library to power an AI voice-controlled digital assistant
Speech Rest Api
⭐
8
Transcription and TTS Rest API (OpenAI Whisper, Speechbrain)
Wordleit
⭐
8
Wordleit is a free open source markdown text editor that gives you a seamless experience as both a reader and a writer. Supported with AI Speech Recognition.
Saypi Userscript
⭐
8
An independent voice interface for Inflection AI's conversational assistant, Pi
Copyright 2018-2024 Awesome Open Source. All rights reserved.