Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for machine learning speech recognition
machine-learning
x
speech-recognition
x
98 search results found
Transformers
⭐
124,049
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Deepspeech
⭐
24,127
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Deep Learning Drizzle
⭐
10,767
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Ml Road
⭐
2,742
Machine Learning Resources, Practice and Research
Alan Sdk Web
⭐
2,377
Actionable AI SDK for Web to enable text and voice conversations with actions (JavaScript, React, Angular, Vue, Ember, Electron)
Alan Sdk Ios
⭐
1,909
Actionable AI SDK for iOS to enable text and voice conversations with actions (Swift, Objective-C)
Alan Sdk Flutter
⭐
1,742
Conversational AI SDK for Flutter to build AI-powered voice assistants for Flutter applications (iOS and Android)
Alan Sdk Android
⭐
1,732
Conversational AI SDK for Android to build AI-powered voice assistants for Android applications (Java, Kotlin)
Alan Sdk Ionic
⭐
1,515
In-App assistant SDK to build a multimodal conversational UX for applications created with Ionic (React, Angular, Vue)
Project_alias
⭐
1,421
Alias is a teachable “parasite” that is designed to give users more control over their smart assistants, both when it comes to customisation and privacy. Through a simple app the user can train Alias to react on a custom wake-word/sound, and once trained, Alias can take control over your home assistant by activating it for you.
Ios_ml
⭐
1,406
List of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.
Awesome Diarization
⭐
1,384
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Whisper Turbo
⭐
1,313
Cross-Platform, GPU Accelerated Whisper 🏎️
Dragonfire
⭐
1,294
the open-source virtual assistant for Ubuntu based Linux distributions
Whisper Timestamped
⭐
1,217
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Alan Sdk Cordova
⭐
1,070
In-App assistant SDK to build a multimodal conversational UX for Apache Cordova applications
Kur
⭐
812
Descriptive Deep Learning
Lhotse
⭐
794
Tools for handling speech data in machine learning projects.
Deepspeech Examples
⭐
739
Examples of how to use or integrate DeepSpeech
Whisper Playground
⭐
637
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
Free Spoken Digit Dataset
⭐
518
A free audio dataset of spoken digits. Think MNIST for audio.
Alan Sdk Pcf
⭐
426
Build a voice assistant for any application created with Microsoft Power Apps
Libfaceid
⭐
290
libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Kerasdeepspeech
⭐
244
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Ai Audio Datasets
⭐
199
This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.
Voice_activity_detection
⭐
171
Voice Activity Detection based on Deep Learning & TensorFlow
Chinese Automatic Speech Recognition
⭐
157
Chinese speech recognition
Rnnt Speech Recognition
⭐
152
End-to-end speech recognition using RNN Transducers in Tensorflow 2.0
Tevr Asr Tool
⭐
132
State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.
Persephone
⭐
131
A tool for automatic phoneme transcription
Awesome Ai Services
⭐
127
An overview of the AI-as-a-service landscape
Tensorflow Ctc Speech Recognition
⭐
127
Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).
Summary
⭐
126
summaries of all the papers I read
Spokestack Python
⭐
124
Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.
Automatic Speech Recognition
⭐
116
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
Mltu
⭐
100
Machine Learning Training Utilities (for TensorFlow and PyTorch)
Ctc Asr
⭐
92
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Chrome Web Speech Api
⭐
90
Chrome Web Speech API
Emeltal
⭐
83
Local ML voice chat using high-end models.
Download_audioset
⭐
81
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Unsuperior Ai Waifu
⭐
73
AI waifu that can run on your phone or PC
Awesome Openai Whisper
⭐
72
A curated list of awesome OpenAI's Whisper
Speech_ai
⭐
68
Speech to speech bot built with Python
Wav2letter.pytorch
⭐
67
A fully convolution-network for speech-to-text, built on pytorch.
Max Speech To Text Converter
⭐
60
Converts spoken words into text form.
Papers
⭐
60
A list of paper, books and sites for various different topics related to machine learning and deep learning along with various field in which it is implemented
Ai With Python Series
⭐
53
A Python Series of tutorials aimed at learning Artificial Intelligence concepts. This series of tutorials start from the basics of Python and builds on top of it. We will cover three full-fledged case studies to practice AI Implementation of Python with real data and solve real-world problems.
Keras Sincnet
⭐
49
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Noisy Student Training Asr
⭐
44
Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem
Simpleder
⭐
44
A lightweight library to compute Diarization Error Rate (DER).
Pywhisper
⭐
42
openai/whisper + extra features
Deepspeech
⭐
40
A PyTorch implementation of DeepSpeech and DeepSpeech2.
React.ai
⭐
39
It recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
Banglaspeech2text
⭐
38
BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned on the latest whisper speech to text model for optimal performance.
Fedaudio
⭐
36
[ICASSP 2023] FedAudio: A Federated Learning Benchmark for Audio and Speech Tasks
Real Time Voice Translator
⭐
32
A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.
Tensorflow Lite Examples Android
⭐
31
Examples of Tensorflow Lite on Android
Nodejs Whisper
⭐
31
Introducing NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++ by ggerganov.
Multilingual Asr
⭐
29
Multilingual Speech Recognition for Indonesian Languages
Deepspeech Cleaner
⭐
28
Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework
Aniemore
⭐
28
Emotions recognition from audio and text files (only russian language)
Speech Emotion Recognition
⭐
23
Predicting various emotion in human speech signal by detecting different speech components affected by human emotion.
Speech_emotion_recognition
⭐
23
In this project, the performance of speech emotion recognition is compared between two methods (SVM vs Bi-LSTM RNN).Conventional classifiers that uses machine learning algorithms has been used for decades in recognizing emotions from speech. However, in recent years, deep learning methods have taken the center stage and have gained popularity for their ability to perform well without any input hand-crafted features. Speech emotion on sets obtained from RAVDESS corpus is classified using a conven
Speech Commands Classification By Lstm Pytorch
⭐
19
Classification of 11 types of audio clips using MFCCs features and LSTM. Pretrained on Speech Command Dataset with intensive data augmentation.
Jabberwocky
⭐
18
An Alexa skill providing a conversational interface to any public figure (as mimicked by GPT3). The legacy GUI is no longer maintained.
Speechnet
⭐
18
Automatic Speech Recognition
Fast Seamlessm4t Onnx
⭐
16
ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation
Speech Command
⭐
15
Speech Command Recognizer using tensorflowjs
Tchatbot
⭐
15
A ChatBot framework to create customizable all purpose Chatbots using NLP, Tensorflow, Speech Recognition
Speech Recognition Learning Resources
⭐
15
✅ A list of speech recognition learning resources including courses, books, tutorials, papers and toolkits.
Assistant
⭐
13
A machine learning powered, voice-based virtual assistant for Raspberry Pi. Supports several features like conversation, weather, opening websites, geolocation, date/time, and creating timers.
Htk
⭐
12
HTK Toolkit with Linux 64 bit and Docker support
Favorite Research Papers
⭐
12
Listing my favorite research papers 📝 from different fields as I read them.
Speech To Text Demo
⭐
12
An application that updates its own user interface based on user's voice commands using speech recognition and machine learning
Lattice_rnn
⭐
11
Bi-directional Lattice Recurrent Neural Networks for Confidence Estimation
Audio Pretrained Model
⭐
11
A collection of Audio and Speech pre-trained models.
Webvoicesdk
⭐
11
Buildings block for voice-enabled applications in the browser
Baidu Deepspeech2
⭐
10
A Tensorflow implementation of Baidu's Deep Speech 2 paper
Asr
⭐
10
Automatic speech recognition using neural networks
Speechrecognition
⭐
10
Small-footprint Keyword Spotting
Vb_diarization
⭐
10
VB Diarization with Eigenvoice and HMM Priors, refactored
Listen Attend And Speell Pytorch
⭐
9
Implementation of Automatic Speech Recognition inspired by "Listen, Attend and Spell" paper in PyTorch
Conformer
⭐
9
An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras
Memento App
⭐
9
Android App which serves as an AI assistant for human memory
Bleepy
⭐
8
Bleepy is a Python program that can block Tagalog and English profanity in audio and videos.
Speech_recognition
⭐
8
Indo:- A mini Speech Recognizer
Deepspeech Pytorch
⭐
8
Pytorch implementation for DeepSpeech 2.0
Speech Recognition Tensorflow Challenge
⭐
8
Different CNN Models for keyword spotting in speech recognition
Dljeju2018coderepoasr
⭐
8
Details on my work on using GANs for speech synthesis for improving Speech Recognition accuracy for ASR problem
Kaggle Ai
⭐
8
Categorize AI problems and record through kaggle, Google's data science website
Aitk
⭐
7
Artificial Intelligence Toolkit, a powerful tool that makes your life better.
End2endautomaticspeechrecognition
⭐
7
In this repository, I have developed an end to end Automatic speech recognition project. I have developed the neural network model for automatic speech recognition with PyTorch and used MLflow to manage the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry.
Project
⭐
6
Speech-Recognition STT Project
Automatic Indian Sign Language Translator Isl
⭐
6
I created an application which takes in live speech or audio recording as input, converts it into text and displays the relevant Indian Sign Language images or GIFs, using Natural Language Processing and Machine Learning Algorithm.
Mi Go
⭐
6
Mi-Go is an open-source test framework designed to evaluate and compare the accuracy of speech-to-text models on YouTube dataset.
Tflite Speech Recognition
⭐
6
Demo for training a convolutional neural network to classify words and deploy the model to a Raspberry Pi using TensorFlow Lite.
Speech Recognition
⭐
6
A speech-to-text app using AVAudioEngine.
Virtualassistant
⭐
6
Virtual Assistant project done in the Middlesex University with Dr. Nawaz Khan by scholarship of the ErasmusPlus program.
Bertphone
⭐
5
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
Learningnumbers
⭐
5
A guide to help kids learn numbers!
Related Searches
Python Machine Learning (14,099)
Jupyter Notebook Machine Learning (12,247)
Machine Learning Neural Network (4,421)
Machine Learning Data Science (3,802)
Machine Learning Tensorflow (2,982)
Machine Learning Artificial Intelligence (2,074)
Machine Learning Classification (1,874)
Dataset Machine Learning (1,872)
Machine Learning Pytorch (1,835)
Machine Learning Computer Vision (1,796)
1-98 of 98 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.