Awesome Open Source

Search results for machine learning speech recognition

98 search results found

Transformers ⭐ 124,049

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Deepspeech ⭐ 24,127

DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.

Deep Learning Drizzle ⭐ 10,767

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

Ml Road ⭐ 2,742

Machine Learning Resources, Practice and Research

Alan Sdk Web ⭐ 2,377

Actionable AI SDK for Web to enable text and voice conversations with actions (JavaScript, React, Angular, Vue, Ember, Electron)

Alan Sdk Ios ⭐ 1,909

Actionable AI SDK for iOS to enable text and voice conversations with actions (Swift, Objective-C)

Alan Sdk Flutter ⭐ 1,742

Conversational AI SDK for Flutter to build AI-powered voice assistants for Flutter applications (iOS and Android)

Alan Sdk Android ⭐ 1,732

Conversational AI SDK for Android to build AI-powered voice assistants for Android applications (Java, Kotlin)

Alan Sdk Ionic ⭐ 1,515

In-App assistant SDK to build a multimodal conversational UX for applications created with Ionic (React, Angular, Vue)

Project_alias ⭐ 1,421

Alias is a teachable “parasite” designed to give users more control over their smart assistants, in terms of both customisation and privacy. Through a simple app, the user can train Alias to react to a custom wake-word/sound; once trained, Alias can take control of your home assistant by activating it for you.

Ios_ml ⭐ 1,406

List of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.

Awesome Diarization ⭐ 1,384

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Whisper Turbo ⭐ 1,313

Cross-Platform, GPU Accelerated Whisper 🏎️

Dragonfire ⭐ 1,294

The open-source virtual assistant for Ubuntu-based Linux distributions

Whisper Timestamped ⭐ 1,217

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Alan Sdk Cordova ⭐ 1,070

In-App assistant SDK to build a multimodal conversational UX for Apache Cordova applications

Descriptive Deep Learning

Tools for handling speech data in machine learning projects.

Deepspeech Examples ⭐ 739

Examples of how to use or integrate DeepSpeech

Whisper Playground ⭐ 637

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

Free Spoken Digit Dataset ⭐ 518

A free audio dataset of spoken digits. Think MNIST for audio.

Alan Sdk Pcf ⭐ 426

Build a voice assistant for any application created with Microsoft Power Apps

Libfaceid ⭐ 290

libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.

Kerasdeepspeech ⭐ 244

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Ai Audio Datasets ⭐ 199

This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.

Voice_activity_detection ⭐ 171

Voice Activity Detection based on Deep Learning & TensorFlow

Chinese Automatic Speech Recognition ⭐ 157

Chinese speech recognition

Rnnt Speech Recognition ⭐ 152

End-to-end speech recognition using RNN Transducers in Tensorflow 2.0

Tevr Asr Tool ⭐ 132

State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.

Persephone ⭐ 131

A tool for automatic phoneme transcription

Awesome Ai Services ⭐ 127

An overview of the AI-as-a-service landscape

Tensorflow Ctc Speech Recognition ⭐ 127

Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).

Summary ⭐ 126

summaries of all the papers I read

Spokestack Python ⭐ 124

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.

Automatic Speech Recognition ⭐ 116

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

Machine Learning Training Utilities (for TensorFlow and PyTorch)

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

Chrome Web Speech Api ⭐ 90

Chrome Web Speech API

Local ML voice chat using high-end models.

Download_audioset ⭐ 81

📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).

Unsuperior Ai Waifu ⭐ 73

AI waifu that can run on your phone or PC

Awesome Openai Whisper ⭐ 72

A curated list of awesome OpenAI Whisper resources

Speech_ai ⭐ 68

Speech to speech bot built with Python

Wav2letter.pytorch ⭐ 67

A fully convolutional network for speech-to-text, built on PyTorch.

Max Speech To Text Converter ⭐ 60

Converts spoken words into text form.

A list of papers, books, and sites on various topics related to machine learning and deep learning, along with the various fields in which they are applied

Ai With Python Series ⭐ 53

A Python series of tutorials aimed at learning Artificial Intelligence concepts. The series starts from the basics of Python and builds on top of them. We will cover three full-fledged case studies to practice implementing AI in Python with real data and to solve real-world problems.

Keras Sincnet ⭐ 49

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Noisy Student Training Asr ⭐ 44

PyTorch implementation of Noisy Student Training for the Automatic Speech Recognition and Automatic Pronunciation Error Detection problems

Simpleder ⭐ 44

A lightweight library to compute Diarization Error Rate (DER).

Pywhisper ⭐ 42

openai/whisper + extra features

Deepspeech ⭐ 40

A PyTorch implementation of DeepSpeech and DeepSpeech2.

React.ai ⭐ 39

Recognizes your speech and lets a trained AI bot respond (e.g., customer service, personal assistant) using a machine learning API (DialogFlow, apiai), speech recognition, GraphQL, Next.js, React, and Redux

Banglaspeech2text ⭐ 38

BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned on the latest whisper speech to text model for optimal performance.

Fedaudio ⭐ 36

[ICASSP 2023] FedAudio: A Federated Learning Benchmark for Audio and Speech Tasks

Real Time Voice Translator ⭐ 32

A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

Tensorflow Lite Examples Android ⭐ 31

Examples of Tensorflow Lite on Android

Nodejs Whisper ⭐ 31

Introducing NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++ by ggerganov.

Multilingual Asr ⭐ 29

Multilingual Speech Recognition for Indonesian Languages

Deepspeech Cleaner ⭐ 28

Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework

Aniemore ⭐ 28

Emotion recognition from audio and text files (Russian language only)

Speech Emotion Recognition ⭐ 23

Predicting various emotions in human speech signals by detecting the different speech components affected by human emotion.

Speech_emotion_recognition ⭐ 23

In this project, the performance of speech emotion recognition is compared between two methods (SVM vs Bi-LSTM RNN). Conventional classifiers that use machine learning algorithms have been used for decades to recognize emotions from speech. However, in recent years, deep learning methods have taken center stage and gained popularity for their ability to perform well without hand-crafted input features. Speech emotion on sets obtained from RAVDESS corpus is classified using a conven

Speech Commands Classification By Lstm Pytorch ⭐ 19

Classification of 11 types of audio clips using MFCCs features and LSTM. Pretrained on Speech Command Dataset with intensive data augmentation.

Jabberwocky ⭐ 18

An Alexa skill providing a conversational interface to any public figure (as mimicked by GPT3). The legacy GUI is no longer maintained.

Speechnet ⭐ 18

Automatic Speech Recognition

Fast Seamlessm4t Onnx ⭐ 16

ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation

Speech Command ⭐ 15

Speech Command Recognizer using tensorflowjs

Tchatbot ⭐ 15

A ChatBot framework to create customizable all purpose Chatbots using NLP, Tensorflow, Speech Recognition

Speech Recognition Learning Resources ⭐ 15

✅ A list of speech recognition learning resources including courses, books, tutorials, papers and toolkits.

Assistant ⭐ 13

A machine learning powered, voice-based virtual assistant for Raspberry Pi. Supports several features like conversation, weather, opening websites, geolocation, date/time, and creating timers.

HTK Toolkit with Linux 64 bit and Docker support

Favorite Research Papers ⭐ 12

Listing my favorite research papers 📝 from different fields as I read them.

Speech To Text Demo ⭐ 12

An application that updates its own user interface based on user's voice commands using speech recognition and machine learning

Lattice_rnn ⭐ 11

Bi-directional Lattice Recurrent Neural Networks for Confidence Estimation

Audio Pretrained Model ⭐ 11

A collection of Audio and Speech pre-trained models.

Webvoicesdk ⭐ 11

Building blocks for voice-enabled applications in the browser

Baidu Deepspeech2 ⭐ 10

A Tensorflow implementation of Baidu's Deep Speech 2 paper

Automatic speech recognition using neural networks

Speechrecognition ⭐ 10

Small-footprint Keyword Spotting

Vb_diarization ⭐ 10

VB Diarization with Eigenvoice and HMM Priors, refactored

Listen Attend And Speell Pytorch ⭐ 9

Implementation of Automatic Speech Recognition inspired by "Listen, Attend and Spell" paper in PyTorch

Conformer ⭐ 9

An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras

Memento App ⭐ 9

Android App which serves as an AI assistant for human memory

Bleepy is a Python program that can block Tagalog and English profanity in audio and videos.

Speech_recognition ⭐ 8

Indo:- A mini Speech Recognizer

Deepspeech Pytorch ⭐ 8

Pytorch implementation for DeepSpeech 2.0

Speech Recognition Tensorflow Challenge ⭐ 8

Different CNN Models for keyword spotting in speech recognition

Dljeju2018coderepoasr ⭐ 8

Details of my work on using GANs for speech synthesis to improve the accuracy of automatic speech recognition (ASR)

Kaggle Ai ⭐ 8

Categorize AI problems and record through kaggle, Google's data science website

Artificial Intelligence Toolkit, a powerful tool that makes your life better.

End2endautomaticspeechrecognition ⭐ 7

In this repository, I have developed an end to end Automatic speech recognition project. I have developed the neural network model for automatic speech recognition with PyTorch and used MLflow to manage the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry.

Speech-Recognition STT Project

Automatic Indian Sign Language Translator Isl ⭐ 6

I created an application that takes live speech or an audio recording as input, converts it into text, and displays the relevant Indian Sign Language images or GIFs, using Natural Language Processing and Machine Learning algorithms.

Mi-Go is an open-source test framework designed to evaluate and compare the accuracy of speech-to-text models on YouTube dataset.

Tflite Speech Recognition ⭐ 6

Demo for training a convolutional neural network to classify words and deploy the model to a Raspberry Pi using TensorFlow Lite.

Speech Recognition ⭐ 6

A speech-to-text app using AVAudioEngine.

Virtualassistant ⭐ 6

Virtual Assistant project carried out at Middlesex University with Dr. Nawaz Khan, funded by an ErasmusPlus program scholarship.

Bertphone ⭐ 5

Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"

Learningnumbers ⭐ 5

A guide to help kids learn numbers!
