Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for automatic speech recognition
automatic-speech-recognition
x
97 search results found
Wenet
⭐
3,717
Production First and Production Ready End-to-End Speech Recognition Toolkit
Awesome Speech Recognition Speech Synthesis Papers
⭐
2,869
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Automatic_speech_recognition
⭐
2,743
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stt
⭐
1,988
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Whisper Asr Webservice
⭐
1,317
OpenAI Whisper ASR Webservice API
Pororo
⭐
1,081
PORORO: Platform Of neuRal mOdels for natuRal language prOcessing
Tensorflowasr
⭐
890
⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Open_stt
⭐
671
Open STT
Whispering
⭐
597
Streaming transcriber with whisper
Cheetah
⭐
537
On-device streaming speech-to-text engine powered by deep learning
Awesome Kaldi
⭐
478
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Neural_sp
⭐
466
End-to-end ASR/LM implementation with PyTorch
Jiwer
⭐
440
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Tensorflowasr
⭐
434
一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目,CPU上的实时率(RTF)小于
Leopard
⭐
390
On-device speech-to-text engine powered by deep learning
Huggingsound
⭐
357
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
Whisper Youtube
⭐
282
🔉 Youtube Videos Transcription with OpenAI's Whisper
Tensorflow_end2end_speech_recognition
⭐
275
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
Speech_dataset
⭐
229
The dataset of Speech Recognition
Awesome Large Audio Models
⭐
207
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Hey Jetson
⭐
189
Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Sova Asr
⭐
149
SOVA ASR (Automatic Speech Recognition)
Deep_avsr
⭐
138
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Elpis
⭐
137
🙊 software for creating speech recognition models.
At16k
⭐
123
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
Fast Rir
⭐
122
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
Automatic Speech Recognition
⭐
116
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
Mongolian Speech Recognition
⭐
86
Mongolian speech recognition with PyTorch
Zerospeech Tts Without T
⭐
79
A Pytorch implementation for the ZeroSpeech 2019 challenge.
Wav2letter
⭐
70
Speech Recognition model based off of FAIR research paper built using Pytorch.
Viet Asr
⭐
65
VietASR - Vietnamese Automatic Speech Recognition
Thonburian Whisper
⭐
55
A repository for Thonburian Whisper: An Automatic Speech Recognition model for Thai fine-tuned on Whisper. Try demo on Huggingface space:
Tafrigh
⭐
55
تفريغ المواد المرئية أو المسموعة إلى نصوص
Obvi
⭐
51
A Polymer 3+ webcomponent / button for doing speech recognition
Auto Subtitled Video Generator
⭐
49
Input a YouTube video link or upload a video file and get a video with subtitles.
Go Subgen
⭐
49
Automatically generate subtitles for your media using whisper.cpp via webhooks with support for Radarr & Sonarr
Asrecognition
⭐
47
ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
Automatic Speech Recognition
⭐
43
Automatic Speech Recognition using Tensorflow
Soxan
⭐
37
Wav2Vec for speech recognition, classification, and audio classification
Hf Experiments
⭐
37
Experiments with Hugging Face 🔬 🤗
Cif Pytorch
⭐
36
[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).
Pythaiasr
⭐
35
Python Thai Automatic Speech Recognition
Ts Rir
⭐
35
Translating Synthetic RIRs to Real RIRs
Automatic_speech_recognition
⭐
33
Vietnamese Automatic Speech Recognition
Ir Gan
⭐
32
Augmenting Room Impulse Response
Speech Recognition
⭐
31
End-to-End Speech Recognition using Neural Networks.
Automatic Speech Recognition
⭐
30
End-to-End Speech Recognition Using Tensorflow
Kenlm Training
⭐
27
Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2
Ai Engine
⭐
25
Easy to use Multi-Provider ASR/Speech To Text and NLP engine
Bembaspeech
⭐
25
This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/TV shows transcripts, Youtube Video transcripts, Online sources. The corpus has 14, 438 utterances culminating into over 24 hours of speech.
Asr Corpus Creator
⭐
24
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
Best Rq Pytorch
⭐
20
Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.
Jasper
⭐
20
PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)
Speech To Windows Input
⭐
19
Perform speech-to-text (STT/ASR) with Azure speech service and simulate keyboard to input the recognized text; Supports English, Chinese, Japanese, and more
Tgisper
⭐
18
Telegram bot with ASR
Kaldi Long Audio Alignment
⭐
17
Long audio alignment using Kaldi
Wav2vec4bp
⭐
16
Wav2vec resources and models for Brazilian Portuguese
Fast Seamlessm4t Onnx
⭐
16
ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation
Openai_whisper_asr
⭐
15
A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models
Wave2vec Recognize Docker
⭐
15
Wave2vec 2.0 Recognize pipeline
Linto Agent
⭐
14
LinTO platform services stack deployment tool for Docker Swarm cluster
Ovos Stt Plugin Vosk
⭐
14
vosk STT plugin for mycroft
Quartznet Asr
⭐
13
Asr_demo
⭐
13
语音识别API,分实时语音和长语音离线上传识别,支持中英文等多达100个国家的语言实时转写和同声传译
Openai Whisper Microservice
⭐
13
This is an OpenAI Whisper automatic speech recognition microservice
Wav2vec2 Live Japanese Translator
⭐
13
real time japanese speech recognition translator using wav2vec2
Turkish Speech To Text
⭐
13
Fine-tuning for automatic speech recognition on low-resource languages with character-based CTC model
Learn Tensorflow
⭐
12
Make Smart Things with TensorFlow
Gan_harmonized_with_hmms
⭐
12
Code:Completely Unsupervised Speech Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Models
Whisper_android
⭐
12
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
Arabic Speech Recognition
⭐
12
This repository contains my attempt to use two famous speech recognition frameworks (Kaldi, CMU Sphinx4) for Arabic Language using the publicly-available dataset "Arabic Corpus of Isolated Words"
Kaldi_helpers
⭐
12
🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Asr
⭐
11
A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech Recognition (ASR).
Sharif Emotional Speech Dataset
⭐
11
A large-scale validated database for Persian speech emotion detection.
Uyghur Asr Ctc
⭐
11
Speech Recognition for Uyghur using deep learning
Vad Sli Asr
⭐
11
A pipeline to isolate and transcribe one language in mixed-language speech
Sgem
⭐
10
Official PyTorch implementation of SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy Minimization (INTERSPEECH 2023 Oral Presentation)
Automatic_speech_recognition_with_multi_models
⭐
10
A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popular cells (most are LSTM and its variants) and models (unidirectioanl RNN, bidirectional RNN, ResNet and so on). Moreover, you are welcome to play with self-defined cells or models.
Asr
⭐
10
Automatic speech recognition using neural networks
Youtube Asr Crawler
⭐
10
Audionet
⭐
10
A deep model for speech recognition via Keras(front_end) and TensorFlow(back_end).
Personalized_asr
⭐
9
[MTAP] Official implementation: A mechanism for personalized Automatic Speech Recognition for less frequently spoken languages: the Greek case
Bbc Speech Segmenter
⭐
9
A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.
Q Asr
⭐
9
Integer-only Zero-shot Quantization for Efficient Speech Recognition
Tatt
⭐
8
Transcribe All The Things™ is a CLI for creating and managing speech-to-text transcripts.
Detect Segment Cough
⭐
8
A python model to detect and segment coughs, forked from coughvid's repo
Werpy
⭐
8
🐍📦 Rapidly calculate and analyze the Word Error Rate (WER) with this powerful yet lightweight Python package.
Dljeju2018coderepoasr
⭐
8
Details on my work on using GANs for speech synthesis for improving Speech Recognition accuracy for ASR problem
Kaldi For Dummies
⭐
8
This is the repository for my version of Kaldi for Dummies example.
Quartznet Pytorch
⭐
7
Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]
Ntspeechrecognition
⭐
7
NTSpeechRecognition is a iOS/macOS framework, written in Objective-c, providing speech recognition functionality. For decoding PocketSphinx is used. (Keyword spotting, JSGF Grammar, NGram)
Kaust Whisper Adapter
⭐
7
INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!
Kaldi Asr Aws
⭐
7
This code repo is in reference to the Medium Article for setting up Kaldi on AWS
Matlab_feat
⭐
6
Functions for creating speech features in MATLAB.
Speech Data Augmentation
⭐
6
Speech dataset processing and augmentation (add background noise and change speech pitch) for speech recognition
End To End Asr Transformer
⭐
6
An end to end ASR Transformer model training repo
Whisper Youtube
⭐
5
This repository will guide you to create automatically generate YouTube Transcription using Using OpenAI's Whisper
1-97 of 97 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.