Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for speech recognition asr
asr
x
speech-recognition
x
209 search results found
Asr Ios Local
⭐
53
基于kaldi的ios本地语音识别(本地实时流)Kaldi-based ios native speech recognition (local real-time streaming)
Vosk Asterisk
⭐
52
Speech Recognition in Asterisk with Vosk Server
Speech Transformer Tf2.0
⭐
51
transformer for ASR-systerm (via tensorflow2.0)
Opensnips
⭐
50
Open source projects related to Snips https://snips.ai/.
Bertpunc
⭐
49
SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model
Docker Whisperx
⭐
49
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)
Keras Sincnet
⭐
49
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Spokestack Android
⭐
49
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Asr
⭐
48
ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.
Asrecognition
⭐
47
ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
React Native Spokestack
⭐
46
Spokestack: give your React Native app a voice interface!
Voice Privacy Challenge 2022
⭐
45
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
Whispers2t
⭐
44
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Enginer
Awesome Asr Contextualization
⭐
42
A curated list of awesome papers on contextualizing E2E ASR outputs
Py Nltools
⭐
42
A collection of basic python modules for spoken natural language processing
Iflytek_awaken_asr
⭐
40
use iflytek's technology to realize awaken and order recognition
Transfer Learning Asr
⭐
40
Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017
Open_stt_e2e
⭐
39
PyTorch end-to-end speech recognition
Yandex Speech
⭐
38
node.js module for Yandex speech systems (ASR & TTS)
Cif Pytorch
⭐
36
[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).
Conformer Athena
⭐
33
Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.
Baiduasrandtts
⭐
32
Using Baidu API. ASR: Automatic Speech Recognition;TTS: Text To Speech; 百度语音识别、语音合成API使用。
Voice Recognition Ua
⭐
32
Training scripts for Speech-To-Text models for Ukrainian language
Miniasr
⭐
31
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
Greenkey Asrtoolkit
⭐
31
A collection of useful tools for handling speech recognition data
Voice Assistant Chatgpt
⭐
31
Voice Assistant based on Whisper ASR and ChatGPT API
Linto Stt
⭐
30
An automatic speech recognition API
Automatic Speech Recognition
⭐
30
End-to-End Speech Recognition Using Tensorflow
Multilingual Asr
⭐
29
Multilingual Speech Recognition for Indonesian Languages
Lightning Asr
⭐
29
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Openasr
⭐
29
A pytorch based end2end speech recognition system.
React Native Vosk
⭐
29
Speech recognition module for react native using Vosk library
Pytorch_mlp_for_asr
⭐
27
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
Mspeech
⭐
26
Program for speech recognition using the Google Speech API, voice commands, control your computer.
Spokestack Ios
⭐
26
Spokestack: give your iOS app a voice interface!
Ai Engine
⭐
25
Easy to use Multi-Provider ASR/Speech To Text and NLP engine
Asr Corpus Creator
⭐
24
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
Speech Recognition Experiments
⭐
24
Experiments to test different speech recognition systems for SEPIA Framework
Voice100
⭐
22
Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.
Meta Transfer Learning
⭐
22
Implementation of meta-transfer-learning (ACL 2020)
Korean_asr
⭐
22
Korean Automatic Speech Recognition
Keenasr Android Poc
⭐
21
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Kaldi Br
⭐
21
☕🇧🇷 Scripts para o Kaldi em Português Brasileiro
Opensource Voice Tools
⭐
21
A repo listing known open source voice tools, ordered by where they sit in the voice stack
Speech Recognition Evaluation
⭐
21
Evaluate results from ASR/Speech-to-Text quickly
Jasper
⭐
20
PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)
Summerasr
⭐
20
SummerAsr 是一个基于C++的可独立编译且几乎没有额外依赖库的本地中文语音识别器。 Summer Asr is a Chinese automatic speech recognize project written with C++ that can be easily built standalone without any depencency.
Transcrater
⭐
19
An open-source tool for automatic speech recognition ASR quality estimation.
Kosr
⭐
18
Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Automatic Speech Recognition For Speech Assessment Of Persian Preschool Children
⭐
18
Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The COVID-19 pandemic has highlighted the necessity of online assessment for preschool children. One of the areas that should be tested is their ability to speak. Employing an Automatic Speech Recognition (ASR) system would not help since they are pre-trained on voices that differ from children's in terms of frequency and amplitude. Because most of these are pre-tr
Deepasr
⭐
18
Keras(Tensorflow) implementations of Automatic Speech Recognition
Listen Attend Spell V2
⭐
17
PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
Speech Recognition
⭐
17
SDKs and docs for Skit's speech to text service
Kaldi Long Audio Alignment
⭐
17
Long audio alignment using Kaldi
Speechloop
⭐
16
Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
End To End Mandarin Asr
⭐
15
End-to-end speech recognition on AISHELL dataset.
Atra
⭐
15
An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker commands
Vosk Model Ru Adaptation
⭐
15
Ovos Stt Plugin Vosk
⭐
14
vosk STT plugin for mycroft
Megs
⭐
14
A merged version of multiple open-source German speech datasets.
Pcpm
⭐
14
Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Korean Speech Recognition Quartznet
⭐
14
Jasper 기반 양자화된 모델인 Quartznet 한국어 음성인식
Unityasr
⭐
13
Automatic Speech Recognition in Unity.
Openai Whisper Microservice
⭐
13
This is an OpenAI Whisper automatic speech recognition microservice
Whisper Finetune
⭐
13
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
Asr Nepali Using Cnn Bilstm Resnet
⭐
13
Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet - IEEE (ICICT - 2022)
Arabic Speech Recognition
⭐
12
This repository contains my attempt to use two famous speech recognition frameworks (Kaldi, CMU Sphinx4) for Arabic Language using the publicly-available dataset "Arabic Corpus of Isolated Words"
Voskjs
⭐
12
Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.
Gujarati_speech_recognition
⭐
12
Offline speech recognition for Gujarati Language.
Sonosco
⭐
12
Framework for Deep Speech Recognition
Whisper_android
⭐
12
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
End To End Speech Recognition Models
⭐
11
PyTorch implementation of automatic speech recognition models.
Ispeech Speech Recognition Asr Voice Recognition.js
⭐
11
iSpeech's open source javascript SDK for speech recognition (ASR) API, enables you to easily create Web applications using iSpeech freeform, command or custom statistical language models. The speech recognition API powering this speech recognition SDK supports nearly 30 languages and accents. The acoustic models are based on huge amounts of low and high quality hand labeled audio data (millions of utterances). iSpeech is a viable alternative to Google ASR (Web Speech API), which only includes V
13.3 Hours Chinese Mandarin Synthesis Corpus Female Emotional
⭐
11
Chinese Mandarin Synthesis Corpus-Female/Emotional
Asr
⭐
11
A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech Recognition (ASR).
Uma Asr
⭐
11
This repository is the official implementation of "Unimodal Aggregation for CTC-based Speech Recognition".
Galvasr
⭐
11
ASR library
Asr
⭐
10
Automatic speech recognition using neural networks
Lattice Rescore
⭐
10
Automatic_speech_recognition_with_multi_models
⭐
10
A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popular cells (most are LSTM and its variants) and models (unidirectioanl RNN, bidirectional RNN, ResNet and so on). Moreover, you are welcome to play with self-defined cells or models.
Semi Supervised Asr
⭐
9
Asr_project
⭐
9
This repository created for the NHN ASR hackathon competition.
Voice Tech Study
⭐
9
语音识别 语音前端处理 语音合成 语音转换等等语音技术的资料汇总
Deepspeech Pytorch
⭐
8
Pytorch implementation for DeepSpeech 2.0
Simplespeechloop
⭐
8
A very basic demonstration connecting speech recognition and text-to-speech
Kaggle Ai
⭐
8
Categorize AI problems and record through kaggle, Google's data science website
Deepspeech Kabyle
⭐
8
Automatic Speech Recognition (ASR) - Kabyle
React Native Spokestack Tray
⭐
8
React Native component for adding Spokestack to a React Native app
Be_nlp_speech_resources
⭐
8
Links to Belarusian NLP and Speech resources
Kaldi Avsr
⭐
8
Kaldi-based audio-visual speech recognition
Taiwanese Whisper
⭐
7
fine-tune Whipser model for Taiwanese speech recognition
Bilatticernn Confidence
⭐
7
Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks https://arxiv.org/abs/1910.11933 or https://ieeexplore.ieee.org/document/9053264
Cl Asr
⭐
7
A (not entirely working) stand-alone speech recognizer written in Common Lisp
Ce Optimizedloss
⭐
7
Computes the MWER (minimum WER) Loss with beam search and negative sampling strategy.
Speech Adapters
⭐
6
Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech understanding
Sentence Generator
⭐
6
Randomly generate phrases, sentences, and queries. This can be used to generate test set for Automatic Speech Recognition (ASR).
Asr Mp3 Compression Aaes
⭐
6
MP3 Compression, End-to-End Automatic Speech Recognition (ASR), Adversarial Noise
Speech Datasets For Asr
⭐
6
Download speech datasets (English and non-English) for Automatic Speech Recognition
Whisper Finetuning Be
⭐
6
Finetuning Whisper ASR model for Belarusian language
Full Lattice Search
⭐
6
Full Text Search Over Probabilistic Lattices with Elasticsearch!
Related Searches
Python Speech Recognition (876)
Python Asr (347)
101-200 of 209 search results
< Previous
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.