Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for asr
asr
x
542 search results found
Transcrater
⭐
19
An open-source tool for automatic speech recognition ASR quality estimation.
Audio
⭐
18
Datasets and Transforms specific to ASR
Deepasr
⭐
18
Keras(Tensorflow) implementations of Automatic Speech Recognition
Automatic Speech Recognition For Speech Assessment Of Persian Preschool Children
⭐
18
Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The COVID-19 pandemic has highlighted the necessity of online assessment for preschool children. One of the areas that should be tested is their ability to speak. Employing an Automatic Speech Recognition (ASR) system would not help since they are pre-trained on voices that differ from children's in terms of frequency and amplitude. Because most of these are pre-tr
Burrmill
⭐
18
BurrMill core
Asr Disparities
⭐
18
Code and data for Koenecke et al. (2020)
Kosr
⭐
18
Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Freeswitch Asr
⭐
17
TTS and ASR module with auto Voice Active Detecting supported for Freeswitch. I build it for Nature sound interactive, With the embedded LUA engine we could easly build a Freeswtich application like this.
Speech Recognition
⭐
17
SDKs and docs for Skit's speech to text service
Asrservice
⭐
17
asr service based on kaldi
Openhab Cfg
⭐
17
My Openhab configuration
Inv Tn
⭐
17
A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)
Listen Attend Spell V2
⭐
17
PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
Kaldi Long Audio Alignment
⭐
17
Long audio alignment using Kaldi
Kaldi Readers For Tensorflow
⭐
16
readers that enable reading kaldi ark in tensorflow
Speech To Text Viewer
⭐
16
AWS Transcribe evaluation pipeline: bulk-process audio files and view the results
Ffasr Openapi Demos
⭐
16
语智科技远场(单麦克风)语音识别引擎 FFASR 接入指南
Goparrot
⭐
16
Goodness of Pronunciation (GOP) for oral reading assessment.
Xf Ros
⭐
16
xfei sdk for ros
G2p_seq2seq_pytorch
⭐
16
Grapheme to phoneme model for PyTorch
End To End E2e Named Entity Recognition From English Speech
⭐
16
Speechloop
⭐
16
Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
Snppar
⭐
15
Parallel/Homoplasic SNP Finder
Torch Asg
⭐
15
Auto Segmentation Criterion (ASG) implemented in pytorch
Atra
⭐
15
An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker commands
Assemblyai Node Sdk
⭐
15
The AssemblyAI JavaScript SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, as well as the latest LeMUR models.
Vosk Model Ru Adaptation
⭐
15
Wave2vec Recognize Docker
⭐
15
Wave2vec 2.0 Recognize pipeline
Kws Scripts
⭐
15
Keyword Search Recipe for Subword ASR
End To End Mandarin Asr
⭐
15
End-to-end speech recognition on AISHELL dataset.
Speech_sdk
⭐
15
Chinese Speech SDK for Android, iOS and embedded Linux platforms. http://ai.mobvoi.com
Torchain
⭐
15
WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
A2300
⭐
15
MIMO platform for advanced communications and PNT applications
Pcpm
⭐
14
Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Speech System Zh
⭐
14
该功能包意在结合ROS框架开发出具有中文语音交互功能的语音交互系统 -- hntea-hong
Linto Agent
⭐
14
LinTO platform services stack deployment tool for Docker Swarm cluster
Ovos Stt Plugin Vosk
⭐
14
vosk STT plugin for mycroft
Powershellsamples
⭐
14
Persianspeech
⭐
14
Persian ASR dataset
Wiscore
⭐
14
WisCore , Openwrt, Amazon AVS , Amazon Alexa Module ,IoT Gateway Module
Korean Speech Recognition Quartznet
⭐
14
Jasper 기반 양자화된 모델인 Quartznet 한국어 음성인식
Megs
⭐
14
A merged version of multiple open-source German speech datasets.
Cs224s Deepspeech
⭐
14
CS224S Course Project
Learning_to_adapt
⭐
14
Coordinate-wise meta-learner for speaker adaptation of ASR models.
Asr Repr Analysis
⭐
13
Wav2vec2 Live Japanese Translator
⭐
13
real time japanese speech recognition translator using wav2vec2
Asr Nepali Using Cnn Bilstm Resnet
⭐
13
Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet - IEEE (ICICT - 2022)
Quartznet Asr
⭐
13
Unityasr
⭐
13
Automatic Speech Recognition in Unity.
Asr_demo
⭐
13
语音识别API,分实时语音和长语音离线上传识别,支持中英文等多达100个国家的语言实时转写和同声传译
Spokestack Tray Android
⭐
13
A UI component that makes it easy to add voice interaction to your app.
Whisper Finetune
⭐
13
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
German Asr Lm Tools
⭐
13
Crawling and creating a German language model resource
Kaldi Alligner
⭐
13
scripts to align a given wave to its transcription using trained models by Kaldi
Openai Whisper Microservice
⭐
13
This is an OpenAI Whisper automatic speech recognition microservice
Kbase Media
⭐
12
视频、音频、图片内容识别、语音转写、语音合成 / easy convert video audio image to text, and revert text to audio(base64)
Mimic Enhance
⭐
12
Speech enhancement using mimic loss
Rokid Webhook Hass
⭐
12
rokid webhook component for Home Assistant (若琪HA组件)
Arabic Speech Recognition
⭐
12
This repository contains my attempt to use two famous speech recognition frameworks (Kaldi, CMU Sphinx4) for Arabic Language using the publicly-available dataset "Arabic Corpus of Isolated Words"
Lvterminal
⭐
12
Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)
Exkaldi
⭐
12
An advance kaldi wrapper for Pyhton
Whisper_android
⭐
12
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
Persian Ctc Segmentation
⭐
12
Persian Speech Segmentation by CTC-segmentation Method
Media Annotator
⭐
12
Web-based annotation tool for media data. The easiest way to create you own media dataset.
Gujarati_speech_recognition
⭐
12
Offline speech recognition for Gujarati Language.
Sonosco
⭐
12
Framework for Deep Speech Recognition
Voskjs
⭐
12
Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.
Seaco Paraformer
⭐
12
Source code, model links and open test sets for paper SeACo-Paraformer.
Galvasr
⭐
11
ASR library
End To End Speech Recognition Models
⭐
11
PyTorch implementation of automatic speech recognition models.
Asr
⭐
11
A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech Recognition (ASR).
Pansori Tedxkr Corpus
⭐
11
Korean ASR Corpus generated from TEDx talks
Uyghur Asr Ctc
⭐
11
Speech Recognition for Uyghur using deep learning
Twilio Asr Realtime Dashboard
⭐
11
Twilio ASR and Intent Realtime Dashboard
Arsenalpython
⭐
11
python 开发的兵器库. 收藏内容包括参考代码,实验,培训资料等
Ispeech Speech Recognition Asr Voice Recognition.js
⭐
11
iSpeech's open source javascript SDK for speech recognition (ASR) API, enables you to easily create Web applications using iSpeech freeform, command or custom statistical language models. The speech recognition API powering this speech recognition SDK supports nearly 30 languages and accents. The acoustic models are based on huge amounts of low and high quality hand labeled audio data (millions of utterances). iSpeech is a viable alternative to Google ASR (Web Speech API), which only includes V
Pzh Py Speech
⭐
11
A tiny audio speech (.wav) utility tool (GUI) based on Python2.7+wxPython4.0+PyAudio+Matplotlib+SpeechRec | 痞子衡语音处理助手,一款支持多引擎的wav格式语音处理小工具(音频录播与波形显示,语音识别,文语合成
Karotz
⭐
11
ruby bindings for the karotz rest api
Lattice_combination
⭐
11
Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices
Prayer Times
⭐
11
a PHP package to calculate prayer times
Docker Py Kaldi Asr And Model
⭐
11
STT Service based on Kaldi ASR
Uma Asr
⭐
11
This repository is the official implementation of "Unimodal Aggregation for CTC-based Speech Recognition".
13.3 Hours Chinese Mandarin Synthesis Corpus Female Emotional
⭐
11
Chinese Mandarin Synthesis Corpus-Female/Emotional
Fastaci
⭐
11
fastACI toolbox: the MATLAB toolbox for investigating auditory perception using reverse correlation.
Automatic_speech_recognition_with_multi_models
⭐
10
A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popular cells (most are LSTM and its variants) and models (unidirectioanl RNN, bidirectional RNN, ResNet and so on). Moreover, you are welcome to play with self-defined cells or models.
Slt.kit
⭐
10
Spoken Language Translation System
Etos Keywordspotting
⭐
10
PyTorch implementations of neural network models for keyword spotting
Asr1601
⭐
10
L4 R3: ASR Cortex-R5 LTE Cat.1 SoC (ASR1601/ASR1603/ASR3601)
Ice Asr
⭐
10
An automatic speech recognition environment for Icelandic based on Kaldi
Sincnet_adapt
⭐
10
Raw waveform adaptation with SincNet
Albanian Asr
⭐
10
This project is an AI-based transcription tool for the Albanian language. The tool is designed to automatically transcribe Albanian speech to text using Python.
Torgo_asr
⭐
10
A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech
Defending Neural Backdoors Via Generative Distribution Modeling
⭐
10
The code is for our NeurIPS 2019 paper: https://arxiv.org/abs/1910.04749
Lattice Rescore
⭐
10
Cnn Speech Recognition
⭐
10
Chinese Asr
⭐
10
Chinese-ASR built on kaldi
Asr
⭐
10
Automatic speech recognition using neural networks
Corr Seq Labeling
⭐
10
Code for paper "Joint Learning of Correlated Sequence Labelling Tasks Using Bidirectional Recurrent Neural Networks"
Smartspeech
⭐
10
SmartSpeech — это сервис для синтеза и распознавания речи
Imagingapfs
⭐
9
How to make a disk image of APFS container, then restore it.
Related Searches
Python Asr (347)
Speech Recognition Asr (250)
301-400 of 542 search results
< Previous
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.