Awesome Open Source

Programming Languages

Search results for asr

542 search results found

Transcrater ⭐ 19

An open-source tool for automatic speech recognition ASR quality estimation.

Datasets and Transforms specific to ASR

Keras(Tensorflow) implementations of Automatic Speech Recognition

Automatic Speech Recognition For Speech Assessment Of Persian Preschool Children ⭐ 18

Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The COVID-19 pandemic has highlighted the necessity of online assessment for preschool children. One of the areas that should be tested is their ability to speak. Employing an Automatic Speech Recognition (ASR) system would not help since they are pre-trained on voices that differ from children's in terms of frequency and amplitude. Because most of these are pre-tr

Burrmill ⭐ 18

Asr Disparities ⭐ 18

Code and data for Koenecke et al. (2020)

Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)

Freeswitch Asr ⭐ 17

TTS and ASR module with auto Voice Active Detecting supported for Freeswitch. I build it for Nature sound interactive, With the embedded LUA engine we could easly build a Freeswtich application like this.

Speech Recognition ⭐ 17

SDKs and docs for Skit's speech to text service

Asrservice ⭐ 17

asr service based on kaldi

Openhab Cfg ⭐ 17

My Openhab configuration

A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)

Listen Attend Spell V2 ⭐ 17

PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).

Kaldi Long Audio Alignment ⭐ 17

Long audio alignment using Kaldi

Kaldi Readers For Tensorflow ⭐ 16

readers that enable reading kaldi ark in tensorflow

Speech To Text Viewer ⭐ 16

AWS Transcribe evaluation pipeline: bulk-process audio files and view the results

Ffasr Openapi Demos ⭐ 16

语智科技远场(单麦克风)语音识别引擎 FFASR 接入指南

Goparrot ⭐ 16

Goodness of Pronunciation (GOP) for oral reading assessment.

xfei sdk for ros

G2p_seq2seq_pytorch ⭐ 16

Grapheme to phoneme model for PyTorch

End To End E2e Named Entity Recognition From English Speech ⭐ 16

Speechloop ⭐ 16

Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?

Parallel/Homoplasic SNP Finder

Torch Asg ⭐ 15

Auto Segmentation Criterion (ASG) implemented in pytorch

An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker commands

Assemblyai Node Sdk ⭐ 15

The AssemblyAI JavaScript SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, as well as the latest LeMUR models.

Vosk Model Ru Adaptation ⭐ 15

Wave2vec Recognize Docker ⭐ 15

Wave2vec 2.0 Recognize pipeline

Kws Scripts ⭐ 15

Keyword Search Recipe for Subword ASR

End To End Mandarin Asr ⭐ 15

End-to-end speech recognition on AISHELL dataset.

Speech_sdk ⭐ 15

Chinese Speech SDK for Android, iOS and embedded Linux platforms. http://ai.mobvoi.com

Torchain ⭐ 15

WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)

MIMO platform for advanced communications and PNT applications

Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.

Speech System Zh ⭐ 14

该功能包意在结合ROS框架开发出具有中文语音交互功能的语音交互系统 -- hntea-hong

Linto Agent ⭐ 14

LinTO platform services stack deployment tool for Docker Swarm cluster

Ovos Stt Plugin Vosk ⭐ 14

vosk STT plugin for mycroft

Powershellsamples ⭐ 14

Persianspeech ⭐ 14

Persian ASR dataset

WisCore , Openwrt, Amazon AVS , Amazon Alexa Module ,IoT Gateway Module

Korean Speech Recognition Quartznet ⭐ 14

Jasper 기반 양자화된 모델인 Quartznet 한국어 음성인식

A merged version of multiple open-source German speech datasets.

Cs224s Deepspeech ⭐ 14

CS224S Course Project

Learning_to_adapt ⭐ 14

Coordinate-wise meta-learner for speaker adaptation of ASR models.

Asr Repr Analysis ⭐ 13

Wav2vec2 Live Japanese Translator ⭐ 13

real time japanese speech recognition translator using wav2vec2

Asr Nepali Using Cnn Bilstm Resnet ⭐ 13

Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet - IEEE (ICICT - 2022)

Quartznet Asr ⭐ 13

Unityasr ⭐ 13

Automatic Speech Recognition in Unity.

Asr_demo ⭐ 13

语音识别API，分实时语音和长语音离线上传识别，支持中英文等多达100个国家的语言实时转写和同声传译

Spokestack Tray Android ⭐ 13

A UI component that makes it easy to add voice interaction to your app.

Whisper Finetune ⭐ 13

Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.

German Asr Lm Tools ⭐ 13

Crawling and creating a German language model resource

Kaldi Alligner ⭐ 13

scripts to align a given wave to its transcription using trained models by Kaldi

Openai Whisper Microservice ⭐ 13

This is an OpenAI Whisper automatic speech recognition microservice

Kbase Media ⭐ 12

视频、音频、图片内容识别、语音转写、语音合成 / easy convert video audio image to text, and revert text to audio(base64)

Mimic Enhance ⭐ 12

Speech enhancement using mimic loss

Rokid Webhook Hass ⭐ 12

rokid webhook component for Home Assistant (若琪HA组件)

Arabic Speech Recognition ⭐ 12

This repository contains my attempt to use two famous speech recognition frameworks (Kaldi, CMU Sphinx4) for Arabic Language using the publicly-available dataset "Arabic Corpus of Isolated Words"

Lvterminal ⭐ 12

Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)

An advance kaldi wrapper for Pyhton

Whisper_android ⭐ 12

Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android

Persian Ctc Segmentation ⭐ 12

Persian Speech Segmentation by CTC-segmentation Method

Media Annotator ⭐ 12

Web-based annotation tool for media data. The easiest way to create you own media dataset.

Gujarati_speech_recognition ⭐ 12

Offline speech recognition for Gujarati Language.

Framework for Deep Speech Recognition

Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.

Seaco Paraformer ⭐ 12

Source code, model links and open test sets for paper SeACo-Paraformer.

End To End Speech Recognition Models ⭐ 11

PyTorch implementation of automatic speech recognition models.

A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech Recognition (ASR).

Pansori Tedxkr Corpus ⭐ 11

Korean ASR Corpus generated from TEDx talks

Uyghur Asr Ctc ⭐ 11

Speech Recognition for Uyghur using deep learning

Twilio Asr Realtime Dashboard ⭐ 11

Twilio ASR and Intent Realtime Dashboard

Arsenalpython ⭐ 11

python 开发的兵器库. 收藏内容包括参考代码，实验，培训资料等

Ispeech Speech Recognition Asr Voice Recognition.js ⭐ 11

iSpeech's open source javascript SDK for speech recognition (ASR) API, enables you to easily create Web applications using iSpeech freeform, command or custom statistical language models. The speech recognition API powering this speech recognition SDK supports nearly 30 languages and accents. The acoustic models are based on huge amounts of low and high quality hand labeled audio data (millions of utterances). iSpeech is a viable alternative to Google ASR (Web Speech API), which only includes V

Pzh Py Speech ⭐ 11

A tiny audio speech (.wav) utility tool (GUI) based on Python2.7+wxPython4.0+PyAudio+Matplotlib+SpeechRec | 痞子衡语音处理助手，一款支持多引擎的wav格式语音处理小工具（音频录播与波形显示，语音识别，文语合成

ruby bindings for the karotz rest api

Lattice_combination ⭐ 11

Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices

Prayer Times ⭐ 11

a PHP package to calculate prayer times

Docker Py Kaldi Asr And Model ⭐ 11

STT Service based on Kaldi ASR

This repository is the official implementation of "Unimodal Aggregation for CTC-based Speech Recognition".

13.3 Hours Chinese Mandarin Synthesis Corpus Female Emotional ⭐ 11

Chinese Mandarin Synthesis Corpus-Female/Emotional

fastACI toolbox: the MATLAB toolbox for investigating auditory perception using reverse correlation.

Automatic_speech_recognition_with_multi_models ⭐ 10

A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popular cells (most are LSTM and its variants) and models (unidirectioanl RNN, bidirectional RNN, ResNet and so on). Moreover, you are welcome to play with self-defined cells or models.

Spoken Language Translation System

Etos Keywordspotting ⭐ 10

PyTorch implementations of neural network models for keyword spotting

L4 R3: ASR Cortex-R5 LTE Cat.1 SoC (ASR1601/ASR1603/ASR3601)

An automatic speech recognition environment for Icelandic based on Kaldi

Sincnet_adapt ⭐ 10

Raw waveform adaptation with SincNet

Albanian Asr ⭐ 10

This project is an AI-based transcription tool for the Albanian language. The tool is designed to automatically transcribe Albanian speech to text using Python.

Torgo_asr ⭐ 10

A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech

Defending Neural Backdoors Via Generative Distribution Modeling ⭐ 10

The code is for our NeurIPS 2019 paper: https://arxiv.org/abs/1910.04749

Lattice Rescore ⭐ 10

Cnn Speech Recognition ⭐ 10

Chinese Asr ⭐ 10

Chinese-ASR built on kaldi

Automatic speech recognition using neural networks

Corr Seq Labeling ⭐ 10

Code for paper "Joint Learning of Correlated Sequence Labelling Tasks Using Bidirectional Recurrent Neural Networks"

Smartspeech ⭐ 10

SmartSpeech — это сервис для синтеза и распознавания речи

Imagingapfs ⭐ 9

How to make a disk image of APFS container, then restore it.

Related Searches

Python Asr (347)

Speech Recognition Asr (250)

301-400 of 542 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.