Awesome Open Source

Programming Languages

Search results for speech recognition asr

speech-recognition x

209 search results found

Asr Ios Local ⭐ 53

基于kaldi的ios本地语音识别（本地实时流）Kaldi-based ios native speech recognition (local real-time streaming)

Vosk Asterisk ⭐ 52

Speech Recognition in Asterisk with Vosk Server

Speech Transformer Tf2.0 ⭐ 51

transformer for ASR-systerm (via tensorflow2.0)

Opensnips ⭐ 50

Open source projects related to Snips https://snips.ai/.

Bertpunc ⭐ 49

SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model

Docker Whisperx ⭐ 49

Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)

Keras Sincnet ⭐ 49

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Spokestack Android ⭐ 49

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.

Asrecognition ⭐ 47

ASRecognition: just an easy-to-use library for Automatic Speech Recognition.

React Native Spokestack ⭐ 46

Spokestack: give your React Native app a voice interface!

Voice Privacy Challenge 2022 ⭐ 45

Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software

Whispers2t ⭐ 44

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Enginer

Awesome Asr Contextualization ⭐ 42

A curated list of awesome papers on contextualizing E2E ASR outputs

Py Nltools ⭐ 42

A collection of basic python modules for spoken natural language processing

Iflytek_awaken_asr ⭐ 40

use iflytek's technology to realize awaken and order recognition

Transfer Learning Asr ⭐ 40

Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017

Open_stt_e2e ⭐ 39

PyTorch end-to-end speech recognition

Yandex Speech ⭐ 38

node.js module for Yandex speech systems (ASR & TTS)

Cif Pytorch ⭐ 36

[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).

Conformer Athena ⭐ 33

Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.

Baiduasrandtts ⭐ 32

Using Baidu API. ASR: Automatic Speech Recognition;TTS: Text To Speech; 百度语音识别、语音合成API使用。

Voice Recognition Ua ⭐ 32

Training scripts for Speech-To-Text models for Ukrainian language

A mini, simple, and fast end-to-end automatic speech recognition toolkit.

Greenkey Asrtoolkit ⭐ 31

A collection of useful tools for handling speech recognition data

Voice Assistant Chatgpt ⭐ 31

Voice Assistant based on Whisper ASR and ChatGPT API

Linto Stt ⭐ 30

An automatic speech recognition API

Automatic Speech Recognition ⭐ 30

End-to-End Speech Recognition Using Tensorflow

Multilingual Asr ⭐ 29

Multilingual Speech Recognition for Indonesian Languages

Lightning Asr ⭐ 29

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

A pytorch based end2end speech recognition system.

React Native Vosk ⭐ 29

Speech recognition module for react native using Vosk library

Pytorch_mlp_for_asr ⭐ 27

This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.

Program for speech recognition using the Google Speech API, voice commands, control your computer.

Spokestack Ios ⭐ 26

Spokestack: give your iOS app a voice interface!

Ai Engine ⭐ 25

Easy to use Multi-Provider ASR/Speech To Text and NLP engine

Asr Corpus Creator ⭐ 24

This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.

Speech Recognition Experiments ⭐ 24

Experiments to test different speech recognition systems for SEPIA Framework

Voice100 ⭐ 22

Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.

Meta Transfer Learning ⭐ 22

Implementation of meta-transfer-learning (ACL 2020)

Korean_asr ⭐ 22

Korean Automatic Speech Recognition

Keenasr Android Poc ⭐ 21

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Kaldi Br ⭐ 21

☕🇧🇷 Scripts para o Kaldi em Português Brasileiro

Opensource Voice Tools ⭐ 21

A repo listing known open source voice tools, ordered by where they sit in the voice stack

Speech Recognition Evaluation ⭐ 21

Evaluate results from ASR/Speech-to-Text quickly

PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)

Summerasr ⭐ 20

SummerAsr 是一个基于C++的可独立编译且几乎没有额外依赖库的本地中文语音识别器。 Summer Asr is a Chinese automatic speech recognize project written with C++ that can be easily built standalone without any depencency.

Transcrater ⭐ 19

An open-source tool for automatic speech recognition ASR quality estimation.

Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)

Automatic Speech Recognition For Speech Assessment Of Persian Preschool Children ⭐ 18

Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The COVID-19 pandemic has highlighted the necessity of online assessment for preschool children. One of the areas that should be tested is their ability to speak. Employing an Automatic Speech Recognition (ASR) system would not help since they are pre-trained on voices that differ from children's in terms of frequency and amplitude. Because most of these are pre-tr

Keras(Tensorflow) implementations of Automatic Speech Recognition

Listen Attend Spell V2 ⭐ 17

PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).

Speech Recognition ⭐ 17

SDKs and docs for Skit's speech to text service

Kaldi Long Audio Alignment ⭐ 17

Long audio alignment using Kaldi

Speechloop ⭐ 16

Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?

End To End Mandarin Asr ⭐ 15

End-to-end speech recognition on AISHELL dataset.

An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker commands

Vosk Model Ru Adaptation ⭐ 15

Ovos Stt Plugin Vosk ⭐ 14

vosk STT plugin for mycroft

A merged version of multiple open-source German speech datasets.

Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.

Korean Speech Recognition Quartznet ⭐ 14

Jasper 기반 양자화된 모델인 Quartznet 한국어 음성인식

Unityasr ⭐ 13

Automatic Speech Recognition in Unity.

Openai Whisper Microservice ⭐ 13

This is an OpenAI Whisper automatic speech recognition microservice

Whisper Finetune ⭐ 13

Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.

Asr Nepali Using Cnn Bilstm Resnet ⭐ 13

Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet - IEEE (ICICT - 2022)

Arabic Speech Recognition ⭐ 12

This repository contains my attempt to use two famous speech recognition frameworks (Kaldi, CMU Sphinx4) for Arabic Language using the publicly-available dataset "Arabic Corpus of Isolated Words"

Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.

Gujarati_speech_recognition ⭐ 12

Offline speech recognition for Gujarati Language.

Framework for Deep Speech Recognition

Whisper_android ⭐ 12

Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android

End To End Speech Recognition Models ⭐ 11

PyTorch implementation of automatic speech recognition models.

Ispeech Speech Recognition Asr Voice Recognition.js ⭐ 11

iSpeech's open source javascript SDK for speech recognition (ASR) API, enables you to easily create Web applications using iSpeech freeform, command or custom statistical language models. The speech recognition API powering this speech recognition SDK supports nearly 30 languages and accents. The acoustic models are based on huge amounts of low and high quality hand labeled audio data (millions of utterances). iSpeech is a viable alternative to Google ASR (Web Speech API), which only includes V

13.3 Hours Chinese Mandarin Synthesis Corpus Female Emotional ⭐ 11

Chinese Mandarin Synthesis Corpus-Female/Emotional

A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech Recognition (ASR).

This repository is the official implementation of "Unimodal Aggregation for CTC-based Speech Recognition".

Automatic speech recognition using neural networks

Lattice Rescore ⭐ 10

Automatic_speech_recognition_with_multi_models ⭐ 10

A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popular cells (most are LSTM and its variants) and models (unidirectioanl RNN, bidirectional RNN, ResNet and so on). Moreover, you are welcome to play with self-defined cells or models.

Semi Supervised Asr ⭐ 9

Asr_project ⭐ 9

This repository created for the NHN ASR hackathon competition.

Voice Tech Study ⭐ 9

语音识别语音前端处理语音合成语音转换等等语音技术的资料汇总

Deepspeech Pytorch ⭐ 8

Pytorch implementation for DeepSpeech 2.0

Simplespeechloop ⭐ 8

A very basic demonstration connecting speech recognition and text-to-speech

Kaggle Ai ⭐ 8

Categorize AI problems and record through kaggle, Google's data science website

Deepspeech Kabyle ⭐ 8

Automatic Speech Recognition (ASR) - Kabyle

React Native Spokestack Tray ⭐ 8

React Native component for adding Spokestack to a React Native app

Be_nlp_speech_resources ⭐ 8

Links to Belarusian NLP and Speech resources

Kaldi Avsr ⭐ 8

Kaldi-based audio-visual speech recognition

Taiwanese Whisper ⭐ 7

fine-tune Whipser model for Taiwanese speech recognition

Bilatticernn Confidence ⭐ 7

Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks https://arxiv.org/abs/1910.11933 or https://ieeexplore.ieee.org/document/9053264

A (not entirely working) stand-alone speech recognizer written in Common Lisp

Ce Optimizedloss ⭐ 7

Computes the MWER (minimum WER) Loss with beam search and negative sampling strategy.

Speech Adapters ⭐ 6

Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech understanding

Sentence Generator ⭐ 6

Randomly generate phrases, sentences, and queries. This can be used to generate test set for Automatic Speech Recognition (ASR).

Asr Mp3 Compression Aaes ⭐ 6

MP3 Compression, End-to-End Automatic Speech Recognition (ASR), Adversarial Noise

Speech Datasets For Asr ⭐ 6

Download speech datasets (English and non-English) for Automatic Speech Recognition

Whisper Finetuning Be ⭐ 6

Finetuning Whisper ASR model for Belarusian language

Full Lattice Search ⭐ 6

Full Text Search Over Probabilistic Lattices with Elasticsearch!

Related Searches

Python Speech Recognition (876)

Python Asr (347)

101-200 of 209 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.