Awesome Open Source

Search results for automatic speech recognition

97 search results found

Wenet ⭐ 3,717

Production First and Production Ready End-to-End Speech Recognition Toolkit

Awesome Speech Recognition Speech Synthesis Papers ⭐ 2,869

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Automatic_speech_recognition ⭐ 2,743

End-to-end Automatic Speech Recognition for Mandarin and English in Tensorflow

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

Whisper Asr Webservice ⭐ 1,317

OpenAI Whisper ASR Webservice API

Pororo ⭐ 1,081

PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

Tensorflowasr ⭐ 890

⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supports languages that can use characters or subwords

Open_stt ⭐ 671

Whispering ⭐ 597

Streaming transcriber with whisper

Cheetah ⭐ 537

On-device streaming speech-to-text engine powered by deep learning

Awesome Kaldi ⭐ 478

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

Neural_sp ⭐ 466

End-to-end ASR/LM implementation with PyTorch

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

Tensorflowasr ⭐ 434

A project dedicated to bringing CPU/on-device model performance close to that of GPU models, with a real-time factor (RTF) on CPU of less than…

Leopard ⭐ 390

On-device speech-to-text engine powered by deep learning

Huggingsound ⭐ 357

HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools

Whisper Youtube ⭐ 282

🔉 Youtube Videos Transcription with OpenAI's Whisper

Tensorflow_end2end_speech_recognition ⭐ 275

End-to-End speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)

Speech_dataset ⭐ 229

The dataset of Speech Recognition

Awesome Large Audio Models ⭐ 207

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

Hey Jetson ⭐ 189

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

Sova Asr ⭐ 149

SOVA ASR (Automatic Speech Recognition)

Deep_avsr ⭐ 138

A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.

🙊 software for creating speech recognition models.

Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.

Fast Rir ⭐ 122

This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.

Automatic Speech Recognition ⭐ 116

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

Mongolian Speech Recognition ⭐ 86

Mongolian speech recognition with PyTorch

Zerospeech Tts Without T ⭐ 79

A Pytorch implementation for the ZeroSpeech 2019 challenge.

Wav2letter ⭐ 70

Speech recognition model based on a FAIR research paper, built using PyTorch.

Viet Asr ⭐ 65

VietASR - Vietnamese Automatic Speech Recognition

Thonburian Whisper ⭐ 55

A repository for Thonburian Whisper: An Automatic Speech Recognition model for Thai fine-tuned on Whisper. Try demo on Huggingface space:

Transcribing visual or audio materials into text

A Polymer 3+ webcomponent / button for doing speech recognition

Auto Subtitled Video Generator ⭐ 49

Input a YouTube video link or upload a video file and get a video with subtitles.

Go Subgen ⭐ 49

Automatically generate subtitles for your media using whisper.cpp via webhooks with support for Radarr & Sonarr

Asrecognition ⭐ 47

ASRecognition: just an easy-to-use library for Automatic Speech Recognition.

Automatic Speech Recognition ⭐ 43

Automatic Speech Recognition using Tensorflow

Wav2Vec for speech recognition, classification, and audio classification

Hf Experiments ⭐ 37

Experiments with Hugging Face 🔬 🤗

Cif Pytorch ⭐ 36

[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).

Pythaiasr ⭐ 35

Python Thai Automatic Speech Recognition

Translating Synthetic RIRs to Real RIRs

Automatic_speech_recognition ⭐ 33

Vietnamese Automatic Speech Recognition

Augmenting Room Impulse Response

Speech Recognition ⭐ 31

End-to-End Speech Recognition using Neural Networks.

Automatic Speech Recognition ⭐ 30

End-to-End Speech Recognition Using Tensorflow

Kenlm Training ⭐ 27

Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2

Ai Engine ⭐ 25

Easy to use Multi-Provider ASR/Speech To Text and NLP engine

Bembaspeech ⭐ 25

This is an ASR corpus for the Bemba language. It contains read speech from diverse publicly available Bemba sources: literature books, radio/TV show transcripts, YouTube video transcripts, and online sources. The corpus has 14,438 utterances totaling over 24 hours of speech.

Asr Corpus Creator ⭐ 24

This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.

Best Rq Pytorch ⭐ 20

Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.

PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)

Speech To Windows Input ⭐ 19

Perform speech-to-text (STT/ASR) with Azure speech service and simulate keyboard input of the recognized text; supports English, Chinese, Japanese, and more

Telegram bot with ASR

Kaldi Long Audio Alignment ⭐ 17

Long audio alignment using Kaldi

Wav2vec4bp ⭐ 16

Wav2vec resources and models for Brazilian Portuguese

Fast Seamlessm4t Onnx ⭐ 16

ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation

Openai_whisper_asr ⭐ 15

A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models

Wave2vec Recognize Docker ⭐ 15

Wave2vec 2.0 Recognize pipeline

Linto Agent ⭐ 14

LinTO platform services stack deployment tool for Docker Swarm cluster

Ovos Stt Plugin Vosk ⭐ 14

vosk STT plugin for mycroft

Quartznet Asr ⭐ 13

Asr_demo ⭐ 13

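Speech recognition API offering real-time recognition and offline upload recognition of long audio; supports real-time transcription and simultaneous interpretation for the languages of up to 100 countries, including Chinese and English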

Openai Whisper Microservice ⭐ 13

This is an OpenAI Whisper automatic speech recognition microservice

Wav2vec2 Live Japanese Translator ⭐ 13

Real-time Japanese speech recognition translator using wav2vec2

Turkish Speech To Text ⭐ 13

Fine-tuning for automatic speech recognition on low-resource languages with character-based CTC model

Learn Tensorflow ⭐ 12

Make Smart Things with TensorFlow

Gan_harmonized_with_hmms ⭐ 12

Code: Completely Unsupervised Speech Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Models

Whisper_android ⭐ 12

Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android

Arabic Speech Recognition ⭐ 12

This repository contains my attempt to use two famous speech recognition frameworks (Kaldi, CMU Sphinx4) for Arabic Language using the publicly-available dataset "Arabic Corpus of Isolated Words"

Kaldi_helpers ⭐ 12

🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.

A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech Recognition (ASR).

Sharif Emotional Speech Dataset ⭐ 11

A large-scale validated database for Persian speech emotion detection.

Uyghur Asr Ctc ⭐ 11

Speech Recognition for Uyghur using deep learning

Vad Sli Asr ⭐ 11

A pipeline to isolate and transcribe one language in mixed-language speech

Official PyTorch implementation of SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy Minimization (INTERSPEECH 2023 Oral Presentation)

Automatic_speech_recognition_with_multi_models ⭐ 10

A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popular cells (most are LSTM and its variants) and models (unidirectional RNN, bidirectional RNN, ResNet and so on). Moreover, you are welcome to play with self-defined cells or models.

Automatic speech recognition using neural networks

Youtube Asr Crawler ⭐ 10

Audionet ⭐ 10

A deep model for speech recognition via Keras(front_end) and TensorFlow(back_end).

Personalized_asr ⭐ 9

[MTAP] Official implementation: A mechanism for personalized Automatic Speech Recognition for less frequently spoken languages: the Greek case

Bbc Speech Segmenter ⭐ 9

A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.

Integer-only Zero-shot Quantization for Efficient Speech Recognition

Transcribe All The Things™ is a CLI for creating and managing speech-to-text transcripts.

Detect Segment Cough ⭐ 8

A python model to detect and segment coughs, forked from coughvid's repo

🐍📦 Rapidly calculate and analyze the Word Error Rate (WER) with this powerful yet lightweight Python package.

Dljeju2018coderepoasr ⭐ 8

Details of my work on using GANs for speech synthesis to improve speech recognition (ASR) accuracy

Kaldi For Dummies ⭐ 8

This is the repository for my version of Kaldi for Dummies example.

Quartznet Pytorch ⭐ 7

Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]

Ntspeechrecognition ⭐ 7

NTSpeechRecognition is an iOS/macOS framework, written in Objective-C, providing speech recognition functionality. PocketSphinx is used for decoding. (Keyword spotting, JSGF Grammar, NGram)

Kaust Whisper Adapter ⭐ 7

INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!

Kaldi Asr Aws ⭐ 7

This code repo accompanies the Medium article on setting up Kaldi on AWS

Matlab_feat ⭐ 6

Functions for creating speech features in MATLAB.

Speech Data Augmentation ⭐ 6

Speech dataset processing and augmentation (add background noise and change speech pitch) for speech recognition

End To End Asr Transformer ⭐ 6

An end to end ASR Transformer model training repo

Whisper Youtube ⭐ 5

This repository guides you through automatically generating YouTube transcriptions using OpenAI's Whisper
