Awesome Open Source

Programming Languages

Search results for asr

542 search results found

Rustfst ⭐ 134

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Rasa Voice Interface ⭐ 131

🎤 A simple web interface for building voice assistants with Rasa

Asr Study ⭐ 131

Implementation of all-neural speech recognition systems using Keras and Tensorflow

Awesome Speech ⭐ 131

this is a treasure-house of speech

Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"

Spokestack Python ⭐ 124

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.

Keras Kaldi ⭐ 124

Keras Interface for Kaldi ASR

Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.

Multimodal Speech Emotion ⭐ 122

TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18

Elevateaijavasdk ⭐ 121

Java SDK for ElevateAI

Cv Dataset ⭐ 120

Metadata and versioning details for the Common Voice dataset

Elevateaidotnetsdk ⭐ 115

.Net core 6 SDK for ElevateAI

Asr_syllable ⭐ 112

基于卷积神经网络的语音识别声学模型的研究

Code for end-to-end ASR with neural networks, build with TensorFlow

Elevateaipythonsdk ⭐ 111

ElevateAI - Speech-to-text API Python SDK

Deepgram Python Sdk ⭐ 110

Official Python SDK for Deepgram's automated speech recognition APIs.

Obsidian Transcription ⭐ 107

Obsidian plugin to create high-quality transcriptions from markdown linked audio files

Whisper Openvino ⭐ 107

openvino version of openai/whisper

Sepia Stt Server ⭐ 105

SEPIA server to support open-source speech recognition via WebSocket connection.

Las_mandarin_pytorch ⭐ 104

Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)

Chatgpt Web ⭐ 100

ChatGPT web application, use OpenAI official API. ChatGPT 网页应用，支持多对话、海量提示词、PWA、ASR、TTS

Pytorch Asr ⭐ 100

ASR with PyTorch

Rnn Transducer ⭐ 100

MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks

Listen Attend Spell ⭐ 98

A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.

Awesome Russian Speech ⭐ 97

Russian speech technology links

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

Rokid 语音开放平台，包含技能开发、语音设备接入及智能家居接入的文档、SDK 及示例代码

Whisper Auto Transcribe ⭐ 91

Auto transcribe tool based on whisper

Aind Vui Capstone ⭐ 90

AIND Term 2 -- VUI Capstone Project

Speech Corpus Collection ⭐ 87

A Collection of Speech Corpus for ASR and TTS

Mongolian Speech Recognition ⭐ 86

Mongolian speech recognition with PyTorch

Zasr_tensorflow ⭐ 85

Mandarin ASR system based on tensorflow

SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition

Asr For Chinese Pipeline ⭐ 85

Google Summer of Code 2018 Project: Automatic Speech Recognition for Speech-to-Text on Chinese

Pytorch Edit Distance ⭐ 85

Levenshtein edit-distance on PyTorch and CUDA

Speech_course ⭐ 83

YSDA course in Speech Processing.

Deepgram Js Sdk ⭐ 81

Official JavaScript SDK for Deepgram's automated speech recognition APIs.

A release version for https://github.com/athena-team/athena

Kaldi Serve ⭐ 79

Server framework for Kaldi ASR Toolkit

Zerospeech Tts Without T ⭐ 79

A Pytorch implementation for the ZeroSpeech 2019 challenge.

PyTorch Implementations for End-to-End Automatic Speech Recognition

Spinorama ⭐ 78

A library to display and compare spinorama (speakers measurements) graphs.

Whispertimesync ⭐ 77

Synchronize Whisper's timestamps over an existing accurate transcription

Adhan Dart ⭐ 76

Adhan for Dart / Muslim Prayer Times Library. Now retrieving Prayer time in Dart easier than ever.

Asr Wav2vec Finetune ⭐ 76

⚡ Finetune Wa2vec 2.0 For Speech Recognition

AGI-server voice recognizer for #Asterisk

Tools for ASR Corpus Generation from Online Video

Indian Accent Speech Recognition ⭐ 73

Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech

Ktspeechcrawler ⭐ 73

Automatically constructing corpus for automatic speech recognition from YouTube videos

百度云流式语音识别客户端 SDK

Cgmm Mvdr ⭐ 71

Implementation of the CGMM-MVDR beamforming

Wav2letter ⭐ 70

Speech Recognition model based off of FAIR research paper built using Pytorch.

Threathunt ⭐ 70

ThreatHunt is a PowerShell repository that allows you to train your threat hunting skills.

Time delay neural network (TDNN) implementation in Pytorch using unfold method

Punctuationmodel ⭐ 69

中文标点符号模型，可以给文本添加标点符号。

A simple asr translator powered by avernakis react.

Vakyansh Wav2vec2 Experimentation ⭐ 67

Repository containing experimentation platform on how to train, infer on wav2vec2 models.

Cgmm_mvdr ⭐ 66

Leopard Chat Ui Teneo ⭐ 65

Leopard Chat UI - A Teneo Chat Client based on Vue and Vuetify

Viet Asr ⭐ 65

VietASR - Vietnamese Automatic Speech Recognition

Azure Stack Hub Foundation Core ⭐ 64

The Azure Stack Hub Foundation Core are a set of materials (PowerPoint presentations, workshops, links to videos, and tools) aiming to provide Azure Stack Hub Operators the foundational materials required to ramp-up and understand the basics of operating Azure Stack Hub, as well as accelerate their operational practices.

Cloud Asr ⭐ 63

Cloud-based Automatic Speech Recognition (ASR) platform and a public ASR webservice.

Simple_diarizer ⭐ 63

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

Syn Speech ⭐ 62

Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework

Eesen For Thchs30 ⭐ 62

ASR for Chinese Mandarin

Aaltoasr ⭐ 61

Aalto Automatic Speech Recognition tools

Squeezeformer ⭐ 60

PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)

Asr_benchmark ⭐ 60

Program to benchmark various speech recognition APIs

Transfusion Asr ⭐ 59

Transcribing Speech with Multinomial Diffusion, training code and models.

Avsr Tf1 ⭐ 59

Audio-Visual Speech Recognition using Sequence to Sequence Models

Asr_word ⭐ 59

采用端到端方法构建声学模型，以字为建模单元，采用DCNN-CTC网络结构。

Pb_chime5 ⭐ 58

Speech enhancement system for the CHiME-5 dinner party scenario

Alimeeting ⭐ 57

The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario.

ADvISER is a flexible framework to encourage task-oriented dialog system research & development

Sepia Html Client App ⭐ 55

Application to communicate with SEPIA via browser, iOS and Android. Works as chat messenger with personal-assistant, ASR and TTS integration.

Kaldi Yesno Tutorial ⭐ 55

Tutorial on Kaldi for Brandeis ASR course

تفريغ المواد المرئية أو المسموعة إلى نصوص

maracas is a library for corrupting audio files with additive and convolutive noise.

Asr Ios Local ⭐ 53

基于kaldi的ios本地语音识别（本地实时流）Kaldi-based ios native speech recognition (local real-time streaming)

Vosk Asterisk ⭐ 52

Speech Recognition in Asterisk with Vosk Server

Yoruba Text ⭐ 51

Yorùbá language training text for NLP, ASR and TTS tasks

Speech Transformer Tf2.0 ⭐ 51

transformer for ASR-systerm (via tensorflow2.0)

The RWTH ASR Toolkit.

Opensnips ⭐ 50

Open source projects related to Snips https://snips.ai/.

Bertpunc ⭐ 49

SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model

Keras Sincnet ⭐ 49

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Docker Whisperx ⭐ 49

Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)

Alex Asr ⭐ 49

Online decoder for Kaldi NNET2 and GMM speech recognition models with Python bindings.

Go Subgen ⭐ 49

Automatically generate subtitles for your media using whisper.cpp via webhooks with support for Radarr & Sonarr

Spokestack Android ⭐ 49

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.

Azure Pricer ⭐ 47

Asrecognition ⭐ 47

ASRecognition: just an easy-to-use library for Automatic Speech Recognition.

Voice Privacy Challenge 2020 ⭐ 47

Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/

React Native Spokestack ⭐ 46

Spokestack: give your React Native app a voice interface!

Voice Privacy Challenge 2022 ⭐ 45

Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software

Commonvoice Utils ⭐ 45

Linguistic processing for Common Voice

Athena Decoder ⭐ 44

Awesome Ai List Guide ⭐ 44

The guide of awesome list about AI

Related Searches

Python Asr (347)

Speech Recognition Asr (250)

101-200 of 542 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.