Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for asr
asr
x
542 search results found
Rustfst
⭐
134
Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Rasa Voice Interface
⭐
131
🎤 A simple web interface for building voice assistants with Rasa
Asr Study
⭐
131
Implementation of all-neural speech recognition systems using Keras and Tensorflow
Awesome Speech
⭐
131
this is a treasure-house of speech
Hero
⭐
125
Research code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
Spokestack Python
⭐
124
Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.
Keras Kaldi
⭐
124
Keras Interface for Kaldi ASR
At16k
⭐
123
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
Multimodal Speech Emotion
⭐
122
TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18
Elevateaijavasdk
⭐
121
Java SDK for ElevateAI
Ner
⭐
121
Cv Dataset
⭐
120
Metadata and versioning details for the Common Voice dataset
Elevateaidotnetsdk
⭐
115
.Net core 6 SDK for ElevateAI
Asr_syllable
⭐
112
基于卷积神经网络的语音识别声学模型的研究
Nabu
⭐
112
Code for end-to-end ASR with neural networks, build with TensorFlow
Elevateaipythonsdk
⭐
111
ElevateAI - Speech-to-text API Python SDK
Deepgram Python Sdk
⭐
110
Official Python SDK for Deepgram's automated speech recognition APIs.
Obsidian Transcription
⭐
107
Obsidian plugin to create high-quality transcriptions from markdown linked audio files
Whisper Openvino
⭐
107
openvino version of openai/whisper
Sepia Stt Server
⭐
105
SEPIA server to support open-source speech recognition via WebSocket connection.
Las_mandarin_pytorch
⭐
104
Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)
Chatgpt Web
⭐
100
ChatGPT web application, use OpenAI official API. ChatGPT 网页应用,支持多对话、海量提示词、PWA、ASR、TTS
Pytorch Asr
⭐
100
ASR with PyTorch
Rnn Transducer
⭐
100
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
Listen Attend Spell
⭐
98
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
Awesome Russian Speech
⭐
97
Russian speech technology links
Ctc Asr
⭐
92
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Docs
⭐
91
Rokid 语音开放平台,包含技能开发、语音设备接入及智能家居接入的文档、SDK 及示例代码
Whisper Auto Transcribe
⭐
91
Auto transcribe tool based on whisper
Aind Vui Capstone
⭐
90
AIND Term 2 -- VUI Capstone Project
Speech Corpus Collection
⭐
87
A Collection of Speech Corpus for ASR and TTS
Mongolian Speech Recognition
⭐
86
Mongolian speech recognition with PyTorch
Zasr_tensorflow
⭐
85
Mandarin ASR system based on tensorflow
Sms_wsj
⭐
85
SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition
Asr For Chinese Pipeline
⭐
85
Google Summer of Code 2018 Project: Automatic Speech Recognition for Speech-to-Text on Chinese
Pytorch Edit Distance
⭐
85
Levenshtein edit-distance on PyTorch and CUDA
Speech_course
⭐
83
YSDA course in Speech Processing.
Deepgram Js Sdk
⭐
81
Official JavaScript SDK for Deepgram's automated speech recognition APIs.
Athena
⭐
81
A release version for https://github.com/athena-team/athena
Kaldi Serve
⭐
79
Server framework for Kaldi ASR Toolkit
Zerospeech Tts Without T
⭐
79
A Pytorch implementation for the ZeroSpeech 2019 challenge.
E2e Asr
⭐
79
PyTorch Implementations for End-to-End Automatic Speech Recognition
Spinorama
⭐
78
A library to display and compare spinorama (speakers measurements) graphs.
Whispertimesync
⭐
77
Synchronize Whisper's timestamps over an existing accurate transcription
Adhan Dart
⭐
76
Adhan for Dart / Muslim Prayer Times Library. Now retrieving Prayer time in Dart easier than ever.
Asr Wav2vec Finetune
⭐
76
⚡ Finetune Wa2vec 2.0 For Speech Recognition
Voicer
⭐
75
AGI-server voice recognizer for #Asterisk
Pansori
⭐
74
Tools for ASR Corpus Generation from Online Video
Indian Accent Speech Recognition
⭐
73
Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech
Ktspeechcrawler
⭐
73
Automatically constructing corpus for automatic speech recognition from YouTube videos
Pie
⭐
72
百度云流式语音识别客户端 SDK
Cgmm Mvdr
⭐
71
Implementation of the CGMM-MVDR beamforming
Wav2letter
⭐
70
Speech Recognition model based off of FAIR research paper built using Pytorch.
Threathunt
⭐
70
ThreatHunt is a PowerShell repository that allows you to train your threat hunting skills.
Tdnn
⭐
70
Time delay neural network (TDNN) implementation in Pytorch using unfold method
Punctuationmodel
⭐
69
中文标点符号模型,可以给文本添加标点符号。
Echo
⭐
68
A simple asr translator powered by avernakis react.
Vakyansh Wav2vec2 Experimentation
⭐
67
Repository containing experimentation platform on how to train, infer on wav2vec2 models.
Cgmm_mvdr
⭐
66
Leopard Chat Ui Teneo
⭐
65
Leopard Chat UI - A Teneo Chat Client based on Vue and Vuetify
Viet Asr
⭐
65
VietASR - Vietnamese Automatic Speech Recognition
Azure Stack Hub Foundation Core
⭐
64
The Azure Stack Hub Foundation Core are a set of materials (PowerPoint presentations, workshops, links to videos, and tools) aiming to provide Azure Stack Hub Operators the foundational materials required to ramp-up and understand the basics of operating Azure Stack Hub, as well as accelerate their operational practices.
Cloud Asr
⭐
63
Cloud-based Automatic Speech Recognition (ASR) platform and a public ASR webservice.
Simple_diarizer
⭐
63
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Syn Speech
⭐
62
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Eesen For Thchs30
⭐
62
ASR for Chinese Mandarin
Aaltoasr
⭐
61
Aalto Automatic Speech Recognition tools
Squeezeformer
⭐
60
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)
Asr_benchmark
⭐
60
Program to benchmark various speech recognition APIs
Transfusion Asr
⭐
59
Transcribing Speech with Multinomial Diffusion, training code and models.
Avsr Tf1
⭐
59
Audio-Visual Speech Recognition using Sequence to Sequence Models
Asr_word
⭐
59
采用端到端方法构建声学模型,以字为建模单元,采用DCNN-CTC网络结构。
Pb_chime5
⭐
58
Speech enhancement system for the CHiME-5 dinner party scenario
Alimeeting
⭐
57
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to provide participants with baseline systems for speech recognition and speaker diarization in conference scenario.
Adviser
⭐
56
ADvISER is a flexible framework to encourage task-oriented dialog system research & development
Sepia Html Client App
⭐
55
Application to communicate with SEPIA via browser, iOS and Android. Works as chat messenger with personal-assistant, ASR and TTS integration.
Kaldi Yesno Tutorial
⭐
55
Tutorial on Kaldi for Brandeis ASR course
Tafrigh
⭐
55
تفريغ المواد المرئية أو المسموعة إلى نصوص
Maracas
⭐
54
maracas is a library for corrupting audio files with additive and convolutive noise.
Asr Ios Local
⭐
53
基于kaldi的ios本地语音识别(本地实时流)Kaldi-based ios native speech recognition (local real-time streaming)
Vosk Asterisk
⭐
52
Speech Recognition in Asterisk with Vosk Server
Yoruba Text
⭐
51
Yorùbá language training text for NLP, ASR and TTS tasks
Speech Transformer Tf2.0
⭐
51
transformer for ASR-systerm (via tensorflow2.0)
Rasr
⭐
51
The RWTH ASR Toolkit.
Opensnips
⭐
50
Open source projects related to Snips https://snips.ai/.
Bertpunc
⭐
49
SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model
Keras Sincnet
⭐
49
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Docker Whisperx
⭐
49
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)
Alex Asr
⭐
49
Online decoder for Kaldi NNET2 and GMM speech recognition models with Python bindings.
Go Subgen
⭐
49
Automatically generate subtitles for your media using whisper.cpp via webhooks with support for Radarr & Sonarr
Spokestack Android
⭐
49
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Asr
⭐
48
ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.
Azure Pricer
⭐
47
Asrecognition
⭐
47
ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
Voice Privacy Challenge 2020
⭐
47
Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/
React Native Spokestack
⭐
46
Spokestack: give your React Native app a voice interface!
Voice Privacy Challenge 2022
⭐
45
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
Commonvoice Utils
⭐
45
Linguistic processing for Common Voice
Athena Decoder
⭐
44
Awesome Ai List Guide
⭐
44
The guide of awesome list about AI
Related Searches
Python Asr (347)
Speech Recognition Asr (250)
101-200 of 542 search results
< Previous
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.