Awesome Open Source
Awesome Open Source
Combined Topics
asr
x
Advertising
📦 10
All Projects
Application Programming Interfaces
📦 124
Applications
📦 192
Artificial Intelligence
📦 78
Blockchain
📦 73
Build Tools
📦 113
Cloud Computing
📦 80
Code Quality
📦 28
Collaboration
📦 32
Command Line Interface
📦 49
Community
📦 83
Companies
📦 60
Compilers
📦 63
Computer Science
📦 80
Configuration Management
📦 42
Content Management
📦 175
Control Flow
📦 213
Data Formats
📦 78
Data Processing
📦 276
Data Storage
📦 135
Economics
📦 64
Frameworks
📦 215
Games
📦 129
Graphics
📦 110
Hardware
📦 152
Integrated Development Environments
📦 49
Learning Resources
📦 166
Legal
📦 29
Libraries
📦 129
Lists Of Projects
📦 22
Machine Learning
📦 347
Mapping
📦 64
Marketing
📦 15
Mathematics
📦 55
Media
📦 239
Messaging
📦 98
Networking
📦 315
Operating Systems
📦 89
Operations
📦 121
Package Managers
📦 55
Programming Languages
📦 245
Runtime Environments
📦 100
Science
📦 42
Security
📦 396
Social Media
📦 27
Software Architecture
📦 72
Software Development
📦 72
Software Performance
📦 58
Software Quality
📦 133
Text Editors
📦 49
Text Processing
📦 136
User Interface
📦 330
User Interface Components
📦 514
Version Control
📦 30
Virtualization
📦 71
Web Browsers
📦 42
Web Servers
📦 26
Web User Interface
📦 210
The Top 57 Asr Open Source Projects
Categories
>
Machine Learning
>
Asr
Wukong Robot
⭐
2,768
🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,还可能是首个支持脑机交互的开源智能音箱项目。
Lingvo
⭐
2,218
Lingvo
Pytorch Kaldi
⭐
1,991
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Vosk Api
⭐
1,470
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Deepspeechrecognition
⭐
1,455
A Chinese Deep Speech Recognition System 包括基于深度学习的声学模型和基于深度学习的语言模型
Delta
⭐
1,404
DELTA is a deep learning based natural language and speech processing platform.
Espresso
⭐
817
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Silero Models
⭐
790
Silero Models: pre-trained speech-to-text, text-to-speech models and benchmarks made embarrassingly simple
Sincnet
⭐
784
SincNet is a neural architecture for efficiently processing raw audio samples.
Pykaldi
⭐
770
A Python wrapper for Kaldi
Eesen
⭐
745
The official repository of the Eesen project
Wenet
⭐
691
Production First and Production Ready End-to-End Speech Recognition Toolkit
Libreasr
⭐
639
💬 An On-Premises, Streaming Speech Recognition System
Open_stt
⭐
592
Open STT
Speech Transformer
⭐
574
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Athena
⭐
557
an open-source implementation of sequence-to-sequence based speech processing engine
Neural_sp
⭐
419
End-to-end ASR/LM implementation with PyTorch
Nmtpytorch
⭐
392
Sequence-to-Sequence Framework in PyTorch
Cheetah
⭐
390
On-device streaming speech-to-text engine powered by deep learning
Zamia Speech
⭐
389
Open tools and data for cloudless automatic speech recognition
Asr_theory
⭐
357
语音识别理论,论文和PPT
Tensorflow_end2end_speech_recognition
⭐
307
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
Vosk Server
⭐
297
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Vosk Android Demo
⭐
280
Offline speech recognition for Android with Vosk library.
Docker Kaldi Gstreamer Server
⭐
269
Dockerfile for kaldi-gstreamer-server.
Cn2an
⭐
261
📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
Zeroth
⭐
255
Kaldi-based Korean ASR (한국어 음성인식) open-source project
Chinese_text_normalization
⭐
252
Chinese text normalization for speech processing
Kerasdeepspeech
⭐
244
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Kospeech
⭐
209
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Edgedict
⭐
207
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Hms Ml Demo
⭐
195
HMS ML Demo provides an example of integrating Huawei ML Kit service into applications. This example demonstrates how to integrate services provided by ML Kit, such as face detection, text recognition, image segmentation, asr, and tts.
Asr Evaluation
⭐
191
Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
End2end Asr Pytorch
⭐
182
End-to-End Automatic Speech Recognition on PyTorch
Mrcp Plugin With Freeswitch
⭐
171
使用FreeSWITCH接受用户手机呼叫,通过UniMRCP Server集成讯飞开放平台(xfyun)插件将用户语音进行语音识别(ASR),并根据自定义业务逻辑调用语音合成(TTS),构建简单的端到端语音呼叫中心。
Py Kaldi Asr
⭐
157
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Speech To Text Russian
⭐
155
Проект для распознавания речи на русском языке на основе pykaldi.
Speecht
⭐
152
An opensource speech-to-text software written in tensorflow
Listen Attend Spell
⭐
149
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
Asr_audio_data_links
⭐
133
A list of publically available audio data that anyone can download for ASR or other speech activities
Asr_syllable
⭐
129
基于卷积神经网络的语音识别声学模型的研究
Pytorch Asr
⭐
125
ASR with PyTorch
Rnn Transducer
⭐
117
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
Tdnn
⭐
115
Time delay neural network (TDNN) implementation in Pytorch using unfold method
Bigcidian
⭐
114
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
E2e Asr
⭐
107
PyTorch Implementations for End-to-End Automatic Speech Recognition
Mongolian Speech Recognition
⭐
100
Mongolian speech recognition with PyTorch
Zerospeech Tts Without T
⭐
100
A Pytorch implementation for the ZeroSpeech 2019 challenge.
Ktspeechcrawler
⭐
97
Automatically constructing corpus for automatic speech recognition from YouTube videos
Wav2letter
⭐
79
Speech Recognition model based off of FAIR research paper built using Pytorch.
Voicer
⭐
74
AGI-server voice recognizer for #Asterisk
Asr_benchmark
⭐
72
Program to benchmark various speech recognition APIs
Openasr
⭐
72
A pytorch based end2end speech recognition system.
Syn Speech
⭐
57
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Asr
⭐
54
Keras Sincnet
⭐
47
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Asrgen
⭐
32
Attacking Speaker Recognition with Deep Generative Models
1-57 of 57 projects
Advertising
📦 10
All Projects
Application Programming Interfaces
📦 124
Applications
📦 192
Artificial Intelligence
📦 78
Blockchain
📦 73
Build Tools
📦 113
Cloud Computing
📦 80
Code Quality
📦 28
Collaboration
📦 32
Command Line Interface
📦 49
Community
📦 83
Companies
📦 60
Compilers
📦 63
Computer Science
📦 80
Configuration Management
📦 42
Content Management
📦 175
Control Flow
📦 213
Data Formats
📦 78
Data Processing
📦 276
Data Storage
📦 135
Economics
📦 64
Frameworks
📦 215
Games
📦 129
Graphics
📦 110
Hardware
📦 152
Integrated Development Environments
📦 49
Learning Resources
📦 166
Legal
📦 29
Libraries
📦 129
Lists Of Projects
📦 22
Machine Learning
📦 347
Mapping
📦 64
Marketing
📦 15
Mathematics
📦 55
Media
📦 239
Messaging
📦 98
Networking
📦 315
Operating Systems
📦 89
Operations
📦 121
Package Managers
📦 55
Programming Languages
📦 245
Runtime Environments
📦 100
Science
📦 42
Security
📦 396
Social Media
📦 27
Software Architecture
📦 72
Software Development
📦 72
Software Performance
📦 58
Software Quality
📦 133
Text Editors
📦 49
Text Processing
📦 136
User Interface
📦 330
User Interface Components
📦 514
Version Control
📦 30
Virtualization
📦 71
Web Browsers
📦 42
Web Servers
📦 26
Web User Interface
📦 210