Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for kaldi
kaldi
x
306 search results found
Awesome Pytorch List
⭐
14,715
A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
Kaldi
⭐
13,453
kaldi-asr/kaldi is the official location of the Kaldi project.
Espnet
⭐
7,563
End-to-End Speech Processing Toolkit
Vosk Api
⭐
6,633
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Pytorch Kaldi
⭐
2,138
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Rhasspy
⭐
2,036
Offline private voice assistant for many human languages
Dragonfire
⭐
1,294
the open-source virtual assistant for Ubuntu based Linux distributions
Gentle
⭐
1,225
gentle forced aligner
Montreal Forced Aligner
⭐
1,124
Command line utility for forced alignment using Kaldi
Pykaldi
⭐
954
A Python wrapper for Kaldi
Botium Speech Processing
⭐
938
Botium Speech Processing
Espresso
⭐
930
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Kaldi Gstreamer Server
⭐
865
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
Alibaba Mit Speech
⭐
860
Alibaba speech technology
Vosk Server
⭐
802
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Lhotse
⭐
794
Tools for handling speech data in machine learning projects.
Speech Transformer
⭐
714
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Eesen
⭐
673
The official repository of the Eesen project
Asv Subtools
⭐
561
An Open Source Tools for Speaker Recognition
Awesome Shell
⭐
534
A curated list of awesome Shell frameworks, libraries and software.
React Transcript Editor
⭐
506
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
Awesome Kaldi
⭐
478
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Forced Alignment Tools
⭐
417
A collection of links and notes on forced alignment tools
Zamia Speech
⭐
413
Open tools and data for cloudless automatic speech recognition
Kaldi Io For Python
⭐
371
Python functions for reading kaldi data formats. Useful for rapid prototyping with python.
Opentransformer
⭐
310
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
Setk
⭐
306
Tools for Speech Enhancement integrated with Kaldi
Kaldi Active Grammar
⭐
305
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Cat
⭐
288
A CRF-based ASR Toolkit
Attention Lvcsr
⭐
259
End-to-End Attention-Based Large Vocabulary Speech Recognition
Docker Kaldi Gstreamer Server
⭐
246
Dockerfile for kaldi-gstreamer-server.
Vosk Browser
⭐
238
A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
Kaldiio
⭐
232
A pure python module for reading and writing kaldi ark files
Asr_theory
⭐
221
语音识别理论,论文和PPT
Kaldi Offline Transcriber
⭐
211
Offline transcription system for Estonian using Kaldi
Zeroth
⭐
211
Kaldi-based Korean ASR (한국어 음성인식) open-source project
Kaldi Lstm
⭐
198
C++ implementation of LSTM (Long Short Term Memory), in Kaldi's nnet1 framework. Used for automatic speech recognition, possibly language modeling etc, the training can be switched between CPU and GPU(CUDA). This repo is now merged into official Kaldi codebase(Karel's setup), so this repo is no longer maintained, please check out the Kaldi project instead.
Speech Aligner
⭐
194
speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。 is a tool that generate phoneme-level alignment between human speech and its transcription
Eend
⭐
192
End-to-End Neural Diarization
Dictate.js
⭐
179
A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
Tfkaldi
⭐
171
Speech recognition software where the neural net is trained with TensorFlow and GMM training and decoding is done in Kaldi
Gst Kaldi Nnet2 Online
⭐
165
GStreamer plugin around Kaldi's online neural network decoder
Kaldi Tuda De
⭐
165
Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.
Kaldifeat
⭐
161
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
Py Kaldi Asr
⭐
154
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Pykaldi2
⭐
147
Yet another speech toolkit based on Kaldi and PyTorch
Speech To Text Russian
⭐
138
Проект для распознавания речи на русском языке на основе pykaldi.
Elpis
⭐
137
🙊 software for creating speech recognition models.
Rustfst
⭐
134
Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Kaldi Dnn Ali Gop
⭐
133
Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Bases on Kaldi.
Awesome Speech
⭐
131
this is a treasure-house of speech
Kaldi Onnx
⭐
127
Kaldi model converter to ONNX
Keras Kaldi
⭐
124
Keras Interface for Kaldi ASR
Aps
⭐
122
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
Ctc_pytorch
⭐
121
CTC end -to-end ASR for timit and 863 corpus.
Pytorch_xvectors
⭐
120
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
Factorizedhierarchicalvae
⭐
115
This repository contains the code to reproduce the core results from the paper "Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data"
Nabu
⭐
112
Code for end-to-end ASR with neural networks, build with TensorFlow
Sepia Stt Server
⭐
105
SEPIA server to support open-source speech recognition via WebSocket connection.
Kaldi Gop
⭐
103
Computes the GMM-based Goodness of Pronunciation (GOP). Bases on Kaldi.
Kaldi
⭐
101
Pytorch Asr
⭐
100
ASR with PyTorch
Rnn Transducer
⭐
100
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
Speech Representations
⭐
97
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
Sms_wsj
⭐
85
SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition
Asr For Chinese Pipeline
⭐
85
Google Summer of Code 2018 Project: Automatic Speech Recognition for Speech-to-Text on Chinese
Machine Learning Tutorial Chinese
⭐
82
专门面向中文用户的机器学习相关的学习资料大集合
Silvius
⭐
82
Kaldi-based speech recognition system + grammar
Fakebob
⭐
81
Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)
Vbdiarization
⭐
81
Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data
Kaldi Serve
⭐
79
Server framework for Kaldi ASR Toolkit
E2e Asr
⭐
79
PyTorch Implementations for End-to-End Automatic Speech Recognition
Pykaldi
⭐
76
Python wrapper for Kaldi decoders (Kaldi https://sourceforge.net/projects/kaldi/)
Tdnn
⭐
70
Time delay neural network (TDNN) implementation in Pytorch using unfold method
Pkwrap
⭐
68
A pytorch wrapper for LF-MMI training and parallel training in Kaldi
Kaldi Ivector
⭐
68
Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure
Plda
⭐
67
An LDA/PLDA estimator using KALDI in python for speaker verification tasks
Kaldi Decoders
⭐
65
Custom decoders for Kaldi
Opendcd
⭐
64
Open Source WFST-based Decoder Toolkit
Audiocaption
⭐
64
Dataset and baseline for the first Audiocaption task
Kaldiwebrtcserver
⭐
62
Python server for communicating with Kaldi from the browser using WebRTC
Kaldi_nl
⭐
61
Code related to the Dutch instance and user groups of the KALDI speech recognition toolkit
Pb_chime5
⭐
58
Speech enhancement system for the CHiME-5 dinner party scenario
Kaldi.js
⭐
57
Auto Tuning Spectral Clustering
⭐
56
This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"
Kaldi Python
⭐
56
Python wrappers for Kaldi data
Kaldi Yesno Tutorial
⭐
55
Tutorial on Kaldi for Brandeis ASR course
Goodness Of Pronunciation
⭐
53
Asr Ios Local
⭐
53
基于kaldi的ios本地语音识别(本地实时流)Kaldi-based ios native speech recognition (local real-time streaming)
Speech To Text
⭐
51
mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Opensnips
⭐
50
Open source projects related to Snips https://snips.ai/.
Alex Asr
⭐
49
Online decoder for Kaldi NNET2 and GMM speech recognition models with Python bindings.
Ivector Xvector
⭐
48
Extract xvector and ivector under kaldi
Rsrgan
⭐
48
Robust Speech Recognition Using Generative Adversarial Networks (GAN)
Voice Privacy Challenge 2020
⭐
47
Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/
Docker Kaldi Android
⭐
47
Dockerfile for compiling Kaldi for Android.
Kaldialign
⭐
46
Python wrappers for Kaldi Levenshtein's distance and alignment code.
Voice Privacy Challenge 2022
⭐
45
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
Speechvae
⭐
44
This repository contains the code to reproduce the core results from the paper "Learning Latent Representations for Speech Generation and Transformation".
Lm_build
⭐
44
Adapting your own Language Model for Kaldi
1-100 of 306 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.