Awesome Open Source

Programming Languages

Search results for kaldi

306 search results found

Awesome Pytorch List ⭐ 14,715

A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.

Kaldi ⭐ 13,453

kaldi-asr/kaldi is the official location of the Kaldi project.

Espnet ⭐ 7,563

End-to-End Speech Processing Toolkit

Vosk Api ⭐ 6,633

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Pytorch Kaldi ⭐ 2,138

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Rhasspy ⭐ 2,036

Offline private voice assistant for many human languages

Dragonfire ⭐ 1,294

the open-source virtual assistant for Ubuntu based Linux distributions

Gentle ⭐ 1,225

gentle forced aligner

Montreal Forced Aligner ⭐ 1,124

Command line utility for forced alignment using Kaldi

Pykaldi ⭐ 954

A Python wrapper for Kaldi

Botium Speech Processing ⭐ 938

Botium Speech Processing

Espresso ⭐ 930

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Kaldi Gstreamer Server ⭐ 865

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.

Alibaba Mit Speech ⭐ 860

Alibaba speech technology

Vosk Server ⭐ 802

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

Tools for handling speech data in machine learning projects.

Speech Transformer ⭐ 714

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

The official repository of the Eesen project

Asv Subtools ⭐ 561

An Open Source Tools for Speaker Recognition

Awesome Shell ⭐ 534

A curated list of awesome Shell frameworks, libraries and software.

React Transcript Editor ⭐ 506

A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress

Awesome Kaldi ⭐ 478

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

Forced Alignment Tools ⭐ 417

A collection of links and notes on forced alignment tools

Zamia Speech ⭐ 413

Open tools and data for cloudless automatic speech recognition

Kaldi Io For Python ⭐ 371

Python functions for reading kaldi data formats. Useful for rapid prototyping with python.

Opentransformer ⭐ 310

A No-Recurrence Sequence-to-Sequence Model for Speech Recognition

Tools for Speech Enhancement integrated with Kaldi

Kaldi Active Grammar ⭐ 305

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

A CRF-based ASR Toolkit

Attention Lvcsr ⭐ 259

End-to-End Attention-Based Large Vocabulary Speech Recognition

Docker Kaldi Gstreamer Server ⭐ 246

Dockerfile for kaldi-gstreamer-server.

Vosk Browser ⭐ 238

A speech recognition library running in the browser thanks to a WebAssembly build of Vosk

Kaldiio ⭐ 232

A pure python module for reading and writing kaldi ark files

Asr_theory ⭐ 221

语音识别理论，论文和PPT

Kaldi Offline Transcriber ⭐ 211

Offline transcription system for Estonian using Kaldi

Kaldi-based Korean ASR (한국어 음성인식) open-source project

Kaldi Lstm ⭐ 198

C++ implementation of LSTM (Long Short Term Memory), in Kaldi's nnet1 framework. Used for automatic speech recognition, possibly language modeling etc, the training can be switched between CPU and GPU(CUDA). This repo is now merged into official Kaldi codebase(Karel's setup), so this repo is no longer maintained, please check out the Kaldi project instead.

Speech Aligner ⭐ 194

speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。 is a tool that generate phoneme-level alignment between human speech and its transcription

End-to-End Neural Diarization

Dictate.js ⭐ 179

A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.

Tfkaldi ⭐ 171

Speech recognition software where the neural net is trained with TensorFlow and GMM training and decoding is done in Kaldi

Gst Kaldi Nnet2 Online ⭐ 165

GStreamer plugin around Kaldi's online neural network decoder

Kaldi Tuda De ⭐ 165

Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.

Kaldifeat ⭐ 161

Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API

Py Kaldi Asr ⭐ 154

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

Pykaldi2 ⭐ 147

Yet another speech toolkit based on Kaldi and PyTorch

Speech To Text Russian ⭐ 138

Проект для распознавания речи на русском языке на основе pykaldi.

🙊 software for creating speech recognition models.

Rustfst ⭐ 134

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Kaldi Dnn Ali Gop ⭐ 133

Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Bases on Kaldi.

Awesome Speech ⭐ 131

this is a treasure-house of speech

Kaldi Onnx ⭐ 127

Kaldi model converter to ONNX

Keras Kaldi ⭐ 124

Keras Interface for Kaldi ASR

A personal toolkit for single/multi-channel speech recognition & enhancement & separation.

Ctc_pytorch ⭐ 121

CTC end -to-end ASR for timit and 863 corpus.

Pytorch_xvectors ⭐ 120

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

Factorizedhierarchicalvae ⭐ 115

This repository contains the code to reproduce the core results from the paper "Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data"

Code for end-to-end ASR with neural networks, build with TensorFlow

Sepia Stt Server ⭐ 105

SEPIA server to support open-source speech recognition via WebSocket connection.

Kaldi Gop ⭐ 103

Computes the GMM-based Goodness of Pronunciation (GOP). Bases on Kaldi.

Pytorch Asr ⭐ 100

ASR with PyTorch

Rnn Transducer ⭐ 100

MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks

Speech Representations ⭐ 97

Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)

SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition

Asr For Chinese Pipeline ⭐ 85

Google Summer of Code 2018 Project: Automatic Speech Recognition for Speech-to-Text on Chinese

Machine Learning Tutorial Chinese ⭐ 82

专门面向中文用户的机器学习相关的学习资料大集合

Kaldi-based speech recognition system + grammar

Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)

Vbdiarization ⭐ 81

Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data

Kaldi Serve ⭐ 79

Server framework for Kaldi ASR Toolkit

PyTorch Implementations for End-to-End Automatic Speech Recognition

Python wrapper for Kaldi decoders (Kaldi https://sourceforge.net/projects/kaldi/)

Time delay neural network (TDNN) implementation in Pytorch using unfold method

A pytorch wrapper for LF-MMI training and parallel training in Kaldi

Kaldi Ivector ⭐ 68

Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure

An LDA/PLDA estimator using KALDI in python for speaker verification tasks

Kaldi Decoders ⭐ 65

Custom decoders for Kaldi

Open Source WFST-based Decoder Toolkit

Audiocaption ⭐ 64

Dataset and baseline for the first Audiocaption task

Kaldiwebrtcserver ⭐ 62

Python server for communicating with Kaldi from the browser using WebRTC

Kaldi_nl ⭐ 61

Code related to the Dutch instance and user groups of the KALDI speech recognition toolkit

Pb_chime5 ⭐ 58

Speech enhancement system for the CHiME-5 dinner party scenario

Kaldi.js ⭐ 57

Auto Tuning Spectral Clustering ⭐ 56

This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"

Kaldi Python ⭐ 56

Python wrappers for Kaldi data

Kaldi Yesno Tutorial ⭐ 55

Tutorial on Kaldi for Brandeis ASR course

Goodness Of Pronunciation ⭐ 53

Asr Ios Local ⭐ 53

基于kaldi的ios本地语音识别（本地实时流）Kaldi-based ios native speech recognition (local real-time streaming)

Speech To Text ⭐ 51

mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras

Opensnips ⭐ 50

Open source projects related to Snips https://snips.ai/.

Alex Asr ⭐ 49

Online decoder for Kaldi NNET2 and GMM speech recognition models with Python bindings.

Ivector Xvector ⭐ 48

Extract xvector and ivector under kaldi

Robust Speech Recognition Using Generative Adversarial Networks (GAN)

Voice Privacy Challenge 2020 ⭐ 47

Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/

Docker Kaldi Android ⭐ 47

Dockerfile for compiling Kaldi for Android.

Kaldialign ⭐ 46

Python wrappers for Kaldi Levenshtein's distance and alignment code.

Voice Privacy Challenge 2022 ⭐ 45

Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software

Speechvae ⭐ 44

This repository contains the code to reproduce the core results from the paper "Learning Latent Representations for Speech Generation and Transformation".

Lm_build ⭐ 44

Adapting your own Language Model for Kaldi

1-100 of 306 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.