Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for pytorch speech recognition
pytorch
x
speech-recognition
x
97 search results found
Transformers
⭐
124,049
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Deeplearningexamples
⭐
12,073
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Espnet
⭐
7,563
End-to-End Speech Processing Toolkit
Speechbrain
⭐
7,166
A PyTorch-based Speech Toolkit
Silero Models
⭐
4,088
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Wenet
⭐
3,698
Production First and Production Ready End-to-End Speech Recognition Toolkit
Ml Road
⭐
2,742
Machine Learning Resources, Practice and Research
Funasr
⭐
2,315
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
Pytorch Kaldi
⭐
2,138
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Whisper Timestamped
⭐
1,217
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Espresso
⭐
930
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Conformer
⭐
809
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Lhotse
⭐
794
Tools for handling speech data in machine learning projects.
Sincnet
⭐
764
SincNet is a neural architecture for efficiently processing raw audio samples.
Speech
⭐
673
A PyTorch Implementation of End-to-End Models for Speech-to-Text
Libreasr
⭐
647
💬 An On-Premises, Streaming Speech Recognition System
Kospeech
⭐
572
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Treasure Of Transformers
⭐
541
💁 Awesome Treasure of Transformers Models for Natural Language processing contains papers, videos, blogs, official repo along with colab Notebooks. 🛫☑️
Whisper Finetune
⭐
502
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
Neural_sp
⭐
466
End-to-end ASR/LM implementation with PyTorch
Masr
⭐
462
Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conforme
Specaugment
⭐
411
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Allosaurus
⭐
411
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Nmtpytorch
⭐
395
Sequence-to-Sequence Framework in PyTorch
Spec_augment
⭐
374
🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Sherpa
⭐
374
Speech-to-text server framework with next-gen Kaldi
Unispeech
⭐
328
UniSpeech - Large Scale Self-Supervised Learning for Speech
Jarvis Chatgpt
⭐
242
A Conversational Assistant equipped with synthetic voices including J.A.R.V.I.S's. Powered by OpenAI and IBM Watson APIs and a Tacotron model for voice generation.
End2end Asr Pytorch
⭐
239
End-to-End Automatic Speech Recognition on PyTorch
End To End Lipreading
⭐
147
Pytorch code for End-to-End Audiovisual Speech Recognition
Mlm Scoring
⭐
135
Python library & examples for Masked Language Model Scoring (ACL 2020)
Mongolian Nlp
⭐
126
Useful resources for Mongolian NLP
Masr
⭐
113
中文语音识别系列,读者可以借助它快速训练属于自己的中文语音识别模型,或直接使用预训练模型测试效果。
Zeta
⭐
106
Build high-performance AI models with modular building blocks
Nlp_toolkit
⭐
101
Library of state-of-the-art models (PyTorch) for NLP tasks
Mltu
⭐
100
Machine Learning Training Utilities (for TensorFlow and PyTorch)
Pytorch Asr
⭐
100
ASR with PyTorch
Pytorch Speech Commands
⭐
98
Speech commands recognition with PyTorch
Whisper Auto Transcribe
⭐
91
Auto transcribe tool based on whisper
Mongolian Speech Recognition
⭐
86
Mongolian speech recognition with PyTorch
Rnn Transducer
⭐
79
A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition
E2e Asr
⭐
79
PyTorch Implementations for End-to-End Automatic Speech Recognition
Asr Wav2vec Finetune
⭐
76
⚡ Finetune Wa2vec 2.0 For Speech Recognition
Wav2letter
⭐
70
Speech Recognition model based off of FAIR research paper built using Pytorch.
Tdnn
⭐
70
Time delay neural network (TDNN) implementation in Pytorch using unfold method
Vakyansh Wav2vec2 Experimentation
⭐
67
Repository containing experimentation platform on how to train, infer on wav2vec2 models.
Wav2letter.pytorch
⭐
67
A fully convolution-network for speech-to-text, built on pytorch.
Patter
⭐
61
speech-to-text in pytorch
Squeezeformer
⭐
60
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)
Transfusion Asr
⭐
59
Transcribing Speech with Multinomial Diffusion, training code and models.
Speech Transformer
⭐
55
PyTorch re-implementation of Speech-Transformer
Speech Recognition Via Cnn
⭐
54
孤立词语音识别,复旦大学计算机科学技术学院数字信号处理期末项目
Ctc Optimizedloss
⭐
50
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
Triplet_loss_kws
⭐
47
Learning Efficient Representations for Keyword Spotting with Triplet Loss
Biglittlenet
⭐
46
Official repository for Big-Little Net
Noisy Student Training Asr
⭐
44
Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem
Pywhisper
⭐
42
openai/whisper + extra features
Deepspeech
⭐
40
A PyTorch implementation of DeepSpeech and DeepSpeech2.
Open_stt_e2e
⭐
39
PyTorch end-to-end speech recognition
Jetson Voice
⭐
39
ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT
Banglaspeech2text
⭐
38
BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned on the latest whisper speech to text model for optimal performance.
Factorized Tdnn
⭐
38
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Wavencoder
⭐
36
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
Cif Pytorch
⭐
36
[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).
Miniasr
⭐
31
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
Openasr
⭐
29
A pytorch based end2end speech recognition system.
Pytorch_mlp_for_asr
⭐
27
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
Speech Emotion Recognition
⭐
23
Predicting various emotion in human speech signal by detecting different speech components affected by human emotion.
Meta Transfer Learning
⭐
22
Implementation of meta-transfer-learning (ACL 2020)
Voice100
⭐
22
Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.
Jasper
⭐
20
PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)
Thunder Speech
⭐
18
A Hackable speech recognition library.
Kosr
⭐
18
Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Listen Attend Spell V2
⭐
17
PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
Fast Seamlessm4t Onnx
⭐
16
ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation
Ete Speech Recognition
⭐
15
Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch
End To End Mandarin Asr
⭐
15
End-to-end speech recognition on AISHELL dataset.
Audio Lottery
⭐
15
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Zhangyang Wang
Sparse_image_warp_pytorch
⭐
14
Pytorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for Automatic Speech Recognition https://arxiv.org/abs/1904.08779
Speech Command Recognition
⭐
13
Classify input audio segment into categories for keyword spotting with MatchboxNet with training, exporting onnx model, accelerating inference via TensorRT
Whisper Finetune
⭐
13
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
Pytorchsr
⭐
12
Pytorch based phoneme recognition (TIMIT phoneme classification)
End To End Speech Recognition Models
⭐
11
PyTorch implementation of automatic speech recognition models.
Audio Pretrained Model
⭐
11
A collection of Audio and Speech pre-trained models.
Kaggle Speech Recognition
⭐
11
Solution for TensorFlow Speech Recognition Challenge on Kaggle (125th place, top 10%)
Wavenet Speech To Text
⭐
10
A PyTorch implementation of speech recognition based on DeepMind's WaveNet
Deepspeech Pytorch
⭐
8
Pytorch implementation for DeepSpeech 2.0
Kaggle Ai
⭐
8
Categorize AI problems and record through kaggle, Google's data science website
End2endautomaticspeechrecognition
⭐
7
In this repository, I have developed an end to end Automatic speech recognition project. I have developed the neural network model for automatic speech recognition with PyTorch and used MLflow to manage the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry.
Ce Optimizedloss
⭐
7
Computes the MWER (minimum WER) Loss with beam search and negative sampling strategy.
Inferspeech
⭐
7
PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant
Bilatticernn Confidence
⭐
7
Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks https://arxiv.org/abs/1910.11933 or https://ieeexplore.ieee.org/document/9053264
Pytorch Commands
⭐
7
Some PyTorch code for the Kaggle Speech Recognition Challenge
Chapter9
⭐
6
Chapter 9: Attention and Memory Augmented Networks
Wav2vec2_stt_python
⭐
6
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition
Bertphone
⭐
5
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
Neural Network Zoo
⭐
5
🧠🕸️ Neural network architectures implemented in PyTorch for educational purposes.
Indicasr
⭐
5
Speeech Recognition for Indic languages.
Related Searches
Python Pytorch (15,131)
Deep Learning Pytorch (7,533)
Jupyter Notebook Pytorch (4,892)
Machine Learning Pytorch (2,934)
Dataset Pytorch (1,848)
Pytorch Neural Network (1,391)
Pytorch Computer Vision (1,388)
Tensorflow Pytorch (1,333)
Pytorch Neural (1,217)
Pytorch Generative Adversarial Network (1,199)
1-97 of 97 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.