Awesome Open Source

Search results for pytorch speech recognition

97 search results found

Transformers ⭐ 124,049

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Deeplearningexamples ⭐ 12,073

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Espnet ⭐ 7,563

End-to-End Speech Processing Toolkit

Speechbrain ⭐ 7,166

A PyTorch-based Speech Toolkit

Silero Models ⭐ 4,088

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Wenet ⭐ 3,698

Production First and Production Ready End-to-End Speech Recognition Toolkit

Ml Road ⭐ 2,742

Machine Learning Resources, Practice and Research

Funasr ⭐ 2,315

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.

Pytorch Kaldi ⭐ 2,138

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Whisper Timestamped ⭐ 1,217

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Espresso ⭐ 930

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Conformer ⭐ 809

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Tools for handling speech data in machine learning projects.

Sincnet ⭐ 764

SincNet is a neural architecture for efficiently processing raw audio samples.

A PyTorch Implementation of End-to-End Models for Speech-to-Text

Libreasr ⭐ 647

💬 An On-Premises, Streaming Speech Recognition System

Kospeech ⭐ 572

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Treasure Of Transformers ⭐ 541

💁 Awesome Treasure of Transformers Models for Natural Language processing contains papers, videos, blogs, official repo along with colab Notebooks. 🛫☑️

Whisper Finetune ⭐ 502

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment

Neural_sp ⭐ 466

End-to-end ASR/LM implementation with PyTorch

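A streaming and non-streaming automatic speech recognition framework implemented in PyTorch, compatible with both online and offline recognition. Currently supports Conforme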

Specaugment ⭐ 411

An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain

Allosaurus ⭐ 411

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Nmtpytorch ⭐ 395

Sequence-to-Sequence Framework in PyTorch

Spec_augment ⭐ 374

🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

Speech-to-text server framework with next-gen Kaldi

Unispeech ⭐ 328

UniSpeech - Large Scale Self-Supervised Learning for Speech

Jarvis Chatgpt ⭐ 242

A Conversational Assistant equipped with synthetic voices including J.A.R.V.I.S's. Powered by OpenAI and IBM Watson APIs and a Tacotron model for voice generation.

End2end Asr Pytorch ⭐ 239

End-to-End Automatic Speech Recognition on PyTorch

End To End Lipreading ⭐ 147

Pytorch code for End-to-End Audiovisual Speech Recognition

Mlm Scoring ⭐ 135

Python library & examples for Masked Language Model Scoring (ACL 2020)

Mongolian Nlp ⭐ 126

Useful resources for Mongolian NLP

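A Chinese speech recognition series. Readers can use it to quickly train their own Chinese speech recognition models, or test results directly with the pretrained models.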

Build high-performance AI models with modular building blocks

Nlp_toolkit ⭐ 101

Library of state-of-the-art models (PyTorch) for NLP tasks

Machine Learning Training Utilities (for TensorFlow and PyTorch)

Pytorch Asr ⭐ 100

ASR with PyTorch

Pytorch Speech Commands ⭐ 98

Speech commands recognition with PyTorch

Whisper Auto Transcribe ⭐ 91

Auto transcribe tool based on whisper

Mongolian Speech Recognition ⭐ 86

Mongolian speech recognition with PyTorch

Rnn Transducer ⭐ 79

A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition

PyTorch Implementations for End-to-End Automatic Speech Recognition

Asr Wav2vec Finetune ⭐ 76

⚡ Finetune Wav2vec 2.0 For Speech Recognition

Wav2letter ⭐ 70

Speech recognition model based on a FAIR research paper, built using PyTorch.

Time delay neural network (TDNN) implementation in Pytorch using unfold method

Vakyansh Wav2vec2 Experimentation ⭐ 67

Repository containing experimentation platform on how to train, infer on wav2vec2 models.

Wav2letter.pytorch ⭐ 67

A fully convolutional network for speech-to-text, built on PyTorch.

speech-to-text in pytorch

Squeezeformer ⭐ 60

PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)

Transfusion Asr ⭐ 59

Transcribing Speech with Multinomial Diffusion, training code and models.

Speech Transformer ⭐ 55

PyTorch re-implementation of Speech-Transformer

Speech Recognition Via Cnn ⭐ 54

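Isolated-word speech recognition; final project for the Digital Signal Processing course at the School of Computer Science and Technology, Fudan University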

Ctc Optimizedloss ⭐ 50

Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.

Triplet_loss_kws ⭐ 47

Learning Efficient Representations for Keyword Spotting with Triplet Loss

Biglittlenet ⭐ 46

Official repository for Big-Little Net

Noisy Student Training Asr ⭐ 44

PyTorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problems

Pywhisper ⭐ 42

openai/whisper + extra features

Deepspeech ⭐ 40

A PyTorch implementation of DeepSpeech and DeepSpeech2.

Open_stt_e2e ⭐ 39

PyTorch end-to-end speech recognition

Jetson Voice ⭐ 39

ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT

Banglaspeech2text ⭐ 38

BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned on the latest whisper speech to text model for optimal performance.

Factorized Tdnn ⭐ 38

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

Wavencoder ⭐ 36

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.

Cif Pytorch ⭐ 36

[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).

A mini, simple, and fast end-to-end automatic speech recognition toolkit.

A pytorch based end2end speech recognition system.

Pytorch_mlp_for_asr ⭐ 27

This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.

Speech Emotion Recognition ⭐ 23

Predicting various emotions in human speech signals by detecting the speech components affected by emotion.

Meta Transfer Learning ⭐ 22

Implementation of meta-transfer-learning (ACL 2020)

Voice100 ⭐ 22

Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.

PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)

Thunder Speech ⭐ 18

A Hackable speech recognition library.

Transformer-based Korean speech recognition

Listen Attend Spell V2 ⭐ 17

PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).

Fast Seamlessm4t Onnx ⭐ 16

ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation

Ete Speech Recognition ⭐ 15

Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch

End To End Mandarin Asr ⭐ 15

End-to-end speech recognition on AISHELL dataset.

Audio Lottery ⭐ 15

[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Zhangyang Wang

Sparse_image_warp_pytorch ⭐ 14

PyTorch implementation of sparse_image_warp, with an example of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition (https://arxiv.org/abs/1904.08779)

Speech Command Recognition ⭐ 13

Classifies input audio segments into categories for keyword spotting with MatchboxNet; covers training, ONNX model export, and accelerated inference via TensorRT

Whisper Finetune ⭐ 13

Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.

Pytorchsr ⭐ 12

Pytorch based phoneme recognition (TIMIT phoneme classification)

End To End Speech Recognition Models ⭐ 11

PyTorch implementation of automatic speech recognition models.

Audio Pretrained Model ⭐ 11

A collection of Audio and Speech pre-trained models.

Kaggle Speech Recognition ⭐ 11

Solution for TensorFlow Speech Recognition Challenge on Kaggle (125th place, top 10%)

Wavenet Speech To Text ⭐ 10

A PyTorch implementation of speech recognition based on DeepMind's WaveNet

Deepspeech Pytorch ⭐ 8

Pytorch implementation for DeepSpeech 2.0

Kaggle Ai ⭐ 8

Categorize AI problems and record through kaggle, Google's data science website

End2endautomaticspeechrecognition ⭐ 7

An end-to-end automatic speech recognition project. The neural network model is built with PyTorch, and MLflow manages the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry.

Ce Optimizedloss ⭐ 7

Computes the MWER (minimum WER) Loss with beam search and negative sampling strategy.

Inferspeech ⭐ 7

PyTorch speech2text inference script for the NVIDIA OpenSeq2Seq wav2letter model variant

Bilatticernn Confidence ⭐ 7

Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks https://arxiv.org/abs/1910.11933 or https://ieeexplore.ieee.org/document/9053264

Pytorch Commands ⭐ 7

Some PyTorch code for the Kaggle Speech Recognition Challenge

Chapter 9: Attention and Memory Augmented Networks

Wav2vec2_stt_python ⭐ 6

Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition

Bertphone ⭐ 5

Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"

Neural Network Zoo ⭐ 5

🧠🕸️ Neural network architectures implemented in PyTorch for educational purposes.

Speech recognition for Indic languages.
