Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for pytorch speech processing
pytorch
x
speech-processing
x
28 search results found
Speechbrain
⭐
7,166
A PyTorch-based Speech Toolkit
Pyannote Audio
⭐
4,460
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Deepvoice3_pytorch
⭐
1,906
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Wavenet_vocoder
⭐
1,617
WaveNet vocoder
Whisper Timestamped
⭐
1,217
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Sincnet
⭐
764
SincNet is a neural architecture for efficiently processing raw audio samples.
Fullsubnet
⭐
443
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Ims Toucan
⭐
426
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Nnmnkwii
⭐
375
Library to build speech synthesis systems designed for easy and fast prototyping.
Unispeech
⭐
328
UniSpeech - Large Scale Self-Supervised Learning for Speech
Pase
⭐
265
Problem Agnostic Speech Encoder
Wave U Net For Speech Enhancement
⭐
184
Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.
Vq Vae Speech
⭐
145
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
Soundsourceseparation
⭐
134
The code for multi-channel source separation and dereverberation such as FastMNMF1, FastMNMF2, and AR-FastMNMF2.
Whisper Auto Transcribe
⭐
91
Auto transcribe tool based on whisper
Speechprompt
⭐
80
**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speech processing with prompting paradigm
Speechclip
⭐
80
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022
A Convolutional Recurrent Neural Network For Real Time Speech Enhancement
⭐
79
A minimum unofficial implementation of the A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement (CRN) using PyTorch.
Tdnn
⭐
70
Time delay neural network (TDNN) implementation in Pytorch using unfold method
Vak
⭐
67
A neural network framework for researchers studying acoustic communication
Wavencoder
⭐
36
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
Pytorch Kaldi Neural Speaker Embeddings
⭐
27
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
Huawei Challenge Speaker Identification
⭐
26
Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.
Hifigan Denoiser
⭐
22
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Great Deep Learning Books
⭐
16
A Great Collection of Deep Learning (e)Books
Mexca
⭐
11
Multimodal Emotion eXpression Capture Amsterdam. Pipeline for capturing emotion expressions from multiple modalities (video, audio, text) in the wild.
Bilatticernn Confidence
⭐
7
Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks https://arxiv.org/abs/1910.11933 or https://ieeexplore.ieee.org/document/9053264
Pydrobert Speech
⭐
5
Speech processing with Python
Related Searches
Python Pytorch (14,671)
Deep Learning Pytorch (7,533)
Jupyter Notebook Pytorch (4,892)
Machine Learning Pytorch (2,934)
Dataset Pytorch (1,847)
Pytorch Natural Language Processing (1,408)
Pytorch Neural Network (1,391)
Network Pytorch (1,299)
Pytorch Computer Vision (1,230)
Pytorch Neural (1,217)
1-28 of 28 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.