Awesome Open Source

Programming Languages

Search results for pytorch speech processing

speech-processing x

28 search results found

Speechbrain ⭐ 7,166

A PyTorch-based Speech Toolkit

Pyannote Audio ⭐ 4,460

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Deepvoice3_pytorch ⭐ 1,906

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Wavenet_vocoder ⭐ 1,617

WaveNet vocoder

Whisper Timestamped ⭐ 1,217

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Sincnet ⭐ 764

SincNet is a neural architecture for efficiently processing raw audio samples.

Fullsubnet ⭐ 443

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Ims Toucan ⭐ 426

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Nnmnkwii ⭐ 375

Library to build speech synthesis systems designed for easy and fast prototyping.

Unispeech ⭐ 328

UniSpeech - Large Scale Self-Supervised Learning for Speech

Problem Agnostic Speech Encoder

Wave U Net For Speech Enhancement ⭐ 184

Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.

Vq Vae Speech ⭐ 145

PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]

Soundsourceseparation ⭐ 134

The code for multi-channel source separation and dereverberation such as FastMNMF1, FastMNMF2, and AR-FastMNMF2.

Whisper Auto Transcribe ⭐ 91

Auto transcribe tool based on whisper

Speechprompt ⭐ 80

**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speech processing with prompting paradigm

Speechclip ⭐ 80

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022

A Convolutional Recurrent Neural Network For Real Time Speech Enhancement ⭐ 79

A minimum unofficial implementation of the A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement (CRN) using PyTorch.

Time delay neural network (TDNN) implementation in Pytorch using unfold method

A neural network framework for researchers studying acoustic communication

Wavencoder ⭐ 36

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.

Pytorch Kaldi Neural Speaker Embeddings ⭐ 27

A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.

Huawei Challenge Speaker Identification ⭐ 26

Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.

Hifigan Denoiser ⭐ 22

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Great Deep Learning Books ⭐ 16

A Great Collection of Deep Learning (e)Books

Multimodal Emotion eXpression Capture Amsterdam. Pipeline for capturing emotion expressions from multiple modalities (video, audio, text) in the wild.

Bilatticernn Confidence ⭐ 7

Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks https://arxiv.org/abs/1910.11933 or https://ieeexplore.ieee.org/document/9053264

Pydrobert Speech ⭐ 5

Speech processing with Python

Related Searches

Python Pytorch (14,671)

Deep Learning Pytorch (7,533)

Jupyter Notebook Pytorch (4,892)

Machine Learning Pytorch (2,934)

Dataset Pytorch (1,847)

Pytorch Natural Language Processing (1,408)

Pytorch Neural Network (1,391)

Network Pytorch (1,299)

Pytorch Computer Vision (1,230)

Pytorch Neural (1,217)

1-28 of 28 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.