Awesome Open Source

Programming Languages

Search results for deep learning speech processing

deep-learning x

speech-processing x

39 search results found

Speechbrain ⭐ 7,166

A PyTorch-based Speech Toolkit

Awesome Multimodal Ml ⭐ 5,399

Reading list for research topics in multimodal machine learning

Awesome Diarization ⭐ 1,384

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Whisper Timestamped ⭐ 1,217

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Sincnet ⭐ 764

SincNet is a neural architecture for efficiently processing raw audio samples.

Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.

Ims Toucan ⭐ 426

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Speech Denoising Wavenet ⭐ 414

A neural network for end-to-end speech denoising

Neural Voice Cloning With Few Samples ⭐ 379

This repository has implementation for "Neural Voice Cloning With Few Samples"

Multibench ⭐ 356

[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning

Problem Agnostic Speech Encoder

Ttslearn ⭐ 197

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Audio Development Tools ⭐ 165

This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, music generation, speech recognition, speech synthesis, singing voice synthesis and more.

Awesome Speech Enhancement ⭐ 151

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

Tutorial_separation ⭐ 117

This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.

Mevonai Speech Emotion Recognition ⭐ 112

Identify the emotion of multiple speakers in an Audio Segment

Tfg Voice Conversion ⭐ 109

Deep Learning-based Voice Conversion system

Whisper Auto Transcribe ⭐ 91

Auto transcribe tool based on whisper

Speechclip ⭐ 80

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022

Speechprompt ⭐ 80

**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speech processing with prompting paradigm

Speechprompt V2 ⭐ 59

《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm

Voice2series Reprogramming ⭐ 55

ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time Series Classification

a Wide Shelf for AI and Data Science | Resources 🍔

Torchsubband ⭐ 51

Pytorch implementation of subband decomposition

Keras Sincnet ⭐ 49

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Awesome Speech Emotion Recognition ⭐ 36

😎 Awesome lists about Speech Emotion Recognition

Wavencoder ⭐ 36

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.

A implementation of Power Normalized Cepstral Coefficients: PNCC

Keras-based python framework to compute phonological posterior probabilities from audio files

A Convolutional Neural Network based Voice Activity Detector for Smartphones

Robustvc ⭐ 17

**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degradation / adversarial robustness of VC models.

Great Deep Learning Books ⭐ 16

A Great Collection of Deep Learning (e)Books

Awesome Speech Enhancement ⭐ 16

A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.

Orgainzed Digital Intelligent Network (O.D.I.N)

Speech Recognition Learning Resources ⭐ 15

✅ A list of speech recognition learning resources including courses, books, tutorials, papers and toolkits.

Speech Emotion Recognition ⭐ 13

A program that uses neural networks to detect emotions from pre-recorded and real-time speech

Booklibrary ⭐ 12

Book Library of P&W Studio

Deep Learning Sota ⭐ 11

State-of-the-art results for deep learning tasks in various fields.

Icrcyclegan Vc ⭐ 11

Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny

Speechgen ⭐ 10

《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》

Deep Speechgen ⭐ 9

RNN for acoustic speech generation

Speechvgg ⭐ 6

The repository was moved! For the most recent version see:

Integrated Hearing Aid App ⭐ 5

A smartphone applications with Convolutional Neural Network Voice Activity Detector, Adaptive Noise Reduction and Dynamic Audio Range Compression

Stars Collection ⭐ 5

🌟 A collection of great repositories (grouped into different categories).

Related Searches

Python Deep Learning (13,092)

Jupyter Notebook Deep Learning (10,328)

Deep Learning Neural Network (5,801)

Deep Learning Pytorch (4,653)

Deep Learning Tensorflow (4,441)

Deep Learning Computer Vision (3,018)

Deep Learning Artificial Intelligence (2,898)

Deep Learning Keras (2,519)

Deep Learning Natural Language Processing (2,283)

Deep Learning Neural (2,063)

1-39 of 39 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.