Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for deep learning speech processing
deep-learning
x
speech-processing
x
39 search results found
Speechbrain
⭐
7,166
A PyTorch-based Speech Toolkit
Awesome Multimodal Ml
⭐
5,399
Reading list for research topics in multimodal machine learning
Awesome Diarization
⭐
1,384
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Whisper Timestamped
⭐
1,217
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Sincnet
⭐
764
SincNet is a neural architecture for efficiently processing raw audio samples.
Dtln
⭐
470
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
Ims Toucan
⭐
426
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Speech Denoising Wavenet
⭐
414
A neural network for end-to-end speech denoising
Neural Voice Cloning With Few Samples
⭐
379
This repository has implementation for "Neural Voice Cloning With Few Samples"
Multibench
⭐
356
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
Pase
⭐
265
Problem Agnostic Speech Encoder
Ttslearn
⭐
197
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Audio Development Tools
⭐
165
This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, music generation, speech recognition, speech synthesis, singing voice synthesis and more.
Awesome Speech Enhancement
⭐
151
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Tutorial_separation
⭐
117
This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.
Mevonai Speech Emotion Recognition
⭐
112
Identify the emotion of multiple speakers in an Audio Segment
Tfg Voice Conversion
⭐
109
Deep Learning-based Voice Conversion system
Whisper Auto Transcribe
⭐
91
Auto transcribe tool based on whisper
Speechclip
⭐
80
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022
Speechprompt
⭐
80
**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speech processing with prompting paradigm
Speechprompt V2
⭐
59
《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm
Voice2series Reprogramming
⭐
55
ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time Series Classification
Shelf
⭐
52
a Wide Shelf for AI and Data Science | Resources 🍔
Torchsubband
⭐
51
Pytorch implementation of subband decomposition
Keras Sincnet
⭐
49
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Awesome Speech Emotion Recognition
⭐
36
😎 Awesome lists about Speech Emotion Recognition
Wavencoder
⭐
36
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
Pncc
⭐
32
A implementation of Power Normalized Cepstral Coefficients: PNCC
Phonet
⭐
24
Keras-based python framework to compute phonological posterior probabilities from audio files
Cnn Vad
⭐
18
A Convolutional Neural Network based Voice Activity Detector for Smartphones
Robustvc
⭐
17
**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degradation / adversarial robustness of VC models.
Great Deep Learning Books
⭐
16
A Great Collection of Deep Learning (e)Books
Awesome Speech Enhancement
⭐
16
A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
Odin Ai
⭐
15
Orgainzed Digital Intelligent Network (O.D.I.N)
Speech Recognition Learning Resources
⭐
15
✅ A list of speech recognition learning resources including courses, books, tutorials, papers and toolkits.
Speech Emotion Recognition
⭐
13
A program that uses neural networks to detect emotions from pre-recorded and real-time speech
Booklibrary
⭐
12
Book Library of P&W Studio
Deep Learning Sota
⭐
11
State-of-the-art results for deep learning tasks in various fields.
Icrcyclegan Vc
⭐
11
Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny
Speechgen
⭐
10
《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》
Deep Speechgen
⭐
9
RNN for acoustic speech generation
Speechvgg
⭐
6
The repository was moved! For the most recent version see:
Integrated Hearing Aid App
⭐
5
A smartphone applications with Convolutional Neural Network Voice Activity Detector, Adaptive Noise Reduction and Dynamic Audio Range Compression
Stars Collection
⭐
5
🌟 A collection of great repositories (grouped into different categories).
Related Searches
Python Deep Learning (13,092)
Jupyter Notebook Deep Learning (10,328)
Deep Learning Neural Network (5,801)
Deep Learning Pytorch (4,653)
Deep Learning Tensorflow (4,441)
Deep Learning Computer Vision (3,018)
Deep Learning Artificial Intelligence (2,898)
Deep Learning Keras (2,519)
Deep Learning Natural Language Processing (2,283)
Deep Learning Neural (2,063)
1-39 of 39 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.