Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python spectrogram
python
x
spectrogram
x
211 search results found
Ultimatevocalremovergui
⭐
12,990
GUI for a Vocal Remover that uses Deep Neural Networks.
Demucs
⭐
7,127
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Dejavu
⭐
6,108
Audio fingerprinting and recognition in Python
Deep Voice Conversion
⭐
3,739
Deep neural networks for voice conversion (voice style transfer) in Tensorflow
Audio
⭐
2,321
Data manipulation and transformation for audio signal processing, powered by PyTorch
Audioflux
⭐
1,968
A library for audio and music analysis, feature extraction.
Wavenet_vocoder
⭐
1,617
WaveNet vocoder
Audiomentations
⭐
1,605
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Hifi Gan
⭐
1,376
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Wavegan
⭐
1,165
WaveGAN: Learn to synthesize raw audio with generative adversarial networks
Vocal Remover
⭐
1,148
Vocal Remover using Deep Neural Networks
Open Unmix Pytorch
⭐
990
Open-Unmix - Music Source Separation for PyTorch
Nnaudio
⭐
882
Audio processing by using pytorch 1D convolution network
Melgan Neurips
⭐
872
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Kapre
⭐
841
kapre: Keras Audio Preprocessors
Multilingual_text_to_speech
⭐
740
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Friture
⭐
720
Real-time audio visualizations (spectrum, spectrogram, etc.)
Autovc
⭐
687
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Free Spoken Digit Dataset
⭐
518
A free audio dataset of spoken digits. Think MNIST for audio.
Speech Enhancement
⭐
515
Deep learning for audio denoising
Specaugment
⭐
411
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Randomcnn Voice Transfer
⭐
377
Audio style transfer with shallow random parameters CNN.
Torchlibrosa
⭐
348
Neural_network_voices
⭐
327
This is the code for "Neural Network Voices" by Siraj Raval on Youtube
Argus Freesound
⭐
266
Kaggle | 1st place solution for Freesound Audio Tagging 2019
Crnn Audio Classification
⭐
249
UrbanSound classification using Convolutional Recurrent Networks in PyTorch
Spoken Language Identification
⭐
216
Spoken language identification with deep learning
Spectrographic
⭐
205
Turn an image into sound whose spectrogram looks like the image.
Tacotron_pytorch
⭐
204
PyTorch implementation of Tacotron speech synthesis model.
Squeezewave
⭐
184
Paura
⭐
180
Python AUdio Recording and Analysis (paura)
Parallel Wavenet Vocoder
⭐
149
A WaveNet-based vocoder for fast inference
Depression Detect
⭐
146
Predicting depression from acoustic features of speech using a Convolutional Neural Network.
Griffin_lim
⭐
126
Implementation of the Griffin and Lim algorithm to recover an audio signal from a magnitude-only spectrogram.
Music Audio Tagging At Scale Models
⭐
117
Tensorflow implementation of the models used in "End-to-end learning for music audio tagging at scale"
Tacotron2 Pytorch
⭐
115
Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.
Audeep
⭐
111
Kaggle Freesound Audio Tagging
⭐
110
8th place solution (on Kaggle) to the Freesound General-Purpose Audio Tagging Challenge (DCASE 2018 - Task 2)
Spectral_connectivity
⭐
109
Frequency domain estimation and functional and directed connectivity analysis tools for electrophysiological data
Untwist
⭐
108
Pggan Pytorch
⭐
104
Progressively Growing GAN in PyTorch for Image and Sound generation
Bird
⭐
104
Big Impulse Response Dataset
Videotovoice
⭐
100
takes in a sequence of lip images, and predicts the phonemes being said.
Convmelspec
⭐
100
Convmelspec: Convertible Melspectrograms via 1D Convolutions
Wavevae
⭐
99
A Pytorch implementation of WaveVAE ("Parallel Neural Text-to-Speech")
Realbook
⭐
95
Easier audio-based machine learning with TensorFlow.
Self Attention Tacotron
⭐
91
An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" https://arxiv.org/abs/1810.11960
Norbert
⭐
88
Painless Wiener filters for audio separation
Voicesplit
⭐
82
VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram
Cyclegan Vc3
⭐
82
Voice Conversion by CycleGAN (语音克隆/语音转换):CycleGAN-VC3
Meta Tasnet
⭐
81
A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation
Specgan
⭐
81
SpecGAN - generate audio with adversarial training
Tacotron2
⭐
79
An implementation of Tacotron and Tacotron2
Nonparaseq2seqvc_code
⭐
77
Implementation code of non-parallel sequence-to-sequence VC
Spectrogram
⭐
73
80MHz bandwidth with LimeSDR-Mini and GQRX
Cnn_vocoder
⭐
69
A fast cnn-based vocoder
Pix2pix Timbre Transfer
⭐
68
Musical Timbre Transfer using the Pix2Pix architecture
Vak
⭐
67
A neural network framework for researchers studying acoustic communication
Wonambi
⭐
66
Package to analyze EEG, ECoG and other electrophysiology formats. It allows for visualization of the results and for a GUI that can be used to score sleep stages.
Bubblesub
⭐
64
Simple extensible ASS subtitle editor for Linux
Sound Based Bird Species Detection
⭐
62
Sound-based Bird Classification - using AI, acoustics and ornithology to classify birds in the environment, an environmental awareness project (Web Application, Flask, Python)
Deepspectrum
⭐
60
Cnns Speech Music Discrimination
⭐
59
A deep learning framework for Speech-Music discrimination of continuous audio streams
Advoc
⭐
58
Vocode spectrograms to audio with generative adversarial networks
Open Unmix Nnabla
⭐
54
Open-Unmix - Music Source Separation for NNabla
Cyclevae Vc Neuralvoco
⭐
53
Speakerdiarization_rnn_cnn_lstm
⭐
52
Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should state when speaker starts and ends. In this project, we analyze given audio file with 2 channels and 2 speakers (on separate channels).
Vae_tacotron2
⭐
51
VAE Tacotron 2, an alternative of GST Tacotron
Auorange
⭐
51
Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet
Neural Classifiers With Few Audio
⭐
50
Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274
Tacotron
⭐
50
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Crowdai Musical Genre Recognition Starter Kit
⭐
49
Tacotron2 Wavenet Korean Tts
⭐
49
Korean TTS, Tacotron2, Wavenet
Birdcall Identification Competition
⭐
49
2nd place in the Cornell Birdcall Identification competition
Clari_wavenet_vocoder
⭐
46
Tacotron2 Mandarin
⭐
45
PyTorch reimplementation of Tacotron2 in Mandarin
Audionet
⭐
44
Audio Classification using Image Classification
Smart Nar_fast_tts
⭐
43
Spectrogram Inversion
⭐
43
spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io
Acousticeventdetection
⭐
43
Source code complementing our paper for acoustic event classification using convolutional neural networks.
Karafan
⭐
42
The BEST music separation model with help of A.I. ... to my ears ! 👂👂
Deep Music Tagger
⭐
41
Music genre classification model using CRNN
Tacotron2
⭐
40
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Acousticrakereceiver
⭐
40
The acoustic rake receiver, a microphone beamformer that uses echoes to improve the noise and interference suppression. Python code to reproduce all the results from Raking the Cocktail Party by Ivan Dokmanic, Robin Scheibler, and Martin Vetterli.
Voice Filter
⭐
40
A unofficial Pytorch implementation of Google's VoiceFilter
Dialectid_e2e
⭐
38
End to End Dialect Identification using Convolutional Neural Network
Music Artist Classification Crnn
⭐
38
Supplementary material for IJCNN paper "Musical Artist Classification with Convolutoinal Recurrent Neural Networks"
Kaggle Whales
⭐
37
code for the Whale Detection Challenge competition on Kaggle
Tacotron 2 Explained
⭐
37
Walk through insanely commented code for an advanced recurrent model in TensorFlow
Segmentationcnn
⭐
36
Music segmentation using convolutional neural networks.
Acoustic_indices
⭐
35
Acoustic indices in python
Asam
⭐
35
This is the code&dataset for our paper [Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment. AAAI 2018]
Audio Auto Tagging
⭐
35
Convolutional Neural Network for auto-tagging of audio clips on MagnaTagATune dataset
Multi Source Sound Localization
⭐
33
This repo aims to perform sound localization in complex audiovisual scenes, where there multiple objects making sounds.
Stft
⭐
33
Spectrogram calculation for NumPy
Mxnet Audio
⭐
33
Implementation of music genre classification, audio-to-vec, song recommender, and music search in mxnet
Freesound Classification
⭐
32
Code for the 3rd place solution to Freesound Audio Tagging 2019 Challenge
Instrument Classification
⭐
31
🎣 Classify Flute, Clarinet and Trumpet
Interactive Spectrogram Inpainting
⭐
31
Implementation of the framework described in the paper Spectrogram Inpainting for Interactive Generation of Instrument Sounds published at the 2020 Joint Conference on AI Music Creativity.
Multi_speaker_tts
⭐
31
Implementation of Multi speaker TTS
Related Searches
Python Django (28,897)
Python Machine Learning (17,806)
Python Flask (17,643)
Python Pytorch (14,934)
Python Dataset (14,792)
Python Docker (14,113)
Python Tensorflow (13,736)
Python Command Line (13,351)
Python Deep Learning (13,095)
Python Jupyter Notebook (12,976)
1-100 of 211 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.