Awesome Open Source

Programming Languages

Search results for python spectrogram

211 search results found

Ultimatevocalremovergui ⭐ 12,990

GUI for a Vocal Remover that uses Deep Neural Networks.

Demucs ⭐ 7,127

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Dejavu ⭐ 6,108

Audio fingerprinting and recognition in Python

Deep Voice Conversion ⭐ 3,739

Deep neural networks for voice conversion (voice style transfer) in Tensorflow

Audio ⭐ 2,321

Data manipulation and transformation for audio signal processing, powered by PyTorch

Audioflux ⭐ 1,968

A library for audio and music analysis, feature extraction.

Wavenet_vocoder ⭐ 1,617

WaveNet vocoder

Audiomentations ⭐ 1,605

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Hifi Gan ⭐ 1,376

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Wavegan ⭐ 1,165

WaveGAN: Learn to synthesize raw audio with generative adversarial networks

Vocal Remover ⭐ 1,148

Vocal Remover using Deep Neural Networks

Open Unmix Pytorch ⭐ 990

Open-Unmix - Music Source Separation for PyTorch

Nnaudio ⭐ 882

Audio processing by using pytorch 1D convolution network

Melgan Neurips ⭐ 872

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

kapre: Keras Audio Preprocessors

Multilingual_text_to_speech ⭐ 740

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Friture ⭐ 720

Real-time audio visualizations (spectrum, spectrogram, etc.)

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Free Spoken Digit Dataset ⭐ 518

A free audio dataset of spoken digits. Think MNIST for audio.

Speech Enhancement ⭐ 515

Deep learning for audio denoising

Specaugment ⭐ 411

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Randomcnn Voice Transfer ⭐ 377

Audio style transfer with shallow random parameters CNN.

Torchlibrosa ⭐ 348

Neural_network_voices ⭐ 327

This is the code for "Neural Network Voices" by Siraj Raval on Youtube

Argus Freesound ⭐ 266

Kaggle | 1st place solution for Freesound Audio Tagging 2019

Crnn Audio Classification ⭐ 249

UrbanSound classification using Convolutional Recurrent Networks in PyTorch

Spoken Language Identification ⭐ 216

Spoken language identification with deep learning

Spectrographic ⭐ 205

Turn an image into sound whose spectrogram looks like the image.

Tacotron_pytorch ⭐ 204

PyTorch implementation of Tacotron speech synthesis model.

Squeezewave ⭐ 184

Python AUdio Recording and Analysis (paura)

Parallel Wavenet Vocoder ⭐ 149

A WaveNet-based vocoder for fast inference

Depression Detect ⭐ 146

Predicting depression from acoustic features of speech using a Convolutional Neural Network.

Griffin_lim ⭐ 126

Implementation of the Griffin and Lim algorithm to recover an audio signal from a magnitude-only spectrogram.

Music Audio Tagging At Scale Models ⭐ 117

Tensorflow implementation of the models used in "End-to-end learning for music audio tagging at scale"

Tacotron2 Pytorch ⭐ 115

Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.

Kaggle Freesound Audio Tagging ⭐ 110

8th place solution (on Kaggle) to the Freesound General-Purpose Audio Tagging Challenge (DCASE 2018 - Task 2)

Spectral_connectivity ⭐ 109

Frequency domain estimation and functional and directed connectivity analysis tools for electrophysiological data

Untwist ⭐ 108

Pggan Pytorch ⭐ 104

Progressively Growing GAN in PyTorch for Image and Sound generation

Big Impulse Response Dataset

Videotovoice ⭐ 100

takes in a sequence of lip images, and predicts the phonemes being said.

Convmelspec ⭐ 100

Convmelspec: Convertible Melspectrograms via 1D Convolutions

A Pytorch implementation of WaveVAE ("Parallel Neural Text-to-Speech")

Realbook ⭐ 95

Easier audio-based machine learning with TensorFlow.

Self Attention Tacotron ⭐ 91

An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" https://arxiv.org/abs/1810.11960

Painless Wiener filters for audio separation

Voicesplit ⭐ 82

VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram

Cyclegan Vc3 ⭐ 82

Voice Conversion by CycleGAN (语音克隆/语音转换)：CycleGAN-VC3

Meta Tasnet ⭐ 81

A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation

SpecGAN - generate audio with adversarial training

Tacotron2 ⭐ 79

An implementation of Tacotron and Tacotron2

Nonparaseq2seqvc_code ⭐ 77

Implementation code of non-parallel sequence-to-sequence VC

Spectrogram ⭐ 73

80MHz bandwidth with LimeSDR-Mini and GQRX

Cnn_vocoder ⭐ 69

A fast cnn-based vocoder

Pix2pix Timbre Transfer ⭐ 68

Musical Timbre Transfer using the Pix2Pix architecture

A neural network framework for researchers studying acoustic communication

Package to analyze EEG, ECoG and other electrophysiology formats. It allows for visualization of the results and for a GUI that can be used to score sleep stages.

Bubblesub ⭐ 64

Simple extensible ASS subtitle editor for Linux

Sound Based Bird Species Detection ⭐ 62

Sound-based Bird Classification - using AI, acoustics and ornithology to classify birds in the environment, an environmental awareness project (Web Application, Flask, Python)

Deepspectrum ⭐ 60

Cnns Speech Music Discrimination ⭐ 59

A deep learning framework for Speech-Music discrimination of continuous audio streams

Vocode spectrograms to audio with generative adversarial networks

Open Unmix Nnabla ⭐ 54

Open-Unmix - Music Source Separation for NNabla

Cyclevae Vc Neuralvoco ⭐ 53

Speakerdiarization_rnn_cnn_lstm ⭐ 52

Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should state when speaker starts and ends. In this project, we analyze given audio file with 2 channels and 2 speakers (on separate channels).

Vae_tacotron2 ⭐ 51

VAE Tacotron 2, an alternative of GST Tacotron

Auorange ⭐ 51

Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet

Neural Classifiers With Few Audio ⭐ 50

Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274

Tacotron ⭐ 50

A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis

Crowdai Musical Genre Recognition Starter Kit ⭐ 49

Tacotron2 Wavenet Korean Tts ⭐ 49

Korean TTS, Tacotron2, Wavenet

Birdcall Identification Competition ⭐ 49

2nd place in the Cornell Birdcall Identification competition

Clari_wavenet_vocoder ⭐ 46

Tacotron2 Mandarin ⭐ 45

PyTorch reimplementation of Tacotron2 in Mandarin

Audionet ⭐ 44

Audio Classification using Image Classification

Smart Nar_fast_tts ⭐ 43

Spectrogram Inversion ⭐ 43

spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io

Acousticeventdetection ⭐ 43

Source code complementing our paper for acoustic event classification using convolutional neural networks.

The BEST music separation model with help of A.I. ... to my ears ! 👂👂

Deep Music Tagger ⭐ 41

Music genre classification model using CRNN

Tacotron2 ⭐ 40

A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".

Acousticrakereceiver ⭐ 40

The acoustic rake receiver, a microphone beamformer that uses echoes to improve the noise and interference suppression. Python code to reproduce all the results from Raking the Cocktail Party by Ivan Dokmanic, Robin Scheibler, and Martin Vetterli.

Voice Filter ⭐ 40

A unofficial Pytorch implementation of Google's VoiceFilter

Dialectid_e2e ⭐ 38

End to End Dialect Identification using Convolutional Neural Network

Music Artist Classification Crnn ⭐ 38

Supplementary material for IJCNN paper "Musical Artist Classification with Convolutoinal Recurrent Neural Networks"

Kaggle Whales ⭐ 37

code for the Whale Detection Challenge competition on Kaggle

Tacotron 2 Explained ⭐ 37

Walk through insanely commented code for an advanced recurrent model in TensorFlow

Segmentationcnn ⭐ 36

Music segmentation using convolutional neural networks.

Acoustic_indices ⭐ 35

Acoustic indices in python

This is the code&dataset for our paper [Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment. AAAI 2018]

Audio Auto Tagging ⭐ 35

Convolutional Neural Network for auto-tagging of audio clips on MagnaTagATune dataset

Multi Source Sound Localization ⭐ 33

This repo aims to perform sound localization in complex audiovisual scenes, where there multiple objects making sounds.

Spectrogram calculation for NumPy

Mxnet Audio ⭐ 33

Implementation of music genre classification, audio-to-vec, song recommender, and music search in mxnet

Freesound Classification ⭐ 32

Code for the 3rd place solution to Freesound Audio Tagging 2019 Challenge

Instrument Classification ⭐ 31

🎣 Classify Flute, Clarinet and Trumpet

Interactive Spectrogram Inpainting ⭐ 31

Implementation of the framework described in the paper Spectrogram Inpainting for Interactive Generation of Instrument Sounds published at the 2020 Joint Conference on AI Music Creativity.

Multi_speaker_tts ⭐ 31

Implementation of Multi speaker TTS

Related Searches

Python Django (28,897)

Python Machine Learning (17,806)

Python Flask (17,643)

Python Pytorch (14,934)

Python Dataset (14,792)

Python Docker (14,113)

Python Tensorflow (13,736)

Python Command Line (13,351)

Python Deep Learning (13,095)

Python Jupyter Notebook (12,976)

1-100 of 211 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.