Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for mfcc
mfcc
x
53 search results found
Numpy Ml
⭐
14,162
Machine learning, in numpy
Aubio
⭐
3,082
a library for audio and music analysis
Audioflux
⭐
1,968
A library for audio and music analysis, feature extraction.
Emotion Recognition Using Speech
⭐
392
Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras
Spafe
⭐
338
🔉 spafe: Simplified Python Audio Features Extraction
Nwaves
⭐
277
.NET DSP library with a lot of audio processing functions
Gist
⭐
266
A C++ Library for Audio Analysis
Speech_signal_processing_and_classification
⭐
203
Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a pre-requisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interesting in voice disorder classification. That is, to develop two-class classifiers, which can discriminate between utterances of a subject suffering from say vocal fold paralysis and utterances of a healthy subject.The mathematical modeling of th
Sptk
⭐
200
A suite of speech signal processing tools
Pyaudioprocessing
⭐
175
Audio feature extraction and classification
Kaldifeat
⭐
161
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
Diffsptk
⭐
142
A differentiable version of SPTK
Voice Based Gender Recognition
⭐
122
🔉 👦 👧Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)
Subsync
⭐
113
Synchronize your subtitles using machine learning
Mevonai Speech Emotion Recognition
⭐
112
Identify the emotion of multiple speakers in an Audio Segment
Speech Emotion Recognition
⭐
78
Detecting emotions using MFCC features of human speech using Deep Learning
Sonopy
⭐
78
A simple audio feature extraction library
Speaker Identification
⭐
76
A program for automatic speaker identification using deep learning techniques.
Python_kaldi_features
⭐
39
python codes to extract MFCC and FBANK speech features for Kaldi
Node Personal Wakeword
⭐
37
Personal wake word detector
Pncc
⭐
32
A implementation of Power Normalized Cepstral Coefficients: PNCC
Spectra
⭐
32
Spectra extraction tutorials based on torch and torchaudio.
Pytorch Mfcc
⭐
30
A pytorch implementation of MFCC.
Zaf Matlab
⭐
27
Zafar's Audio Functions in Matlab for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT spectrogram, CQT chromagram, DCT, DST, MDCT, inverse MDCT.
Vamp Aubio Plugins
⭐
26
aubio plugins for Vamp
Alignmentduration
⭐
23
Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignment is explicitly aware of durations of musical notes. The phonetic model are classified with MLP Deep Neural Network.
Cqhc Python
⭐
22
Constant-Q harmonic coefficients (CQHCs), a timbre feature designed for music signals.
Convolutionaneuralnetworkstoenhancecodedspeech
⭐
22
In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral domain features. The proposed postprocessors in both domains are evaluated for various narrowband and wideband speech codecs in a wide range of conditions
Speakervoiceidentifier
⭐
22
SpeakerVoiceIdentifier can recognize the voice of a speaker by learning.
Dtw_digital_voice_recognition
⭐
21
基于DTW与MFCC特征进行数字0-9的语音识别,DTW,MFCC,语音识别,中英数据,端点检测,D Voice Recognition。
Fragit Main
⭐
20
FragIt main repository
Live Audio Mfcc
⭐
20
Live Audio MFCC Visualization in the browser using Web Audio API - https://pulakk.github.io/Live-Audio-MFCC/tutorial
Emergency Vehicle Detection
⭐
17
Python implementation of papers on Emergency Vehicle Detection
Basicsmusicalinstrumclassifi
⭐
16
Basics of Musical Instruments Classification using Machine Learning
Timit Preprocessor
⭐
14
Extract mfcc vectors and phones from TIMIT dataset
Audio Genre Classification
⭐
12
Automatic music genre classification using Machine Learning algorithms like- Logistic Regression and K-Nearest Neighbours
Asr
⭐
11
A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech Recognition (ASR).
Speech_signal_processing
⭐
11
Zaf Python
⭐
11
Zafar's Audio Functions in Python for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT spectrogram, CQT chromagram, DCT, DST, MDCT, inverse MDCT.
Voice Based Speaker Identification
⭐
11
🔉 👦 👧 👩 👨 Speaker identification using voice MFCCs and GMM
Mandarin Tone Classification
⭐
10
Deep learning using CNN for tone classification of 4 Mandarin Chinese tones
Lpvspectral.jl
⭐
10
Least-squares (sparse) spectral estimation and (sparse) LPV spectral decomposition.
Gmm_digital_voice_recognition
⭐
8
基于GMM与MFCC特征进行数字0-9的语音识别,GMM,MFCC,语音识别,中文数据,sklear Voice Recognition。
Personality Trait Prediction
⭐
7
Big Five personality trait prediction on First Impressions V2 dataset
Speaker Diarization
⭐
6
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
Spoken Language Identification
⭐
6
Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features
Matlab_feat
⭐
6
Functions for creating speech features in MATLAB.
Asr Paper
⭐
6
🔥 ASR教程: https://dataxujing.github.io/ASR-paper/
Tiger Costume Voice Conversion
⭐
6
Voice Alignment and Conversion with Neural Networks and the WORLD codec.
Rirnet
⭐
6
🎙🔊 DCNN for immersive soundscapes in AR environments
Spoken Digit Recognition
⭐
6
Classifying English spoken digit by Hidden Markov Model
Speech Accent Detection
⭐
6
The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines accent based audio record. The result of the model could be used to determine accents and help decrease accents to English learning students and improve accents by training.
Voice_actor_recog
⭐
5
Extract MFCC from movie files and detect speaker using it
1-53 of 53 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.