Awesome Open Source

Programming Languages

Search results for mfcc

53 search results found

Numpy Ml ⭐ 14,162

Machine learning, in numpy

Aubio ⭐ 3,082

a library for audio and music analysis

Audioflux ⭐ 1,968

A library for audio and music analysis, feature extraction.

Emotion Recognition Using Speech ⭐ 392

Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras

🔉 spafe: Simplified Python Audio Features Extraction

.NET DSP library with a lot of audio processing functions

A C++ Library for Audio Analysis

Speech_signal_processing_and_classification ⭐ 203

Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a pre-requisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interesting in voice disorder classification. That is, to develop two-class classifiers, which can discriminate between utterances of a subject suffering from say vocal fold paralysis and utterances of a healthy subject.The mathematical modeling of th

A suite of speech signal processing tools

Pyaudioprocessing ⭐ 175

Audio feature extraction and classification

Kaldifeat ⭐ 161

Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API

Diffsptk ⭐ 142

A differentiable version of SPTK

Voice Based Gender Recognition ⭐ 122

🔉 👦 👧Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)

Subsync ⭐ 113

Synchronize your subtitles using machine learning

Mevonai Speech Emotion Recognition ⭐ 112

Identify the emotion of multiple speakers in an Audio Segment

Speech Emotion Recognition ⭐ 78

Detecting emotions using MFCC features of human speech using Deep Learning

A simple audio feature extraction library

Speaker Identification ⭐ 76

A program for automatic speaker identification using deep learning techniques.

Python_kaldi_features ⭐ 39

python codes to extract MFCC and FBANK speech features for Kaldi

Node Personal Wakeword ⭐ 37

Personal wake word detector

A implementation of Power Normalized Cepstral Coefficients: PNCC

Spectra extraction tutorials based on torch and torchaudio.

Pytorch Mfcc ⭐ 30

A pytorch implementation of MFCC.

Zaf Matlab ⭐ 27

Zafar's Audio Functions in Matlab for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT spectrogram, CQT chromagram, DCT, DST, MDCT, inverse MDCT.

Vamp Aubio Plugins ⭐ 26

aubio plugins for Vamp

Alignmentduration ⭐ 23

Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignment is explicitly aware of durations of musical notes. The phonetic model are classified with MLP Deep Neural Network.

Cqhc Python ⭐ 22

Constant-Q harmonic coefficients (CQHCs), a timbre feature designed for music signals.

Convolutionaneuralnetworkstoenhancecodedspeech ⭐ 22

In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral domain features. The proposed postprocessors in both domains are evaluated for various narrowband and wideband speech codecs in a wide range of conditions

Speakervoiceidentifier ⭐ 22

SpeakerVoiceIdentifier can recognize the voice of a speaker by learning.

Dtw_digital_voice_recognition ⭐ 21

基于DTW与MFCC特征进行数字0-9的语音识别，DTW，MFCC，语音识别，中英数据，端点检测，D Voice Recognition。

Fragit Main ⭐ 20

FragIt main repository

Live Audio Mfcc ⭐ 20

Live Audio MFCC Visualization in the browser using Web Audio API - https://pulakk.github.io/Live-Audio-MFCC/tutorial

Emergency Vehicle Detection ⭐ 17

Python implementation of papers on Emergency Vehicle Detection

Basicsmusicalinstrumclassifi ⭐ 16

Basics of Musical Instruments Classification using Machine Learning

Timit Preprocessor ⭐ 14

Extract mfcc vectors and phones from TIMIT dataset

Audio Genre Classification ⭐ 12

Automatic music genre classification using Machine Learning algorithms like- Logistic Regression and K-Nearest Neighbours

A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech Recognition (ASR).

Speech_signal_processing ⭐ 11

Zaf Python ⭐ 11

Zafar's Audio Functions in Python for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT spectrogram, CQT chromagram, DCT, DST, MDCT, inverse MDCT.

Voice Based Speaker Identification ⭐ 11

🔉 👦 👧 👩 👨 Speaker identification using voice MFCCs and GMM

Mandarin Tone Classification ⭐ 10

Deep learning using CNN for tone classification of 4 Mandarin Chinese tones

Lpvspectral.jl ⭐ 10

Least-squares (sparse) spectral estimation and (sparse) LPV spectral decomposition.

Gmm_digital_voice_recognition ⭐ 8

基于GMM与MFCC特征进行数字0-9的语音识别，GMM，MFCC，语音识别，中文数据，sklear Voice Recognition。

Personality Trait Prediction ⭐ 7

Big Five personality trait prediction on First Impressions V2 dataset

Speaker Diarization ⭐ 6

Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python

Spoken Language Identification ⭐ 6

Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features

Matlab_feat ⭐ 6

Functions for creating speech features in MATLAB.

Asr Paper ⭐ 6

🔥 ASR教程: https://dataxujing.github.io/ASR-paper/

Tiger Costume Voice Conversion ⭐ 6

Voice Alignment and Conversion with Neural Networks and the WORLD codec.

🎙🔊 DCNN for immersive soundscapes in AR environments

Spoken Digit Recognition ⭐ 6

Classifying English spoken digit by Hidden Markov Model

Speech Accent Detection ⭐ 6

The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines accent based audio record. The result of the model could be used to determine accents and help decrease accents to English learning students and improve accents by training.

Voice_actor_recog ⭐ 5

Extract MFCC from movie files and detect speaker using it

1-53 of 53 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.