Project Name	Stars	Most Recent Commit	Open Issues	License	Language
Dtln	470	9 months ago	31	mit	Python
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
Tfg Voice Conversion	109	7 years ago	6	gpl-3.0	Python
Deep Learning-based Voice Conversion system
Keras Sincnet	49	3 years ago	5		Python
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Convolutionaneuralnetworkstoenhancecodedspeech	22	4 years ago	n,ull	bsd-3-clause	Python
In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral domain features. The proposed postprocessors in both domains are evaluated for various narrowband and wideband speech codecs in a wide range of conditions. The proposed postprocessor improves speech quality (PESQ) by up to 0.25 MOS-LQO points for G.711, 0.30 points for G.726, 0.82 points for G.722, and 0.26 points for adaptive multirate wideband codec (AMR-WB). In a subjective CCR listening test, the proposed postprocessor on G.711-coded speech exceeds the speech quality of an ITU-T-standardized postfilter by 0.36 CMOS points, and obtains a clear preference of 1.77 CMOS points compared to G.711, even en par with uncoded speech.
Great Deep Learning Books	16	a year ago		mit
A Great Collection of Deep Learning (e)Books
Speech Emotion Recognition	13	2 years ago		mit	Python
A program that uses neural networks to detect emotions from pre-recorded and real-time speech
Overlap Detection	11	6 years ago	2		Python
Overlapped Speech detection in Multi-party Conversations

Alternatives To Overlap Detection

Select To Compare

Dtln ⭐ 470

Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.

most recent commit 9 months ago

Tfg Voice Conversion ⭐ 109

Deep Learning-based Voice Conversion system

most recent commit 7 years ago

Keras Sincnet ⭐ 49

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

most recent commit 3 years ago

Convolutionaneuralnetworkstoenhancecodedspeech ⭐ 22

In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral domain features. The proposed postprocessors in both domains are evaluated for various narrowband and wideband speech codecs in a wide range of conditions

most recent commit 4 years ago

Great Deep Learning Books ⭐ 16

A Great Collection of Deep Learning (e)Books

most recent commit a year ago

Speech Emotion Recognition ⭐ 13

A program that uses neural networks to detect emotions from pre-recorded and real-time speech

most recent commit 2 years ago

Overlap Detection ⭐ 11

Overlapped Speech detection in Multi-party Conversations

most recent commit 6 years ago

Suggest An Alternative To Overlap-Detection

Alternative Project Comparisons

Overlap Detection vs Dtln

Overlap Detection vs Tfg Voice Conversion

Overlap Detection vs Keras Sincnet

Overlap Detection vs Convolutionaneuralnetworkstoenhancecodedspeech

Overlap Detection vs Great Deep Learning Books

Overlap Detection vs Speech Emotion Recognition

Popular Speech Processing Projects

Speechbrain ⭐ 7,166

A PyTorch-based Speech Toolkit

most recent commit 3 months ago

Awesome Multimodal Ml ⭐ 5,399

Reading list for research topics in multimodal machine learning

most recent commit 20 days ago

Pyannote Audio ⭐ 4,460

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

dependent packages 13total releases 24latest release December 01, 2023most recent commit 3 months ago

Torchscale ⭐ 2,804

Foundation Architecture for (M)LLMs

dependent packages 8total releases 5latest release October 20, 2023most recent commit 3 months ago

Deepvoice3_pytorch ⭐ 1,906

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

most recent commit 4 months ago

Popular Keras Projects

Tqdm ⭐ 26,694

:zap: A Fast, Extensible Progress Bar for Python and CLI

dependent packages 15,455total releases 135latest release August 10, 2023most recent commit 4 months ago

Netron ⭐ 26,068

Visualizer for neural network, deep learning and machine learning models

dependent packages 70total releases 610latest release December 09, 2023most recent commit 2 days ago

Data Science Ipython Notebooks ⭐ 25,668

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

most recent commit 6 months ago

Mask_rcnn ⭐ 23,745

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

total releases 5latest release March 05, 2019most recent commit 4 months ago

100 Days Of Ml Code ⭐ 20,750

100-Days-Of-ML-Code中文版

most recent commit 2 years ago

Popular Machine Learning Categories

Natural Language Processing

Neural Network

Neural

Computer Vision

Convolutional Neural Networks

Opencv