Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Dtln | 470 | 9 months ago | 31 | mit | Python | |||||
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support. | ||||||||||
Tfg Voice Conversion | 109 | 7 years ago | 6 | gpl-3.0 | Python | |||||
Deep Learning-based Voice Conversion system | ||||||||||
Keras Sincnet | 49 | 3 years ago | 5 | Python | ||||||
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet) | ||||||||||
Convolutionaneuralnetworkstoenhancecodedspeech | 22 | 4 years ago | n,ull | bsd-3-clause | Python | |||||
In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral domain features. The proposed postprocessors in both domains are evaluated for various narrowband and wideband speech codecs in a wide range of conditions. The proposed postprocessor improves speech quality (PESQ) by up to 0.25 MOS-LQO points for G.711, 0.30 points for G.726, 0.82 points for G.722, and 0.26 points for adaptive multirate wideband codec (AMR-WB). In a subjective CCR listening test, the proposed postprocessor on G.711-coded speech exceeds the speech quality of an ITU-T-standardized postfilter by 0.36 CMOS points, and obtains a clear preference of 1.77 CMOS points compared to G.711, even en par with uncoded speech. | ||||||||||
Great Deep Learning Books | 16 | a year ago | mit | |||||||
A Great Collection of Deep Learning (e)Books | ||||||||||
Speech Emotion Recognition | 13 | 2 years ago | mit | Python | ||||||
A program that uses neural networks to detect emotions from pre-recorded and real-time speech | ||||||||||
Overlap Detection | 11 | 6 years ago | 2 | Python | ||||||
Overlapped Speech detection in Multi-party Conversations |