Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
SpeechBrain | 7,166 | | | | 3 months ago | | | 149 | apache-2.0 | Python |
A PyTorch-based Speech Toolkit | ||||||||||
Awesome Multimodal ML | 5,399 | | | | 19 days ago | | | 8 | mit | |
Reading list for research topics in multimodal machine learning | ||||||||||
Awesome Diarization | 1,384 | | | | 3 months ago | | | 3 | apache-2.0 | |
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources. | ||||||||||
Whisper Timestamped | 1,217 | 3 | 3 months ago | 3 | December 08, 2023 | 15 | agpl-3.0 | Python | ||
Multilingual Automatic Speech Recognition with word-level timestamps and confidence | ||||||||||
SincNet | 764 | | | | 3 years ago | | | 22 | mit | Python |
SincNet is a neural architecture for efficiently processing raw audio samples. | ||||||||||
DTLN | 470 | | | | 9 months ago | | | 31 | mit | Python |
TensorFlow 2.x implementation of the DTLN real-time speech denoising model, with TF-Lite, ONNX, and real-time audio processing support. | ||||||||||
IMS Toucan | 426 | | | | 3 months ago | | | 29 | apache-2.0 | Python |
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality. | ||||||||||
Speech Denoising Wavenet | 414 | | | | 5 years ago | | | 29 | mit | Python |
A neural network for end-to-end speech denoising | ||||||||||
Neural Voice Cloning With Few Samples | 379 | | | | 3 years ago | | | | mit | Python |
This repository contains an implementation of "Neural Voice Cloning With Few Samples" | ||||||||||
MultiBench | 356 | | | | 6 months ago | | | 10 | mit | HTML |
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning | ||||||||||