Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Awesome Multimodal Ml | 5,399 | 21 days ago | 8 | mit | ||||||
Reading list for research topics in multimodal machine learning | ||||||||||
Torchscale | 2,804 | 8 | 3 months ago | 5 | October 20, 2023 | 18 | mit | Python | ||
Foundation Architecture for (M)LLMs | ||||||||||
Deepvoice3_pytorch | 1,906 | 4 months ago | 43 | other | Python | |||||
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models | ||||||||||
Awesome Diarization | 1,384 | 3 months ago | 3 | apache-2.0 | ||||||
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources. | ||||||||||
Whisper Timestamped | 1,217 | 3 | 3 months ago | 3 | December 08, 2023 | 15 | agpl-3.0 | Python | ||
Multilingual Automatic Speech Recognition with word-level timestamps and confidence | ||||||||||
Audino | 988 | 4 months ago | 52 | mit | JavaScript | |||||
Open source audio annotation tool for humans | ||||||||||
Speech Denoising Wavenet | 414 | 5 years ago | 29 | mit | Python | |||||
A neural network for end-to-end speech denoising | ||||||||||
Nnmnkwii | 375 | 15 | 1 | a year ago | 26 | January 04, 2022 | 6 | other | Python | |
Library to build speech synthesis systems designed for easy and fast prototyping. | ||||||||||
Surfboard | 369 | 2 years ago | 5 | July 17, 2020 | 8 | gpl-3.0 | Python | |||
Novoic's audio feature extraction library | ||||||||||
Multibench | 356 | 6 months ago | 10 | mit | HTML | |||||
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning |