Project Name | Stars | Most Recent Commit | Open Issues | License | Language | Description |
---|---|---|---|---|---|---|
Awesome Diarization | 1,384 | 3 months ago | 3 | apache-2.0 | | A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources. |
Vad | 632 | 3 years ago | 32 | | MATLAB | Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset. |
Free Spoken Digit Dataset | 518 | a year ago | 7 | | Python | A free audio dataset of spoken digits. Think MNIST for audio. |
Speech Recognition Uk | 262 | 6 months ago | 6 | | Python | Speech Recognition for Ukrainian |
Speech_dataset | 229 | a year ago | 1 | apache-2.0 | | The dataset of Speech Recognition |
Ai Audio Datasets | 199 | 4 months ago | | mit | | This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc. |
Voice_activity_detection | 171 | 3 years ago | 5 | gpl-3.0 | Python | Voice Activity Detection based on Deep Learning & TensorFlow |
Rnnt Speech Recognition | 152 | 3 years ago | 13 | mit | Python | End-to-end speech recognition using RNN Transducers in Tensorflow 2.0 |
End To End Lipreading | 147 | a year ago | 11 | | Python | Pytorch code for End-to-End Audiovisual Speech Recognition |
Chinese Speech To Text | 144 | a year ago | 11 | apache-2.0 | Python | Chinese Speech To Text Using Wavenet |