Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for dataset speech recognition
dataset
x
speech-recognition
x
35 search results found
Awesome Diarization
⭐
1,384
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Vad
⭐
632
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Free Spoken Digit Dataset
⭐
518
A free audio dataset of spoken digits. Think MNIST for audio.
Speech Recognition Uk
⭐
262
Speech Recognition for Ukrainian
Speech_dataset
⭐
229
The dataset of Speech Recognition
Ai Audio Datasets
⭐
199
This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.
Voice_activity_detection
⭐
171
Voice Activity Detection based on Deep Learning & TensorFlow
Rnnt Speech Recognition
⭐
152
End-to-end speech recognition using RNN Transducers in Tensorflow 2.0
End To End Lipreading
⭐
147
Pytorch code for End-to-End Audiovisual Speech Recognition
Chinese Speech To Text
⭐
144
Chinese Speech To Text Using Wavenet
Mongolian Nlp
⭐
126
Useful resources for Mongolian NLP
How2 Dataset
⭐
125
This repository contains code and metadata of How2 dataset
Audiomate
⭐
123
Python library for handling audio datasets.
Cv Dataset
⭐
120
Metadata and versioning details for the Common Voice dataset
Mongolian Speech Recognition
⭐
86
Mongolian speech recognition with PyTorch
Download_audioset
⭐
81
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Pyspeechrev
⭐
62
This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of acoustic impulse responses.
Keras Sincnet
⭐
49
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Speech2face
⭐
43
Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL
Automatic Speech Recognition
⭐
43
Automatic Speech Recognition using Tensorflow
Vocalforge
⭐
39
Your one-stop solution for voice dataset creation
Itri Speech Recognition Dataset Generation
⭐
26
Automatic Speech Recognition Dataset Generation
Ucla Phonetic Corpus
⭐
25
Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION
Chinese Speech Emotion Datasets
⭐
23
Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.
Meta Transfer Learning
⭐
22
Implementation of meta-transfer-learning (ACL 2020)
Speech Commands Classification By Lstm Pytorch
⭐
19
Classification of 11 types of audio clips using MFCCs features and LSTM. Pretrained on Speech Command Dataset with intensive data augmentation.
Automatic Speech Recognition For Speech Assessment Of Persian Preschool Children
⭐
18
Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The COVID-19 pandemic has highlighted the necessity of online assessment for preschool children. One of the areas that should be tested is their ability to speak. Employing an Automatic Speech Recognition (ASR) system would not help since they are pre-trained on voices that differ from children's in terms of frequency and amplitude. Because most of these are pre-tr
Timitspeech
⭐
17
Speech recognition on the TIMIT (or any other) dataset
Speech Recognition Transfer Learning
⭐
16
Speech command recognition DenseNet transfer learning from UrbanSound8k in keras tensorflow
Speech Command Recognition With Capsule Network
⭐
15
Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.
Spoken Digit Recognition
⭐
14
🎙️Spoken Digit Recognition with LSTM
Timit Preprocessor
⭐
14
Extract mfcc vectors and phones from TIMIT dataset
Korean Speech Recognition Quartznet
⭐
14
Jasper 기반 양자화된 모델인 Quartznet 한국어 음성인식
Megs
⭐
14
A merged version of multiple open-source German speech datasets.
Learning_invariances_in_speech_recognition
⭐
13
In this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated represen- tations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Dataset and tested on different scenarios. A main problem on speech recognition consists in the differences on pronunciations of words among different people: o
Arabic Speech Recognition
⭐
12
This repository contains my attempt to use two famous speech recognition frameworks (Kaldi, CMU Sphinx4) for Arabic Language using the publicly-available dataset "Arabic Corpus of Isolated Words"
13.3 Hours Chinese Mandarin Synthesis Corpus Female Emotional
⭐
11
Chinese Mandarin Synthesis Corpus-Female/Emotional
Galvasr
⭐
11
ASR library
Recurrentnn_speechrecognition
⭐
10
A model based in Tensorflow to recognize words from the 30 word Speech Commands Dataset from Google using LSTM based Recurrent Neural Network.
Unified_multilingual_dataset_of_emotional_human_utterances
⭐
5
A unified dataset of multilingual emotional human utterances
Mfcc_ctc_speech
⭐
5
apply mfcc feature of waveform with the LSTM + CTC loss architecture
Related Searches
Python Dataset (14,792)
Jupyter Notebook Dataset (6,824)
Deep Learning Dataset (2,364)
Machine Learning Dataset (2,279)
Dataset Pytorch (1,847)
Dataset Tensorflow (1,583)
Dataset Classification (1,500)
Dataset Convolutional Neural Networks (1,264)
Dataset Paper (1,252)
Javascript Dataset (1,014)
1-35 of 35 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.