Awesome Open Source

Search results for dataset speech recognition

35 search results found

Awesome Diarization ⭐ 1,384

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Free Spoken Digit Dataset ⭐ 518

A free audio dataset of spoken digits. Think MNIST for audio.

Speech Recognition Uk ⭐ 262

Speech Recognition for Ukrainian

Speech_dataset ⭐ 229

The dataset of Speech Recognition

Ai Audio Datasets ⭐ 199

This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.

Voice_activity_detection ⭐ 171

Voice Activity Detection based on Deep Learning & TensorFlow

Rnnt Speech Recognition ⭐ 152

End-to-end speech recognition using RNN Transducers in Tensorflow 2.0

End To End Lipreading ⭐ 147

Pytorch code for End-to-End Audiovisual Speech Recognition

Chinese Speech To Text ⭐ 144

Chinese Speech To Text Using Wavenet

Mongolian Nlp ⭐ 126

Useful resources for Mongolian NLP

How2 Dataset ⭐ 125

This repository contains code and metadata of How2 dataset

Audiomate ⭐ 123

Python library for handling audio datasets.

Cv Dataset ⭐ 120

Metadata and versioning details for the Common Voice dataset

Mongolian Speech Recognition ⭐ 86

Mongolian speech recognition with PyTorch

Download_audioset ⭐ 81

📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).

Pyspeechrev ⭐ 62

This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of acoustic impulse responses.

Keras Sincnet ⭐ 49

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Speech2face ⭐ 43

Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL

Automatic Speech Recognition ⭐ 43

Automatic Speech Recognition using Tensorflow

Vocalforge ⭐ 39

Your one-stop solution for voice dataset creation

Itri Speech Recognition Dataset Generation ⭐ 26

Automatic Speech Recognition Dataset Generation

Ucla Phonetic Corpus ⭐ 25

Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION

Chinese Speech Emotion Datasets ⭐ 23

Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.

Meta Transfer Learning ⭐ 22

Implementation of meta-transfer-learning (ACL 2020)

Speech Commands Classification By Lstm Pytorch ⭐ 19

Classification of 11 types of audio clips using MFCC features and an LSTM. Pretrained on the Speech Command Dataset with intensive data augmentation.

Automatic Speech Recognition For Speech Assessment Of Persian Preschool Children ⭐ 18

Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The COVID-19 pandemic has highlighted the necessity of online assessment for preschool children. One of the areas that should be tested is their ability to speak. Employing an Automatic Speech Recognition (ASR) system would not help since they are pre-trained on voices that differ from children's in terms of frequency and amplitude. Because most of these are pre-tr

Timitspeech ⭐ 17

Speech recognition on the TIMIT (or any other) dataset

Speech Recognition Transfer Learning ⭐ 16

Speech command recognition DenseNet transfer learning from UrbanSound8k in keras tensorflow

Speech Command Recognition With Capsule Network ⭐ 15

Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.

Spoken Digit Recognition ⭐ 14

🎙️Spoken Digit Recognition with LSTM

Timit Preprocessor ⭐ 14

Extract MFCC vectors and phones from the TIMIT dataset

Korean Speech Recognition Quartznet ⭐ 14

Korean speech recognition with QuartzNet, a quantized model based on Jasper

A merged version of multiple open-source German speech datasets.

Learning_invariances_in_speech_recognition ⭐ 13

In this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated representations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Dataset and tested on different scenarios. A main problem on speech recognition consists in the differences on pronunciations of words among different people: o

Arabic Speech Recognition ⭐ 12

This repository contains my attempt to use two famous speech recognition frameworks (Kaldi, CMU Sphinx4) for Arabic Language using the publicly-available dataset "Arabic Corpus of Isolated Words"

13.3 Hours Chinese Mandarin Synthesis Corpus Female Emotional ⭐ 11

Chinese Mandarin Synthesis Corpus-Female/Emotional

Recurrentnn_speechrecognition ⭐ 10

A TensorFlow-based model that recognizes words from Google's 30-word Speech Commands Dataset using an LSTM-based recurrent neural network.

Unified_multilingual_dataset_of_emotional_human_utterances ⭐ 5

A unified dataset of multilingual emotional human utterances

Mfcc_ctc_speech ⭐ 5

Applies MFCC features of the waveform to an LSTM + CTC loss architecture

Copyright 2018-2024 Awesome Open Source. All rights reserved.