Automatic Speech Recognition For Speech Assessment Of Persian Preschool Children

Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The COVID-19 pandemic has highlighted the necessity of online assessment for preschool children. One of the areas that should be tested is their ability to speak. Employing an Automatic Speech Recognition (ASR) system would not help since they are pre-trained on voices that differ from children's in terms of frequency and amplitude. Because most of these are pre-trained with data in a specific range of amplitude, their objectives do not make them ready for voices in different amplitudes. To overcome this issue, we added a new objective to the masking objective of the Wav2Vec 2.0 model called Random Frequency Pitch (RFP). In addition, we used our newly introduced dataset to fine-tune our model for Meaningless Words (MW) and Rapid Automatic Naming (RAN) tests. Using masking in concatenation with RFP outperforms the masking objective of Wav2Vec 2.0 by reaching a Word Error Rate (WER) of 1.35. Our new approach reaches a WER of 6.45 on the Persian section of the CommonVoice dataset. Furthermore, our novel methodology produces positive outcomes in zero- and few-shot scenarios.
Alternatives To Automatic Speech Recognition For Speech Assessment Of Persian Preschool Children
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Open_stt671
2 years ago3otherPython
Open STT
Speech_dataset229
a year ago1apache-2.0
The dataset of Speech Recognition
Asr Study131
7 years ago5mitPython
Implementation of all-neural speech recognition systems using Keras and Tensorflow
Cv Dataset120
4 months ago11mpl-2.0JavaScript
Metadata and versioning details for the Common Voice dataset
Speech Corpus Collection87
7 years agomit
A Collection of Speech Corpus for ASR and TTS
Mongolian Speech Recognition86
4 years agoPython
Mongolian speech recognition with PyTorch
Zerospeech Tts Without T79
4 years agomitPython
A Pytorch implementation for the ZeroSpeech 2019 challenge.
Keras Sincnet49
3 years ago5Python
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Odsqa41
2 years ago1Shell
ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET
Libriadapt34
3 years ago1Python
Instructions on downloading and using the LibriAdapt dataset
Alternatives To Automatic Speech Recognition For Speech Assessment Of Persian Preschool Children
Select To Compare


Alternative Project Comparisons
Popular Dataset Projects
Popular Asr Projects
Popular Data Processing Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Jupyter Notebook
Deep Learning
Dataset
Speech Recognition
Asr