Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Face Api.js | 15,377 | a month ago | 438 | mit | TypeScript | |||||
JavaScript API for face detection and face recognition in the browser and nodejs with tensorflow.js | ||||||||||
Asrt_speechrecognition | 6,942 | 10 days ago | 1 | October 23, 2020 | 101 | gpl-3.0 | Python | |||
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统 | ||||||||||
Lstm Human Activity Recognition | 3,074 | a year ago | 19 | mit | Jupyter Notebook | |||||
Human Activity Recognition example using TensorFlow on smartphone sensors dataset and an LSTM RNN. Classifying the type of movement amongst six activity categories - Guillaume Chevalier | ||||||||||
Automatic_speech_recognition | 2,743 | 2 years ago | 69 | mit | Python | |||||
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow | ||||||||||
Tensorflow Speech Recognition | 2,124 | a year ago | 32 | other | Python | |||||
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks | ||||||||||
Zh Ner Tf | 1,761 | 4 years ago | 67 | Python | ||||||
A very simple BiLSTM-CRF model for Chinese Named Entity Recognition 中文命名实体识别 (TensorFlow) | ||||||||||
Labelbox Custom Labeling Apps | 1,732 | 5 months ago | 40 | apache-2.0 | JavaScript | |||||
Explore example custom labeling apps built with Labelbox SDK | ||||||||||
Sequence_tagging | 1,725 | 4 years ago | 15 | apache-2.0 | Python | |||||
Named Entity Recognition (LSTM + CRF) - Tensorflow | ||||||||||
Alpr Unconstrained | 1,462 | 2 years ago | 106 | other | C | |||||
License Plate Detection and Recognition in Unconstrained Scenarios | ||||||||||
Lip Reading Deeplearning | 1,433 | 4 years ago | 1 | apache-2.0 | Python | |||||
:unlock: Lip Reading - Cross Audio-Visual Recognition using 3D Architectures |
Tensorflow Speech Recognition Challenge was a Kaggle competition organised by Google Brain to use the Speech Commands Dataset to build an algorithm that understands simple spoken commands. https://www.kaggle.com/c/tensorflow-speech-recognition-challenge
This solution achieved a rank of 63 on private leaderboard (top 5%).
Download the Speech Commands Dataset and extract the dataset in the train folder. Test Audio can be placed in data/test/audio folder.
The notebooks can be run individually using Jupyter. To run the scripts from command line edit the notebooks using Jupyter and run:
./script/execute_notebook.py
and select the notebook to run. The results are stored in results/notebook_name.log
P0 Predict Test WAV.ipynb can be used to predict audio files using a trained graphdef model.
The model was trained using a GCP instance with the following specifications:
Most of the models converged in 30k steps. Pseudo Labelling on test data was used to improve the model performance.
The final model was a ensemble 13 models. Weighted Averaging and Stacking was used to generate the final predictions.
If you like this project or have any queries don't hesitate to send an email to [email protected]