Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for deep learning speech recognition
deep-learning
x
speech-recognition
x
135 search results found
Transformers
⭐
124,049
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Deepspeech
⭐
24,127
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Deeplearningexamples
⭐
12,073
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Deep Learning Drizzle
⭐
10,767
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Nemo
⭐
9,041
NeMo: a toolkit for conversational AI
Faster Whisper
⭐
8,711
Faster Whisper transcription with CTranslate2
Espnet
⭐
7,563
End-to-End Speech Processing Toolkit
Asrt_speechrecognition
⭐
7,253
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Speechbrain
⭐
7,166
A PyTorch-based Speech Toolkit
Vosk Api
⭐
6,633
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Wav2letter
⭐
6,326
Facebook AI Research's Automatic Speech Recognition Toolkit
Openvino
⭐
5,316
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Whisper Jax
⭐
3,824
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Automatic_speech_recognition
⭐
2,743
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Ml Road
⭐
2,742
Machine Learning Resources, Practice and Research
Willow
⭐
2,223
Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
Tensorflow Speech Recognition
⭐
2,150
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Pytorch Kaldi
⭐
2,138
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stt
⭐
1,988
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Delta
⭐
1,584
DELTA is a deep learning based natural language and speech processing platform.
Lip Reading Deeplearning
⭐
1,433
🔓 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
Ios_ml
⭐
1,406
List of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.
Openseq2seq
⭐
1,393
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Awesome Diarization
⭐
1,384
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Whisper Timestamped
⭐
1,217
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Speech Emotion Analyzer
⭐
1,155
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Kur
⭐
812
Descriptive Deep Learning
Lhotse
⭐
794
Tools for handling speech data in machine learning projects.
Sincnet
⭐
764
SincNet is a neural architecture for efficiently processing raw audio samples.
Ppasr
⭐
701
基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目
Libreasr
⭐
647
💬 An On-Premises, Streaming Speech Recognition System
Speech To Text Benchmark
⭐
570
speech to text benchmark framework
Paddlepaddle Deepspeech
⭐
536
基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows, Jetson开发板预测。
Masr
⭐
462
Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conforme
Dla
⭐
421
Deep learning for audio processing
Nmtpytorch
⭐
395
Sequence-to-Sequence Framework in PyTorch
Caffe Speech Recognition
⭐
320
Speech Recognition with the Caffe deep learning framework, migrating to
Bmlist
⭐
297
A List of Big Models
Libfaceid
⭐
290
libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Deep Learning Papers For Fish
⭐
248
a list of pappers in deep learning for new-comes.
Kerasdeepspeech
⭐
244
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Speech_dataset
⭐
229
The dataset of Speech Recognition
Ltu
⭐
223
Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".
Rnn_ctc
⭐
216
Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Ai Audio Datasets
⭐
199
This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.
Willow Inference Server
⭐
190
Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS
Hey Jetson
⭐
189
Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Voice_activity_detection
⭐
171
Voice Activity Detection based on Deep Learning & TensorFlow
Chinese Automatic Speech Recognition
⭐
157
Chinese speech recognition
Bidirectional_rnn
⭐
152
bidirectional lstm
Rnnt Speech Recognition
⭐
152
End-to-end speech recognition using RNN Transducers in Tensorflow 2.0
Speech2text
⭐
148
A Deep-Learning-Based Persian Speech Recognition System
Tevr Asr Tool
⭐
132
State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.
Speech Recognition Neural Network
⭐
128
This is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.
Tensorflow Ctc Speech Recognition
⭐
127
Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).
Summary
⭐
126
summaries of all the papers I read
Mongolian Nlp
⭐
126
Useful resources for Mongolian NLP
Spokestack Python
⭐
124
Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.
Deep Learning Papers Reading Roadmap
⭐
122
深度学习论文阅读路线图
Cep
⭐
120
CEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.
Automatic Speech Recognition
⭐
116
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
Masr
⭐
113
中文语音识别系列,读者可以借助它快速训练属于自己的中文语音识别模型,或直接使用预训练模型测试效果。
Zeta
⭐
106
Build high-performance AI models with modular building blocks
Las_mandarin_pytorch
⭐
104
Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)
Pytorch Speech Commands
⭐
98
Speech commands recognition with PyTorch
Speech Representations
⭐
97
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
Whisper Auto Transcribe
⭐
91
Auto transcribe tool based on whisper
Mongolian Speech Recognition
⭐
86
Mongolian speech recognition with PyTorch
Speech Emotion Recognition
⭐
78
Detecting emotions using MFCC features of human speech using Deep Learning
Awesome Openai Whisper
⭐
72
A curated list of awesome OpenAI's Whisper
Wav2letter
⭐
70
Speech Recognition model based off of FAIR research paper built using Pytorch.
Wav2letter.pytorch
⭐
67
A fully convolution-network for speech-to-text, built on pytorch.
Python Deep Learning Projects
⭐
67
Codebase for my book "Python DeepLearning Projects" | Learn applied deep learning for various use-cases on NLP, CV and ASR using TensorFlow and Keras. Book link.
Ai Study
⭐
63
人工智能学习资料超全整理,包含机器学习基础ML、深度学习基础DL、计算机视觉CV、自然语言处理NLP
Papers
⭐
60
A list of paper, books and sites for various different topics related to machine learning and deep learning along with various field in which it is implemented
Tf Speech Recognition Challenge Solution
⭐
53
Source code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recogn The solution ranked in top 5% in private leaderboard.
Torchsubband
⭐
51
Pytorch implementation of subband decomposition
Keras Sincnet
⭐
49
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Asr
⭐
48
ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.
Triplet_loss_kws
⭐
47
Learning Efficient Representations for Keyword Spotting with Triplet Loss
Lip_reading_in_the_wild_avsr
⭐
46
Audio-Visual Speech Recognition using Deep Learning
A_chronology_of_deep_learning
⭐
45
Tracing back and exposing in chronological order the main ideas in the field of deep learning, to help everyone better understand the current intense research in AI.
Whispers2t
⭐
44
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Enginer
Noisy Student Training Asr
⭐
44
Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem
Automatic Speech Recognition
⭐
43
Automatic Speech Recognition using Tensorflow
Speech2face
⭐
43
Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL
Pywhisper
⭐
42
openai/whisper + extra features
Avsr Deep Speech
⭐
42
Google Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab
Deepspeech
⭐
40
A PyTorch implementation of DeepSpeech and DeepSpeech2.
Jetson Voice
⭐
39
ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT
Banglaspeech2text
⭐
38
BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned on the latest whisper speech to text model for optimal performance.
Hf Experiments
⭐
37
Experiments with Hugging Face 🔬 🤗
Wavencoder
⭐
36
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
Turkicasr
⭐
35
A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.
Deep Learning And Paper
⭐
33
【仅作为交流学习使用】机器智能--相关书目及经典论文包括AutoML、情感分类、语音识别、声纹识别、
Pncc
⭐
32
A implementation of Power Normalized Cepstral Coefficients: PNCC
Aniemore
⭐
28
Emotions recognition from audio and text files (only russian language)
Deepspeech Api
⭐
27
The code enables users to use Mozilla's Deep Speech model over the Web Browser.
Pytorch_mlp_for_asr
⭐
27
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
Ai Engine
⭐
25
Easy to use Multi-Provider ASR/Speech To Text and NLP engine
Related Searches
Python Deep Learning (18,303)
Jupyter Notebook Deep Learning (10,328)
Deep Learning Neural Network (5,801)
Deep Learning Pytorch (4,652)
Deep Learning Tensorflow (4,441)
Deep Learning Keras (3,084)
Deep Learning Computer Vision (3,017)
Deep Learning Natural Language Processing (2,283)
Deep Learning Neural (2,063)
Network Deep Learning (1,857)
1-100 of 135 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.