Awesome Open Source

Search results for deep learning speech recognition

135 search results found

Transformers ⭐ 124,049

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Deepspeech ⭐ 24,127

DeepSpeech is an open-source embedded (offline, on-device) speech-to-text engine that can run in real time on devices ranging from a Raspberry Pi 4 to high-power GPU servers.

Deeplearningexamples ⭐ 12,073

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Deep Learning Drizzle ⭐ 10,767

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

NeMo: a toolkit for conversational AI

Faster Whisper ⭐ 8,711

Faster Whisper transcription with CTranslate2

Espnet ⭐ 7,563

End-to-End Speech Processing Toolkit

Asrt_speechrecognition ⭐ 7,253

A Deep-Learning-Based Chinese Speech Recognition System

Speechbrain ⭐ 7,166

A PyTorch-based Speech Toolkit

Vosk Api ⭐ 6,633

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Wav2letter ⭐ 6,326

Facebook AI Research's Automatic Speech Recognition Toolkit

Openvino ⭐ 5,316

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

Whisper Jax ⭐ 3,824

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Automatic_speech_recognition ⭐ 2,743

End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow

Ml Road ⭐ 2,742

Machine Learning Resources, Practice and Research

Willow ⭐ 2,223

Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative

Tensorflow Speech Recognition ⭐ 2,150

🎙 Speech recognition using the TensorFlow deep learning framework and sequence-to-sequence neural networks

Pytorch Kaldi ⭐ 2,138

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

Delta ⭐ 1,584

DELTA is a deep learning based natural language and speech processing platform.

Lip Reading Deeplearning ⭐ 1,433

🔓 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures

Ios_ml ⭐ 1,406

List of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.

Openseq2seq ⭐ 1,393

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Awesome Diarization ⭐ 1,384

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Whisper Timestamped ⭐ 1,217

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Speech Emotion Analyzer ⭐ 1,155

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

Descriptive Deep Learning

Tools for handling speech data in machine learning projects.

Sincnet ⭐ 764

SincNet is a neural architecture for efficiently processing raw audio samples.

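End-to-end Chinese speech recognition implemented with PaddlePaddle, from getting started to hands-on practice, with very simple introductory examples and highly practical enterprise projects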

Libreasr ⭐ 647

💬 An On-Premises, Streaming Speech Recognition System

Speech To Text Benchmark ⭐ 570

speech to text benchmark framework

Paddlepaddle Deepspeech ⭐ 536

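Chinese speech recognition implemented with PaddlePaddle. The project is complete and recognition quality is good. Supports inference on Windows and Jetson development boards.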

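A streaming and non-streaming automatic speech recognition framework implemented in PyTorch, compatible with both online and offline recognition; currently supports Conforme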

Deep learning for audio processing

Nmtpytorch ⭐ 395

Sequence-to-Sequence Framework in PyTorch

Caffe Speech Recognition ⭐ 320

Speech Recognition with the Caffe deep learning framework, migrating to

A List of Big Models

Libfaceid ⭐ 290

libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.

Deep Learning Papers For Fish ⭐ 248

A list of papers in deep learning for newcomers.

Kerasdeepspeech ⭐ 244

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Speech_dataset ⭐ 229

The dataset of Speech Recognition

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Rnn_ctc ⭐ 216

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Ai Audio Datasets ⭐ 199

This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.

Willow Inference Server ⭐ 190

Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS

Hey Jetson ⭐ 189

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

Voice_activity_detection ⭐ 171

Voice Activity Detection based on Deep Learning & TensorFlow

Chinese Automatic Speech Recognition ⭐ 157

Chinese speech recognition

Bidirectional_rnn ⭐ 152

Bidirectional LSTM

Rnnt Speech Recognition ⭐ 152

End-to-end speech recognition using RNN Transducers in Tensorflow 2.0

Speech2text ⭐ 148

A Deep-Learning-Based Persian Speech Recognition System

Tevr Asr Tool ⭐ 132

State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.

Speech Recognition Neural Network ⭐ 128

This is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.

Tensorflow Ctc Speech Recognition ⭐ 127

Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).

Summary ⭐ 126

summaries of all the papers I read

Mongolian Nlp ⭐ 126

Useful resources for Mongolian NLP

Spokestack Python ⭐ 124

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.

Deep Learning Papers Reading Roadmap ⭐ 122

Deep learning papers reading roadmap

CEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.

Automatic Speech Recognition ⭐ 116

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

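A Chinese speech recognition series that readers can use to quickly train their own Chinese speech recognition models, or to test results directly with the pretrained models.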

Build high-performance AI models with modular building blocks

Las_mandarin_pytorch ⭐ 104

Listen, Attend and Spell model and a pretrained Mandarin Chinese model (Chinese-Mandarin ASR model)

Pytorch Speech Commands ⭐ 98

Speech commands recognition with PyTorch

Speech Representations ⭐ 97

Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)

Whisper Auto Transcribe ⭐ 91

Auto transcribe tool based on whisper

Mongolian Speech Recognition ⭐ 86

Mongolian speech recognition with PyTorch

Speech Emotion Recognition ⭐ 78

Detecting emotions using MFCC features of human speech using Deep Learning

Awesome Openai Whisper ⭐ 72

A curated list of awesome OpenAI Whisper resources

Wav2letter ⭐ 70

Speech recognition model based on the FAIR research paper, built using PyTorch.

Wav2letter.pytorch ⭐ 67

A fully convolutional network for speech-to-text, built on PyTorch.

Python Deep Learning Projects ⭐ 67

Codebase for my book "Python DeepLearning Projects" | Learn applied deep learning for various use cases in NLP, CV, and ASR using TensorFlow and Keras.

Ai Study ⭐ 63

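A comprehensive collection of AI learning materials, covering machine learning (ML) fundamentals, deep learning (DL) fundamentals, computer vision (CV), and natural language processing (NLP)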

A list of papers, books, and sites on various topics related to machine learning and deep learning, along with the fields in which they are applied

Tf Speech Recognition Challenge Solution ⭐ 53

Source code of the model used in the TensorFlow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recogn The solution ranked in the top 5% on the private leaderboard.

Torchsubband ⭐ 51

Pytorch implementation of subband decomposition

Keras Sincnet ⭐ 49

Keras (TensorFlow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.

Triplet_loss_kws ⭐ 47

Learning Efficient Representations for Keyword Spotting with Triplet Loss

Lip_reading_in_the_wild_avsr ⭐ 46

Audio-Visual Speech Recognition using Deep Learning

A_chronology_of_deep_learning ⭐ 45

Tracing back and exposing in chronological order the main ideas in the field of deep learning, to help everyone better understand the current intense research in AI.

Whispers2t ⭐ 44

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines

Noisy Student Training Asr ⭐ 44

PyTorch implementation of Noisy Student Training for the Automatic Speech Recognition and Automatic Pronunciation Error Detection problems

Automatic Speech Recognition ⭐ 43

Automatic Speech Recognition using Tensorflow

Speech2face ⭐ 43

Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL

Pywhisper ⭐ 42

openai/whisper + extra features

Avsr Deep Speech ⭐ 42

Google Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab

Deepspeech ⭐ 40

A PyTorch implementation of DeepSpeech and DeepSpeech2.

Jetson Voice ⭐ 39

ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT

Banglaspeech2text ⭐ 38

BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned on the latest whisper speech to text model for optimal performance.

Hf Experiments ⭐ 37

Experiments with Hugging Face 🔬 🤗

Wavencoder ⭐ 36

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.

Turkicasr ⭐ 35

A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.

Deep Learning And Paper ⭐ 33

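[For exchange and learning purposes only] Machine intelligence: related books and classic papers, including AutoML, sentiment classification, speech recognition, voiceprint recognition,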

An implementation of Power Normalized Cepstral Coefficients (PNCC)

Aniemore ⭐ 28

Emotion recognition from audio and text files (Russian language only)

Deepspeech Api ⭐ 27

The code enables users to use Mozilla's DeepSpeech model from a web browser.

Pytorch_mlp_for_asr ⭐ 27

This code implements a basic MLP for speech recognition. The MLP is trained with PyTorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.

Ai Engine ⭐ 25

Easy to use Multi-Provider ASR/Speech To Text and NLP engine

1-100 of 135 search results
