Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for python speech recognition
python
x
speech-recognition
x
413 search results found
Transformers
⭐
129,496
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Leon
⭐
13,937
🧠 Leon is your open-source personal assistant.
Nemo
⭐
9,041
NeMo: a toolkit for conversational AI
Speech_recognition
⭐
7,801
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Espnet
⭐
7,563
End-to-End Speech Processing Toolkit
Whisperx
⭐
7,510
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Asrt_speechrecognition
⭐
7,253
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Speechbrain
⭐
7,166
A PyTorch-based Speech Toolkit
Vosk Api
⭐
6,633
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Pocketsphinx
⭐
3,620
A small speech recognizer
Lingvo
⭐
2,776
Lingvo
Distil Whisper
⭐
2,760
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Automatic_speech_recognition
⭐
2,743
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Ml Road
⭐
2,742
Machine Learning Resources, Practice and Research
Funasr
⭐
2,315
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
Tensorflow Speech Recognition
⭐
2,150
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Pytorch Kaldi
⭐
2,138
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Lip Reading Deeplearning
⭐
1,433
🔓 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
Openseq2seq
⭐
1,393
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Dragonfire
⭐
1,294
the open-source virtual assistant for Ubuntu based Linux distributions
Speech Emotion Analyzer
⭐
1,279
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Whisper Timestamped
⭐
1,217
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Pykaldi
⭐
954
A Python wrapper for Kaldi
Espresso
⭐
930
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Quillman
⭐
880
A chat app that transcribes audio in real-time, streams back a response from a language model, and synthesizes this response as natural-sounding speech.
Kur
⭐
812
Descriptive Deep Learning
Conformer
⭐
809
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Lhotse
⭐
794
Tools for handling speech data in machine learning projects.
Stephanie Va
⭐
769
Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
Sincnet
⭐
764
SincNet is a neural architecture for efficiently processing raw audio samples.
Deepspeech Examples
⭐
739
Examples of how to use or integrate DeepSpeech
Salmonn
⭐
710
SALMONN: Speech Audio Language Music Open Neural Network
Ppasr
⭐
701
基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目
Stt
⭐
694
Voice Recognition to Text Tool / 一个离线运行的本地语音识别转文字服务,输出json、srt字幕带时间戳、纯文字格式
Sherpa Ncnn
⭐
673
Real-time speech recognition using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Raspberry Pi, VisionFive2, etc.
Speech
⭐
673
A PyTorch Implementation of End-to-End Models for Speech-to-Text
Libreasr
⭐
647
💬 An On-Premises, Streaming Speech Recognition System
Irene Voice Assistant
⭐
644
Ирина - русский голосовой ассистент для работы оффлайн. Поддерживает скиллы через плагины.
Cn2an
⭐
642
📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
Speecht5
⭐
638
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Whisper Playground
⭐
637
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
Whisper Ctranslate2
⭐
628
Whisper command line client compatible with original OpenAI client based on CTranslate2.
Speech To Text Benchmark
⭐
570
speech to text benchmark framework
Whisper_mic
⭐
560
Project that allows one to use a microphone with OpenAI whisper.
Treasure Of Transformers
⭐
541
💁 Awesome Treasure of Transformers Models for Natural Language processing contains papers, videos, blogs, official repo along with colab Notebooks. 🛫☑️
Paddlepaddle Deepspeech
⭐
536
基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows, Jetson开发板预测。
Free Spoken Digit Dataset
⭐
518
A free audio dataset of spoken digits. Think MNIST for audio.
Storytoolkitai
⭐
504
An editing tool that uses AI to transcribe, understand content and search for anything in your footage, integrated with ChatGPT and other AI models
Masr
⭐
462
Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conforme
Ai Waifu Vtuber
⭐
457
AI Vtuber for Streaming on Youtube/Twitch
Specaugment
⭐
411
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Pocketsphinx Python
⭐
367
Python interface to CMU Sphinxbase and Pocketsphinx libraries
Huggingsound
⭐
357
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
Unispeech
⭐
328
UniSpeech - Large Scale Self-Supervised Learning for Speech
Parrots
⭐
318
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese. 中文语音识别、文字转语音,基于语音库实现,易扩展。
Edenai Apis
⭐
313
Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
Opentransformer
⭐
310
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
Kaldi Active Grammar
⭐
305
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Bmlist
⭐
297
A List of Big Models
Libfaceid
⭐
290
libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Vosk
⭐
287
VOSK Speech Recognition Toolkit
Deepspeech German
⭐
284
Automatic Speech Recognition (ASR) - German
Tensorflow_end2end_speech_recognition
⭐
275
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
Speech Recognition Uk
⭐
262
Speech Recognition for Ukrainian
Livewhisper
⭐
261
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
Attention Lvcsr
⭐
259
End-to-End Attention-Based Large Vocabulary Speech Recognition
Jarvis Chatgpt
⭐
242
A Conversational Assistant equipped with synthetic voices including J.A.R.V.I.S's. Powered by OpenAI and IBM Watson APIs and a Tacotron model for voice generation.
End2end Asr Pytorch
⭐
239
End-to-End Automatic Speech Recognition on PyTorch
Gpt Voice Conversation Chatbot
⭐
232
Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.
Edgedict
⭐
229
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stanford Ctc
⭐
226
Neural net code for lexicon-free speech recognition with connectionist temporal classification
Ltu
⭐
223
Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".
Voicestreamai
⭐
222
Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS
Wav2vec2 Live
⭐
218
A live speech recognition using Facebooks wav2vec 2.0 model.
Rnn_ctc
⭐
216
Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Whisper At
⭐
212
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"
Clovacall
⭐
171
ClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)
Voice_activity_detection
⭐
171
Voice Activity Detection based on Deep Learning & TensorFlow
Ollama Voice Mac
⭐
165
Mac compatible Ollama Voice
Py Kaldi Asr
⭐
154
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Bidirectional_rnn
⭐
152
bidirectional lstm
Rnnt Speech Recognition
⭐
152
End-to-end speech recognition using RNN Transducers in Tensorflow 2.0
Sova Asr
⭐
149
SOVA ASR (Automatic Speech Recognition)
Speech2text
⭐
148
A Deep-Learning-Based Persian Speech Recognition System
End To End Lipreading
⭐
147
Pytorch code for End-to-End Audiovisual Speech Recognition
Synthalingua
⭐
144
Synthalingua - Real Time Translation
Deep_avsr
⭐
138
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Speech To Text Russian
⭐
138
Проект для распознавания речи на русском языке на основе pykaldi.
Cobra
⭐
136
On-device voice activity detection (VAD) powered by deep learning
M.i.t.s.u.h.a.
⭐
134
World's First Multilingual Inexpensive Therapeutic Sophisticated Ultra-responsive Holographic Agent. In simple terms, an AI you can talk to and it'll talk back with a body using VTube Studio.
Tensorflow Wavenet
⭐
132
speech recognition based on tensorflow 1.0.0
Tensorflow Ctc Speech Recognition
⭐
127
Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).
How2 Dataset
⭐
125
This repository contains code and metadata of How2 dataset
Keras Kaldi
⭐
124
Keras Interface for Kaldi ASR
Spokestack Python
⭐
124
Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.
Python Speech Recognition
⭐
124
Speech Recognition with Python examples
At16k
⭐
123
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
Audiomate
⭐
123
Python library for handling audio datasets.
Scribe
⭐
123
Simple speech recognition using your microphone.
Aps
⭐
122
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
Related Searches
Python Django (28,897)
Python Machine Learning (20,195)
Python Flask (17,643)
Python Pytorch (15,131)
Python Dataset (14,792)
Python Docker (14,113)
Python Tensorflow (14,061)
Python Command Line (13,351)
Python Deep Learning (13,092)
Python Jupyter Notebook (12,976)
1-100 of 413 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.