Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for speech to text asr
asr
x
speech-to-text
x
99 search results found
Kaldi
⭐
13,453
kaldi-asr/kaldi is the official location of the Kaldi project.
Nemo
⭐
9,041
NeMo: a toolkit for conversational AI
Whisperx
⭐
7,510
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Speechbrain
⭐
7,166
A PyTorch-based Speech Toolkit
Vosk Api
⭐
6,633
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Silero Models
⭐
4,088
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Lingvo
⭐
2,776
Lingvo
Stt
⭐
1,988
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Whisper Diarization
⭐
1,538
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Whisper Asr Webservice
⭐
1,317
OpenAI Whisper ASR Webservice API
Whisper Timestamped
⭐
1,217
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Ppasr
⭐
701
基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目
Eesen
⭐
673
The official repository of the Eesen project
Open_stt
⭐
671
Open STT
Cheetah
⭐
537
On-device streaming speech-to-text engine powered by deep learning
Paddlepaddle Deepspeech
⭐
536
基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows, Jetson开发板预测。
Autosub
⭐
525
A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui
Whisper Standalone Win
⭐
488
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
Masr
⭐
462
Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conforme
Leopard
⭐
390
On-device speech-to-text engine powered by deep learning
Huggingsound
⭐
357
HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools
Langhelper
⭐
292
Striving to create a great Application with full functions of learning languages by ChatGPT, TTS, STT and other awesome AI models, supports talking, speaking assessment, memorizing words with contexts, Listening test, so on.
Tensorflow_end2end_speech_recognition
⭐
275
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
Kerasdeepspeech
⭐
244
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Vosk Browser
⭐
238
A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
Speech_dataset
⭐
229
The dataset of Speech Recognition
Edgedict
⭐
229
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Dsnote
⭐
225
Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.
Wav2vec2 Live
⭐
218
A live speech recognition using Facebooks wav2vec 2.0 model.
Whisper.unity
⭐
218
Running speech to text model (whisper.cpp) in Unity3d on your local machine.
Asr Audio Data Links
⭐
187
A list of publically available audio data that anyone can download for ASR or other speech activities
Chinese Automatic Speech Recognition
⭐
157
Chinese speech recognition
Speecht
⭐
156
An opensource speech-to-text software written in tensorflow
Sova Asr
⭐
149
SOVA ASR (Automatic Speech Recognition)
Speech To Text Russian
⭐
138
Проект для распознавания речи на русском языке на основе pykaldi.
Spokestack Python
⭐
124
Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.
At16k
⭐
123
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
Elevateaijavasdk
⭐
121
Java SDK for ElevateAI
Elevateaidotnetsdk
⭐
115
.Net core 6 SDK for ElevateAI
Elevateaipythonsdk
⭐
111
ElevateAI - Speech-to-text API Python SDK
Obsidian Transcription
⭐
107
Obsidian plugin to create high-quality transcriptions from markdown linked audio files
Las_mandarin_pytorch
⭐
104
Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)
Awesome Russian Speech
⭐
97
Russian speech technology links
Whisper Auto Transcribe
⭐
91
Auto transcribe tool based on whisper
Mongolian Speech Recognition
⭐
86
Mongolian speech recognition with PyTorch
Deepgram Js Sdk
⭐
81
Official JavaScript SDK for Deepgram's automated speech recognition APIs.
Kaldi Serve
⭐
79
Server framework for Kaldi ASR Toolkit
Asr Wav2vec Finetune
⭐
76
⚡ Finetune Wa2vec 2.0 For Speech Recognition
Wav2letter
⭐
70
Speech Recognition model based off of FAIR research paper built using Pytorch.
Viet Asr
⭐
65
VietASR - Vietnamese Automatic Speech Recognition
Simple_diarizer
⭐
63
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Syn Speech
⭐
62
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Vosk Asterisk
⭐
52
Speech Recognition in Asterisk with Vosk Server
Docker Whisperx
⭐
49
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)
Asrecognition
⭐
47
ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
React Native Spokestack
⭐
46
Spokestack: give your React Native app a voice interface!
Whispers2t
⭐
44
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Enginer
Cif Pytorch
⭐
36
[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).
Voice Recognition Ua
⭐
32
Training scripts for Speech-To-Text models for Ukrainian language
Greenkey Asrtoolkit
⭐
31
A collection of useful tools for handling speech recognition data
Linto Stt
⭐
30
An automatic speech recognition API
Openasr
⭐
29
A pytorch based end2end speech recognition system.
Spokestack Ios
⭐
26
Spokestack: give your iOS app a voice interface!
Ai Engine
⭐
25
Easy to use Multi-Provider ASR/Speech To Text and NLP engine
Speech Recognition Experiments
⭐
24
Experiments to test different speech recognition systems for SEPIA Framework
Keenasr Android Poc
⭐
21
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Platform
⭐
21
Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)
Kaldi Br
⭐
21
☕🇧🇷 Scripts para o Kaldi em Português Brasileiro
Speech Recognition Evaluation
⭐
21
Evaluate results from ASR/Speech-to-Text quickly
Deepasr
⭐
18
Keras(Tensorflow) implementations of Automatic Speech Recognition
Speech Recognition
⭐
17
SDKs and docs for Skit's speech to text service
Kaldi Long Audio Alignment
⭐
17
Long audio alignment using Kaldi
Speechloop
⭐
16
Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
Speech To Text Viewer
⭐
16
AWS Transcribe evaluation pipeline: bulk-process audio files and view the results
Assemblyai Node Sdk
⭐
15
The AssemblyAI JavaScript SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, as well as the latest LeMUR models.
Megs
⭐
14
A merged version of multiple open-source German speech datasets.
Ovos Stt Plugin Vosk
⭐
14
vosk STT plugin for mycroft
Pcpm
⭐
14
Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Openai Whisper Microservice
⭐
13
This is an OpenAI Whisper automatic speech recognition microservice
Wav2vec2 Live Japanese Translator
⭐
13
real time japanese speech recognition translator using wav2vec2
Voskjs
⭐
12
Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.
Be_nlp_speech_resources
⭐
8
Links to Belarusian NLP and Speech resources
Werpy
⭐
8
🐍📦 Rapidly calculate and analyze the Word Error Rate (WER) with this powerful yet lightweight Python package.
Deepspeech Kabyle
⭐
8
Automatic Speech Recognition (ASR) - Kabyle
Tatt
⭐
8
Transcribe All The Things™ is a CLI for creating and managing speech-to-text transcripts.
Speechtotext
⭐
7
Bi-directional streaming speech-to-text service using Cloud ASRs
Speech Datasets For Asr
⭐
6
Download speech datasets (English and non-English) for Automatic Speech Recognition
Full Lattice Search
⭐
6
Full Text Search Over Probabilistic Lattices with Elasticsearch!
Speech To Text Wavenet
⭐
6
Speech to Text
Keras_asr
⭐
6
ASR experiment using Google's Universal Sentence Encoder
Whisper Finetuning Be
⭐
6
Finetuning Whisper ASR model for Belarusian language
Speech Adapters
⭐
6
Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech understanding
Google Speech To Text Api Word Error Rate Analysis Tool
⭐
6
Takes audio and reference transcriptions in bulk and generates WER
Assemblyai Java Sdk
⭐
6
The AssemblyAI Java SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, as well as the latest LeMUR models.
All About Speech
⭐
5
Coquisttjs
⭐
5
Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.
Kaldi Arabic
⭐
5
HHM-based Arabic ASR using Kaldi engine
Indicasr
⭐
5
Speeech Recognition for Indic languages.
Wav2vec2 Fa
⭐
5
fine-tune Wav2vec2. an ASR model released by Facebook
Related Searches
Python Speech To Text (468)
Python Asr (347)
Speech Recognition Asr (250)
1-99 of 99 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.