Awesome Open Source

Programming Languages

Search results for speech to text asr

speech-to-text x

99 search results found

Kaldi ⭐ 13,453

kaldi-asr/kaldi is the official location of the Kaldi project.

NeMo: a toolkit for conversational AI

Whisperx ⭐ 7,510

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Speechbrain ⭐ 7,166

A PyTorch-based Speech Toolkit

Vosk Api ⭐ 6,633

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Silero Models ⭐ 4,088

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Lingvo ⭐ 2,776

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

Whisper Diarization ⭐ 1,538

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Whisper Asr Webservice ⭐ 1,317

OpenAI Whisper ASR Webservice API

Whisper Timestamped ⭐ 1,217

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

基于PaddlePaddle实现端到端中文语音识别，从入门到实战，超简单的入门案例，超实用的企业项目

The official repository of the Eesen project

Open_stt ⭐ 671

Cheetah ⭐ 537

On-device streaming speech-to-text engine powered by deep learning

Paddlepaddle Deepspeech ⭐ 536

基于PaddlePaddle实现的语音识别，中文语音识别。项目完善，识别效果好。支持Windows， Jetson开发板预测。

Autosub ⭐ 525

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui

Whisper Standalone Win ⭐ 488

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

Pytorch实现的流式与非流式的自动语音识别框架，同时兼容在线和离线识别，目前支持Conforme

Leopard ⭐ 390

On-device speech-to-text engine powered by deep learning

Huggingsound ⭐ 357

HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools

Langhelper ⭐ 292

Striving to create a great Application with full functions of learning languages by ChatGPT, TTS, STT and other awesome AI models, supports talking, speaking assessment, memorizing words with contexts, Listening test, so on.

Tensorflow_end2end_speech_recognition ⭐ 275

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

Kerasdeepspeech ⭐ 244

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Vosk Browser ⭐ 238

A speech recognition library running in the browser thanks to a WebAssembly build of Vosk

Speech_dataset ⭐ 229

The dataset of Speech Recognition

Edgedict ⭐ 229

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

Wav2vec2 Live ⭐ 218

A live speech recognition using Facebooks wav2vec 2.0 model.

Whisper.unity ⭐ 218

Running speech to text model (whisper.cpp) in Unity3d on your local machine.

Asr Audio Data Links ⭐ 187

A list of publically available audio data that anyone can download for ASR or other speech activities

Chinese Automatic Speech Recognition ⭐ 157

Chinese speech recognition

Speecht ⭐ 156

An opensource speech-to-text software written in tensorflow

Sova Asr ⭐ 149

SOVA ASR (Automatic Speech Recognition)

Speech To Text Russian ⭐ 138

Проект для распознавания речи на русском языке на основе pykaldi.

Spokestack Python ⭐ 124

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.

Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.

Elevateaijavasdk ⭐ 121

Java SDK for ElevateAI

Elevateaidotnetsdk ⭐ 115

.Net core 6 SDK for ElevateAI

Elevateaipythonsdk ⭐ 111

ElevateAI - Speech-to-text API Python SDK

Obsidian Transcription ⭐ 107

Obsidian plugin to create high-quality transcriptions from markdown linked audio files

Las_mandarin_pytorch ⭐ 104

Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)

Awesome Russian Speech ⭐ 97

Russian speech technology links

Whisper Auto Transcribe ⭐ 91

Auto transcribe tool based on whisper

Mongolian Speech Recognition ⭐ 86

Mongolian speech recognition with PyTorch

Deepgram Js Sdk ⭐ 81

Official JavaScript SDK for Deepgram's automated speech recognition APIs.

Kaldi Serve ⭐ 79

Server framework for Kaldi ASR Toolkit

Asr Wav2vec Finetune ⭐ 76

⚡ Finetune Wa2vec 2.0 For Speech Recognition

Wav2letter ⭐ 70

Speech Recognition model based off of FAIR research paper built using Pytorch.

Viet Asr ⭐ 65

VietASR - Vietnamese Automatic Speech Recognition

Simple_diarizer ⭐ 63

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

Syn Speech ⭐ 62

Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework

Vosk Asterisk ⭐ 52

Speech Recognition in Asterisk with Vosk Server

Docker Whisperx ⭐ 49

Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)

Asrecognition ⭐ 47

ASRecognition: just an easy-to-use library for Automatic Speech Recognition.

React Native Spokestack ⭐ 46

Spokestack: give your React Native app a voice interface!

Whispers2t ⭐ 44

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Enginer

Cif Pytorch ⭐ 36

[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).

Voice Recognition Ua ⭐ 32

Training scripts for Speech-To-Text models for Ukrainian language

Greenkey Asrtoolkit ⭐ 31

A collection of useful tools for handling speech recognition data

Linto Stt ⭐ 30

An automatic speech recognition API

A pytorch based end2end speech recognition system.

Spokestack Ios ⭐ 26

Spokestack: give your iOS app a voice interface!

Ai Engine ⭐ 25

Easy to use Multi-Provider ASR/Speech To Text and NLP engine

Speech Recognition Experiments ⭐ 24

Experiments to test different speech recognition systems for SEPIA Framework

Keenasr Android Poc ⭐ 21

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Platform ⭐ 21

Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)

Kaldi Br ⭐ 21

☕🇧🇷 Scripts para o Kaldi em Português Brasileiro

Speech Recognition Evaluation ⭐ 21

Evaluate results from ASR/Speech-to-Text quickly

Keras(Tensorflow) implementations of Automatic Speech Recognition

Speech Recognition ⭐ 17

SDKs and docs for Skit's speech to text service

Kaldi Long Audio Alignment ⭐ 17

Long audio alignment using Kaldi

Speechloop ⭐ 16

Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?

Speech To Text Viewer ⭐ 16

AWS Transcribe evaluation pipeline: bulk-process audio files and view the results

Assemblyai Node Sdk ⭐ 15

The AssemblyAI JavaScript SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, as well as the latest LeMUR models.

A merged version of multiple open-source German speech datasets.

Ovos Stt Plugin Vosk ⭐ 14

vosk STT plugin for mycroft

Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.

Openai Whisper Microservice ⭐ 13

This is an OpenAI Whisper automatic speech recognition microservice

Wav2vec2 Live Japanese Translator ⭐ 13

real time japanese speech recognition translator using wav2vec2

Vosk ASR offline engine API for NodeJs developers. With a simple HTTP ASR server.

Be_nlp_speech_resources ⭐ 8

Links to Belarusian NLP and Speech resources

🐍📦 Rapidly calculate and analyze the Word Error Rate (WER) with this powerful yet lightweight Python package.

Deepspeech Kabyle ⭐ 8

Automatic Speech Recognition (ASR) - Kabyle

Transcribe All The Things™ is a CLI for creating and managing speech-to-text transcripts.

Speechtotext ⭐ 7

Bi-directional streaming speech-to-text service using Cloud ASRs

Speech Datasets For Asr ⭐ 6

Download speech datasets (English and non-English) for Automatic Speech Recognition

Full Lattice Search ⭐ 6

Full Text Search Over Probabilistic Lattices with Elasticsearch!

Speech To Text Wavenet ⭐ 6

Keras_asr ⭐ 6

ASR experiment using Google's Universal Sentence Encoder

Whisper Finetuning Be ⭐ 6

Finetuning Whisper ASR model for Belarusian language

Speech Adapters ⭐ 6

Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech understanding

Google Speech To Text Api Word Error Rate Analysis Tool ⭐ 6

Takes audio and reference transcriptions in bulk and generates WER

Assemblyai Java Sdk ⭐ 6

The AssemblyAI Java SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, as well as the latest LeMUR models.

All About Speech ⭐ 5

Coquisttjs ⭐ 5

Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.

Kaldi Arabic ⭐ 5

HHM-based Arabic ASR using Kaldi engine

Speeech Recognition for Indic languages.

Wav2vec2 Fa ⭐ 5

fine-tune Wav2vec2. an ASR model released by Facebook

Related Searches

Python Speech To Text (468)

Python Asr (347)

Speech Recognition Asr (250)

1-99 of 99 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.