Awesome Large Audio Models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Alternatives To Awesome Large Audio Models
Speechbrain
Stars: 7,166; most recent commit: 5 months ago; open issues: 149; license: apache-2.0; language: Python
A PyTorch-based Speech Toolkit

Awesome Large Audio Models
Stars: 207; most recent commit: 8 months ago
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

Runtimespeechrecognizer
Stars: 153; most recent commit: 5 months ago; open issues: 2; license: mit; language: C++
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.

Web Voice Processor
Stars: 147; usage figures: 319; most recent commit: 5 months ago; total releases: 38; latest release: July 19, 2023; open issues: 4; license: apache-2.0; language: TypeScript
A library for real-time voice processing in web browsers

Audiototext
Stars: 132; most recent commit: 5 months ago; open issues: 1; language: Jupyter Notebook
Transcribe and translate audio to text using Whisper and DeepL.

Vid2cleantxt
Stars: 53; most recent commit: 2 years ago; total releases: 2; latest release: February 24, 2022; license: apache-2.0; language: Jupyter Notebook
Python API & command-line tool to easily transcribe speech-based video files into clean text

Vocalforge
Stars: 39; most recent commit: 10 months ago; total releases: 8; latest release: July 20, 2023; license: mit; language: Python
Your one-stop solution for voice dataset creation

Audio Speech Tutorial
Stars: 12; most recent commit: 6 months ago; license: mit; language: Jupyter Notebook
This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.

Audio Pretrained Model
Stars: 11; most recent commit: 4 years ago; license: mit
A collection of Audio and Speech pre-trained models.

End2endautomaticspeechrecognition
Stars: 7; most recent commit: 3 years ago; language: Python
In this repository, I have developed an end to end Automatic speech recognition project. I have developed the neural network model for automatic speech recognition with PyTorch and used MLflow to manage the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry.