Awesome Large Audio Models Alternatives

Name: EmulationAI/awesome-large-audio-models
Brand: EmulationAI/awesome-large-audio-models
SKU: project/EmulationAI/awesome-large-audio-models
Rating: 4.5 (207 reviews)

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

Categories > Media > Large Language Models

Suggest Alternative

Stars

207

Alternatives

License

No license specified

Open Issues

Most Recent Commit

over 2 years ago

Dependent Repos

Dependent Packages

Total Releases

Categories

Machine Learning > Speech To Text

Media > Audio Processing

Machine Learning > Music Information Retrieval

Site

Repo

Alternatives To EmulationAI/awesome-large-audio-models

Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
speechbrain/speechbrain	7,166	0	0	over 2 years ago	0		149	apache-2.0	Python
A PyTorch-based Speech Toolkit
EmulationAI/awesome-large-audio-models	207	0	0	over 2 years ago	0		0
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
gtreshchev/RuntimeSpeechRecognizer	153	0	0	over 2 years ago	0		2	mit	C++
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.
Picovoice/web-voice-processor	147	4	26	over 2 years ago	39	March 15, 2024	4	apache-2.0	TypeScript
A library for real-time voice processing in web browsers
Carleslc/AudioToText	132	0	0	over 2 years ago	0		1		Jupyter Notebook
Transcribe and translate audio to text using Whisper and DeepL.
pszemraj/vid2cleantxt	53	0	0	almost 4 years ago	2	February 24, 2022	0	apache-2.0	Jupyter Notebook
Python API & command-line tool to easily transcribe speech-based video files into clean text
rioharper/VocalForge	39	0	0	almost 3 years ago	8	July 20, 2023	0	mit	Python
Your one-stop solution for voice dataset creation
koudounasalkis/Audio-Speech-Tutorial	12	0	0	over 2 years ago	0		0	mit	Jupyter Notebook
This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.
balavenkatesh3322/audio-pretrained-model	11	0	0	almost 6 years ago	0		0	mit
A collection of Audio and Speech pre-trained models.
victor369basu/End2EndAutomaticSpeechRecognition	7	0	0	about 5 years ago	0		0		Python
In this repository, I have developed an end to end Automatic speech recognition project. I have developed the neural network model for automatic speech recognition with PyTorch and used MLflow to manage the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry.

Alternatives To EmulationAI/awesome-large-audio-models

Select To Compare

speechbrain/speechbrain ⭐ 7,166

A PyTorch-based Speech Toolkit

dependent packages 0 total releases 0 most recent commit over 2 years ago

EmulationAI/awesome-large-audio-models ⭐ 207

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

dependent packages 0 total releases 0 most recent commit over 2 years ago

gtreshchev/RuntimeSpeechRecognizer ⭐ 153

Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.

dependent packages 0 total releases 0 most recent commit over 2 years ago

Picovoice/web-voice-processor ⭐ 147

A library for real-time voice processing in web browsers

dependent packages 26 total releases 39 most recent commit over 2 years ago downloads badge

Carleslc/AudioToText ⭐ 132

Transcribe and translate audio to text using Whisper and DeepL.

dependent packages 0 total releases 0 most recent commit over 2 years ago

pszemraj/vid2cleantxt ⭐ 53

Python API & command-line tool to easily transcribe speech-based video files into clean text

dependent packages 0 total releases 2 most recent commit almost 4 years ago downloads badge

rioharper/VocalForge ⭐ 39

Your one-stop solution for voice dataset creation

dependent packages 0 total releases 8 most recent commit almost 3 years ago downloads badge

koudounasalkis/Audio-Speech-Tutorial ⭐ 12

This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.

dependent packages 0 total releases 0 most recent commit over 2 years ago

balavenkatesh3322/audio-pretrained-model ⭐ 11

A collection of Audio and Speech pre-trained models.

dependent packages 0 total releases 0 most recent commit almost 6 years ago

victor369basu/End2EndAutomaticSpeechRecognition ⭐ 7

In this repository, I have developed an end to end Automatic speech recognition project. I have developed the neural network model for automatic speech recognition with PyTorch and used MLflow to manage the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry.

dependent packages 0 total releases 0 most recent commit about 5 years ago

Suggest An Alternative To awesome-large-audio-models

Alternative Project Comparisons

EmulationAI/awesome-large-audio-models vs Speechbrain

EmulationAI/awesome-large-audio-models vs Awesome Large Audio Models

EmulationAI/awesome-large-audio-models vs Runtimespeechrecognizer

EmulationAI/awesome-large-audio-models vs Web Voice Processor

EmulationAI/awesome-large-audio-models vs Audiototext

EmulationAI/awesome-large-audio-models vs Vid2cleantxt

EmulationAI/awesome-large-audio-models vs Vocalforge

EmulationAI/awesome-large-audio-models vs Audio Speech Tutorial

EmulationAI/awesome-large-audio-models vs Audio Pretrained Model

EmulationAI/awesome-large-audio-models vs End2endautomaticspeechrecognition

Popular Audio Processing Projects

google/mediapipe⭐ 24,511

Cross-platform, customizable ML solutions for live and streaming media.

deezer/spleeter⭐ 24,258

Deezer source separation library including pretrained models.

tenacityteam/tenacity-legacy⭐ 7,226

**Old repository**. Tenacity is an easy-to-use, privacy-friendly, FLOSS, cross-platform multi-track audio editor/recorder for Windows, macOS, Linux and other operating systems.

bitgapp/eqMac⭐ 5,243

macOS System-wide Audio Equalizer & Volume Mixer 🎧

NVIDIA/DALI⭐ 4,770

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

Popular Speech To Text Projects

ggerganov/whisper.cpp⭐ 27,404

Port of OpenAI's Whisper model in C/C++

SYSTRAN/faster-whisper⭐ 24,200

Faster Whisper transcription with CTranslate2

mozilla/DeepSpeech⭐ 23,687

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

leon-ai/leon⭐ 13,937

🧠 Leon is your open-source personal assistant.

kaldi-asr/kaldi⭐ 13,453

kaldi-asr/kaldi is the official location of the Kaldi project.

Popular Media Categories

Screenshot

Ffmpeg

Volume

Image Processing

Spotify

Radio

Playlist

Decoder

Qrcode

Encoder