Awesome Open Source
Search results for voice activity detection
55 search results found
Noisetorch
⭐
8,684
Real-time microphone noise suppression on Linux.
Ffsubsync
⭐
6,523
Automagically synchronize subtitles with video.
Pyannote Audio
⭐
4,460
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Silero Vad
⭐
2,339
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Funasr
⭐
2,315
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
Autosub
⭐
1,191
Command-line utility to transcribe/translate from video/audio/subtitles to subtitles
Voice_datasets
⭐
846
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Open Speech Corpora
⭐
830
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Python Ai Assistant
⭐
812
Python AI assistant 🧠
Diart
⭐
635
A Python package to build AI-powered real-time audio applications
Vad
⭐
632
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Inaspeechsegmenter
⭐
630
CNN-based audio segmentation toolkit. Detects speech, music, noise and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
Auditok
⭐
615
An audio/acoustic activity detection and audio segmentation tool
Subaligner
⭐
393
Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
Voicebook
⭐
325
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
Voice_activity_detection
⭐
171
Voice Activity Detection based on Deep Learning & TensorFlow
Android Vad
⭐
150
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Cobra
⭐
136
On-device voice activity detection (VAD) powered by deep learning
Rvad
⭐
113
Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
Rvadfast
⭐
95
This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
Whisper Auto Transcribe
⭐
91
Auto transcribe tool based on whisper
Gpv
⭐
89
Repository for our Interspeech 2020 general-purpose voice activity detection (GPVAD) paper
Voice_activity_detector
⭐
81
A statistical model-based Voice Activity Detection
Spokestack Android
⭐
49
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
React Native Spokestack
⭐
46
Spokestack: give your React Native app a voice interface!
Py Nltools
⭐
42
A collection of basic Python modules for spoken natural language processing
Datadriven Gpvad
⭐
35
The codebase for Data-driven general-purpose voice activity detection.
Spectra
⭐
32
Spectra extraction tutorials based on torch and torchaudio.
Nala
⭐
30
🦁 Nala is an agile open-source voice assistant framework (20+ actions).
Sepia Web Audio
⭐
27
Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, resampling and much more...
Spokestack Ios
⭐
26
Spokestack: give your iOS app a voice interface!
Huawei Challenge Speaker Identification
⭐
26
Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.
Voice Activity Detection
⭐
26
Voice Activity Detection (VAD) using deep learning. Supervised by Retune DSP.
Voice Activity Detection
⭐
16
PyTorch implementation of Self-Attentive VAD, ICASSP 2021
Zff_vad
⭐
16
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
Voice_gender_detection
⭐
14
♂️♀️ Detect a person's gender from a voice file. Achieves 90.7% +/- 1.3% accuracy.
Whisperseg
⭐
12
Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection
Whisper_ros
⭐
12
silero-vad + whisper.cpp for ROS 2 (speech-to-text for ROS 2)
End To End Speech Recognition Models
⭐
11
PyTorch implementation of automatic speech recognition models.
Litevad
⭐
11
Voice activity detection (VAD) library for speech-end detection, based on WebRTC's VAD engine
Vad Sli Asr
⭐
11
A pipeline to isolate and transcribe one language in mixed-language speech
Webrtcvad_wrapper
⭐
10
A simple Python wrapper to simplify working with WebRTC VAD and its rougher analogue based on RMS and ZCR (useful for processing audio recordings before using them with neural networks).
Android Speaker Audioanalysis
⭐
10
This is my Master's thesis project titled "Speaker Detection and Conversation Analysis on Mobile Devices".
Mica Speech Activity Detection
⭐
9
Robust Speech Activity Detection (SAD) in movie audio
Webrtc_vad
⭐
9
Voice activity detection (VAD) library and Go bindings based on WebRTC's VAD engine
Bbc Speech Segmenter
⭐
9
A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.
Audio Katana
⭐
8
A tool to slice your audio files into chunks using the Voice Activity Detection technique
Conv Vad
⭐
7
A packaged convolutional voice activity detector for noisy environments.
Voice_activity_detection_v2
⭐
7
2018 Lenovo AI Lab Summer Intern
Asr 2pass
⭐
6
ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/Fun
Chromecast_vad
⭐
6
RNN implementation of a voice activity detector to control Chromecast device volume.
Speaker Diarization
⭐
6
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
Voice_activity_detection_v1
⭐
5
2018 Lenovo AI Lab Summer Intern
Simple Voice Activity Detector Using Mfcc Based On Fpga Kintex
⭐
5
Voice Activity Detector based on MFCC features and DNN model
Voxseg
⭐
5
A Python library for voice activity detection (VAD) for speech/non-speech segmentation.
Copyright 2018-2024 Awesome Open Source. All rights reserved.