Awesome Open Source
Search results for speaker diarization
50 search results found
Nemo
⭐
9,041
NeMo: a toolkit for conversational AI
Espnet
⭐
7,563
End-to-End Speech Processing Toolkit
Speechbrain
⭐
7,166
A PyTorch-based Speech Toolkit
Pyannote Audio
⭐
4,460
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Funasr
⭐
2,315
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
Whisper Diarization
⭐
1,538
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Uis Rnn
⭐
1,491
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
Awesome Diarization
⭐
1,384
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Whisper Timestamped
⭐
1,217
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Diart
⭐
635
A Python package to build AI-powered real-time audio applications
Spectralcluster
⭐
480
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
Wespeaker
⭐
450
Research and Production Oriented Speaker Recognition Toolkit
3d Speaker
⭐
435
A repository for single- and multi-modal speaker verification, speaker recognition and speaker diarization.
Speaker Diarization
⭐
292
Speaker diarization by UIS-RNN and speaker embedding by vgg-speaker-recognition
Pyannote Whisper
⭐
290
Speaker Id
⭐
270
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
Vaf_2
⭐
227
Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.
Eend
⭐
192
End-to-End Neural Diarization
Chatbot Watson Android
⭐
181
An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.
Pytorch_xvectors
⭐
120
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
Uhv Ots Speech
⭐
90
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Awesome Speaker Diarization
⭐
86
Some comprehensive papers about speaker diarization
Dnc
⭐
72
Discriminative Neural Clustering for Speaker Diarisation
Tdnn
⭐
70
Time delay neural network (TDNN) implementation in PyTorch using the unfold method
D Tdnn
⭐
70
PyTorch implementation of Densely Connected Time Delay Neural Network
Watbot
⭐
68
An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Simple_diarizer
⭐
63
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Alimeeting
⭐
57
The project is associated with the recently launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) and provides participants with baseline systems for speech recognition and speaker diarization in conference scenarios.
Speakerdiarization_rnn_cnn_lstm
⭐
52
Speaker diarization is the problem of separating speakers in an audio recording. There can be any number of speakers, and the final result should state when each speaker starts and stops speaking. In this project, we analyze a given audio file with 2 channels and 2 speakers (on separate channels).
Fs Eend
⭐
50
The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]
Simpleder
⭐
44
A lightweight library to compute Diarization Error Rate (DER).
Factorized Tdnn
⭐
38
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Is2023 Powerset Diarization
⭐
28
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
Lium
⭐
25
Scripts for LIUM SpkDiarization tools
Minutes
⭐
24
🔭 Speaker diarization via transfer learning
Ge2e Loss
⭐
23
PyTorch implementation of Generalized End-to-End Loss for speaker verification
Rttm Viewer
⭐
22
Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way
Re Verb
⭐
21
Speaker diarization system using an LSTM
Deepaudio Speaker
⭐
20
Neural-network-based speaker embedder
Re Verb
⭐
16
Speaker diarization system using an LSTM
Nn Similarity Diarization
⭐
16
Neural-network-based similarity scoring for diarization (PyTorch implementation of "LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization")
Resource_speech
⭐
16
A collection of resources on speech algorithms (Resource for Speech Processing) || NEWS: the official VoxCeleb link has recently stopped working, so an external download link has been added
Minivox
⭐
14
Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
Speech_signal_processing
⭐
11
Vb_diarization
⭐
10
VB Diarization with Eigenvoice and HMM Priors, refactored
Falcon
⭐
8
On-device speaker diarization powered by deep learning
Speaker Diarization
⭐
6
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
Speakerdiarization
⭐
6
Audio-based speaker diarization
Smooth Convex Kl Nmf
⭐
5
Repository holding various implementations of specific NMF methods for speaker diarization
Csda
⭐
5
Companion repository for the paper "Continual Self-supervised Domain Adaptation for End-to-end Speaker Diarization"
Copyright 2018-2024 Awesome Open Source. All rights reserved.