Awesome Open Source

Search results for speaker diarization

50 search results found

NeMo: a toolkit for conversational AI

Espnet ⭐ 7,563

End-to-End Speech Processing Toolkit

Speechbrain ⭐ 7,166

A PyTorch-based Speech Toolkit

Pyannote Audio ⭐ 4,460

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Funasr ⭐ 2,315

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.

Whisper Diarization ⭐ 1,538

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Uis Rnn ⭐ 1,491

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Awesome Diarization ⭐ 1,384

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Whisper Timestamped ⭐ 1,217

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

A python package to build AI-powered real-time audio applications

Spectralcluster ⭐ 480

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

Wespeaker ⭐ 450

Research and Production Oriented Speaker Recognition Toolkit

3d Speaker ⭐ 435

A repository for single- and multi-modal speaker verification, speaker recognition and speaker diarization.

Speaker Diarization ⭐ 292

Speaker diarization with UIS-RNN, using speaker embeddings from vgg-speaker-recognition

Pyannote Whisper ⭐ 290

Speaker Id ⭐ 270

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.

End-to-End Neural Diarization

Chatbot Watson Android ⭐ 181

An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.

Pytorch_xvectors ⭐ 120

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

Uhv Ots Speech ⭐ 90

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

Awesome Speaker Diarization ⭐ 86

Some comprehensive papers about speaker diarization

Discriminative Neural Clustering for Speaker Diarisation

Time delay neural network (TDNN) implementation in PyTorch using the unfold method

PyTorch implementation of Densely Connected Time Delay Neural Network

An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.

Simple_diarizer ⭐ 63

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

Alimeeting ⭐ 57

This project is associated with the ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) and provides participants with baseline systems for speech recognition and speaker diarization in conference scenarios.

Speakerdiarization_rnn_cnn_lstm ⭐ 52

Speaker diarization is the problem of separating the speakers in an audio recording. There can be any number of speakers, and the final result should state when each speaker starts and stops talking. In this project, we analyze a given audio file with 2 channels and 2 speakers (on separate channels).

The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]

Simpleder ⭐ 44

A lightweight library to compute Diarization Error Rate (DER).
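
As a rough illustration of the metric named above, DER is commonly defined as the sum of missed speech, false-alarm speech, and speaker-confusion time, divided by the total reference speech time. The sketch below is a simplified frame-based version and is not the API of Simpleder or any other library listed here. The function names, the `(start, end, speaker)` segment format, and the 10 ms frame step are assumptions made for illustration; it ignores overlapping speech and forgiveness collars, and it finds the speaker mapping by brute force, which is only practical for a handful of speakers.

```python
from itertools import permutations

def frame_labels(segments, n_frames, step):
    """Label each frame with the speaker active at its midpoint (None = silence)."""
    labels = [None] * n_frames
    for start, end, speaker in segments:
        for i in range(n_frames):
            if start <= (i + 0.5) * step < end:
                labels[i] = speaker
    return labels

def diarization_error_rate(reference, hypothesis, step=0.01):
    """Simplified DER: (missed + false alarm + confusion) / reference speech."""
    end_time = max(end for _, end, _ in reference + hypothesis)
    n_frames = int(round(end_time / step))
    ref = frame_labels(reference, n_frames, step)
    hyp = frame_labels(hypothesis, n_frames, step)

    ref_speech = sum(r is not None for r in ref)
    missed = sum(r is not None and h is None for r, h in zip(ref, hyp))
    false_alarm = sum(r is None and h is not None for r, h in zip(ref, hyp))

    # Frames where both reference and hypothesis contain speech.
    both = [(r, h) for r, h in zip(ref, hyp) if r is not None and h is not None]
    ref_spk = sorted({r for r in ref if r is not None})
    hyp_spk = sorted({h for h in hyp if h is not None})

    # Try every one-to-one mapping of hypothesis speakers onto reference
    # speakers (padding with None when the hypothesis has extra speakers)
    # and keep the mapping with the most correctly attributed frames.
    padded_ref = ref_spk + [None] * max(0, len(hyp_spk) - len(ref_spk))
    best_correct = 0
    for perm in permutations(padded_ref, len(hyp_spk)):
        mapping = dict(zip(hyp_spk, perm))
        correct = sum(mapping[h] == r for r, h in both)
        best_correct = max(best_correct, correct)
    confusion = len(both) - best_correct

    return (missed + false_alarm + confusion) / ref_speech

reference = [(0.0, 5.0, "A"), (5.0, 10.0, "B")]
hypothesis = [(0.0, 4.0, "s1"), (4.0, 10.0, "s2")]
der = diarization_error_rate(reference, hypothesis)
print(f"DER = {der:.3f}")
```

In this example the hypothesis assigns one second of speaker A's speech to the wrong speaker, so the only error component is confusion. Hypothesis speaker labels do not need to match the reference labels, because the mapping step pairs them up.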

Factorized Tdnn ⭐ 38

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

Is2023 Powerset Diarization ⭐ 28

Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.

Scripts for LIUM SpkDiarization tools

🔭 Speaker diarization via transfer learning

Ge2e Loss ⭐ 23

PyTorch implementation of Generalized End-to-End Loss for speaker verification

Rttm Viewer ⭐ 22

Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way
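
For readers unfamiliar with the format, an RTTM file stores one speaker turn per line as ten space-separated fields: type, file ID, channel, onset time, duration, orthography, speaker type, speaker name, confidence, and lookahead time, with `<NA>` marking unused fields. The sketch below shows one way such lines could be read in Python. It is not code from the Rttm Viewer project; the `Turn` type, the `parse_rttm` function, and the sample data are hypothetical names and values chosen for illustration.

```python
from typing import NamedTuple

class Turn(NamedTuple):
    file_id: str
    start: float      # onset in seconds
    duration: float   # length in seconds
    speaker: str

def parse_rttm(text):
    """Return the SPEAKER turns found in RTTM-formatted text."""
    turns = []
    for line in text.splitlines():
        fields = line.split()
        # Skip blank lines, short lines, and non-SPEAKER record types.
        if len(fields) < 8 or fields[0] != "SPEAKER":
            continue
        turns.append(
            Turn(
                file_id=fields[1],
                start=float(fields[3]),
                duration=float(fields[4]),
                speaker=fields[7],
            )
        )
    return turns

sample = (
    "SPEAKER meeting1 1 0.00 2.50 <NA> <NA> spk_a <NA> <NA>\n"
    "SPEAKER meeting1 1 2.50 1.25 <NA> <NA> spk_b <NA> <NA>\n"
)
turns = parse_rttm(sample)
for turn in turns:
    print(turn.speaker, turn.start, turn.start + turn.duration)
```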

Speaker diarization system using an LSTM

Deepaudio Speaker ⭐ 20

Neural-network-based speaker embedder

Speaker diarization system using an LSTM

Nn Similarity Diarization ⭐ 16

Neural-network-based similarity scoring for diarization (PyTorch implementation of "LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization")

Resource_speech ⭐ 16

A collection of resources for speech processing algorithms. News: the official VoxCeleb link has been failing recently, so an external download link has been added.

Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".

Speech_signal_processing ⭐ 11

Vb_diarization ⭐ 10

VB Diarization with Eigenvoice and HMM Priors, refactored

On-device speaker diarization powered by deep learning

Speaker Diarization ⭐ 6

Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python

Speakerdiarization ⭐ 6

Audio-based speaker diarization

Smooth Convex Kl Nmf ⭐ 5

Repository holding various implementations of specific NMF methods for speaker diarization

Companion repository for the paper "Continual Self-supervised Domain Adaptation for End-to-end Speaker Diarization"

Copyright 2018-2024 Awesome Open Source. All rights reserved.