Speaker Diarization


Speaker diarization is the process of segmenting and clustering speech signals to identify different speakers in an audio recording.

Long-Term Conversation Analysis: Privacy-Utility Trade-off under Noise and Reverberation

Add code
Aug 01, 2024
Viaarxiv icon

NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks

Add code
Aug 23, 2024
Figure 1 for NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks
Figure 2 for NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks
Figure 3 for NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks
Figure 4 for NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks
Viaarxiv icon

Systematic Evaluation of Online Speaker Diarization Systems Regarding their Latency

Add code
Jul 05, 2024
Viaarxiv icon

Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders

Add code
Jul 02, 2024
Figure 1 for Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders
Figure 2 for Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders
Figure 3 for Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders
Figure 4 for Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders
Viaarxiv icon

Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning

Add code
Jul 21, 2024
Viaarxiv icon

Leveraging Speaker Embeddings in End-to-End Neural Diarization for Two-Speaker Scenarios

Add code
Jul 01, 2024
Viaarxiv icon

Investigating Confidence Estimation Measures for Speaker Diarization

Add code
Jun 24, 2024
Viaarxiv icon

System Description for the Displace Speaker Diarization Challenge 2023

Add code
Jun 20, 2024
Viaarxiv icon

Exploring Speech Foundation Models for Speaker Diarization in Child-Adult Dyadic Interactions

Add code
Jun 12, 2024
Viaarxiv icon

psifx -- Psychological and Social Interactions Feature Extraction Package

Add code
Jul 16, 2024
Figure 1 for psifx -- Psychological and Social Interactions Feature Extraction Package
Figure 2 for psifx -- Psychological and Social Interactions Feature Extraction Package
Figure 3 for psifx -- Psychological and Social Interactions Feature Extraction Package
Figure 4 for psifx -- Psychological and Social Interactions Feature Extraction Package
Viaarxiv icon