Title: Improving Speaker Diarization using Semantic Information: Joint Pairwise Constraints Propagation

Sep 19, 2023

Luyao Cheng, Siqi Zheng, Qinglin Zhang, Hui Wang, Yafeng Chen, Qian Chen, Shiliang Zhang

Abstract: Speaker diarization has gained considerable attention within the speech processing research community. Mainstream speaker diarization systems rely primarily on speakers' voice characteristics extracted from acoustic signals and often overlook the potential of semantic information. Because speech signals also convey the content of what is said, we aim to fully exploit these semantic cues using language models. In this work, we propose a novel approach to effectively leverage semantic information in clustering-based speaker diarization systems. First, we introduce spoken language understanding modules to extract speaker-related semantic information and use this information to construct pairwise constraints. Second, we present a novel framework to integrate these constraints into the speaker diarization pipeline, enhancing the performance of the entire system. Extensive experiments conducted on a public dataset demonstrate the consistent superiority of our proposed approach over acoustic-only speaker diarization systems.

* Submitted to ICASSP 2024
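
To make the idea of pairwise constraints in clustering-based diarization concrete, here is a minimal sketch. It is not the framework proposed in the paper: it only shows one common, generic way to inject must-link and cannot-link pairs into a segment affinity matrix before clustering. The function name `apply_pairwise_constraints`, the toy affinity values, and the example constraint pairs are all assumptions made for illustration.

```python
import numpy as np


def apply_pairwise_constraints(affinity, must_link, cannot_link):
    """Return a copy of `affinity` with constrained pairs overwritten.

    Hypothetical helper: must-link pairs are set to 1.0 and cannot-link
    pairs to 0.0, symmetrically. The input matrix is left unmodified.
    """
    adjusted = np.array(affinity, dtype=float, copy=True)
    for i, j in must_link:
        adjusted[i, j] = adjusted[j, i] = 1.0
    for i, j in cannot_link:
        adjusted[i, j] = adjusted[j, i] = 0.0
    return adjusted


# Toy acoustic affinity between four speech segments (made-up values).
acoustic = np.array([
    [1.0, 0.6, 0.5, 0.2],
    [0.6, 1.0, 0.4, 0.3],
    [0.5, 0.4, 1.0, 0.7],
    [0.2, 0.3, 0.7, 1.0],
])

# Assumed semantic cues: segments 0 and 1 share a speaker; 0 and 2 do not.
adjusted = apply_pairwise_constraints(
    acoustic, must_link=[(0, 1)], cannot_link=[(0, 2)]
)
```

The adjusted matrix could then be passed to any affinity-based clustering step. How the constraints are extracted from text and propagated jointly is described in the paper itself and is not reproduced here.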
