Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors

Dec 22, 2023

Federico Landini, Mireia Diez, Themos Stafylakis, Lukáš Burget

Figure 1 for DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors

Figure 2 for DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors

Figure 3 for DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors

Figure 4 for DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors

Share this with someone who'll enjoy it:

Abstract:Until recently, the field of speaker diarization was dominated by cascaded systems. Due to their limitations, mainly regarding overlapped speech and cumbersome pipelines, end-to-end models have gained great popularity lately. One of the most successful models is end-to-end neural diarization with encoder-decoder based attractors (EEND-EDA). In this work, we replace the EDA module with a Perceiver-based one and show its advantages over EEND-EDA; namely obtaining better performance on the largely studied Callhome dataset, finding the quantity of speakers in a conversation more accurately, and running inference on almost half of the time on long recordings. Furthermore, when exhaustively compared with other methods, our model, DiaPer, reaches remarkable performance with a very lightweight design. Besides, we perform comparisons with other works and a cascaded baseline across more than ten public wide-band datasets. Together with this publication, we release the code of DiaPer as well as models trained on public and free data.

View paper on

Share this with someone who'll enjoy it:

Title:DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors

Paper and Code