Picture for Dominik Klement

Dominik Klement

DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition

Add code
Dec 30, 2024
Viaarxiv icon

Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization

Add code
Nov 04, 2024
Figure 1 for Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization
Figure 2 for Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization
Figure 3 for Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization
Figure 4 for Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization
Viaarxiv icon

Target Speaker ASR with Whisper

Add code
Sep 14, 2024
Viaarxiv icon

Discriminative Training of VBx Diarization

Add code
Oct 04, 2023
Viaarxiv icon