Picture for Lukáš Burget

Lukáš Burget

Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization

Add code
Nov 04, 2024
Viaarxiv icon

Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models

Add code
Oct 22, 2024
Viaarxiv icon

State-of-the-art Embeddings with Video-free Segmentation of the Source VoxCeleb Data

Add code
Oct 03, 2024
Viaarxiv icon

Target Speaker ASR with Whisper

Add code
Sep 14, 2024
Viaarxiv icon

BUT Systems and Analyses for the ASVspoof 5 Challenge

Add code
Aug 20, 2024
Viaarxiv icon

Beyond the Labels: Unveiling Text-Dependency in Paralinguistic Speech Recognition Datasets

Add code
Mar 12, 2024
Viaarxiv icon

Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information?

Add code
Feb 29, 2024
Viaarxiv icon

DiaPer: End-to-End Neural Diarization with Perceiver-Based Attractors

Add code
Dec 22, 2023
Viaarxiv icon

Discriminative Training of VBx Diarization

Add code
Oct 04, 2023
Viaarxiv icon

Hystoc: Obtaining word confidences for fusion of end-to-end ASR systems

Add code
May 21, 2023
Viaarxiv icon