Picture for Alexander Polok

Alexander Polok

DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition

Add code
Dec 30, 2024
Viaarxiv icon

BenCzechMark : A Czech-centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism

Add code
Dec 23, 2024
Viaarxiv icon

Aligning Pre-trained Models for Spoken Language Translation

Add code
Nov 27, 2024
Viaarxiv icon

Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models

Add code
Oct 22, 2024
Viaarxiv icon

Target Speaker ASR with Whisper

Add code
Sep 14, 2024
Viaarxiv icon