Picture for Alexander Polok

Alexander Polok

DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition

Add code
Dec 30, 2024
Viaarxiv icon

BenCzechMark : A Czech-centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism

Add code
Dec 23, 2024
Figure 1 for BenCzechMark : A Czech-centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism
Figure 2 for BenCzechMark : A Czech-centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism
Figure 3 for BenCzechMark : A Czech-centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism
Figure 4 for BenCzechMark : A Czech-centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism
Viaarxiv icon

Aligning Pre-trained Models for Spoken Language Translation

Add code
Nov 27, 2024
Viaarxiv icon

Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models

Add code
Oct 22, 2024
Figure 1 for Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models
Figure 2 for Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models
Figure 3 for Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models
Figure 4 for Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models
Viaarxiv icon

Target Speaker ASR with Whisper

Add code
Sep 14, 2024
Viaarxiv icon