Picture for Jan Černocký

Jan Černocký

Aligning Pre-trained Models for Spoken Language Translation

Add code
Nov 27, 2024
Viaarxiv icon

Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models

Add code
Oct 22, 2024
Viaarxiv icon

State-of-the-art Embeddings with Video-free Segmentation of the Source VoxCeleb Data

Add code
Oct 03, 2024
Viaarxiv icon

Target Speaker ASR with Whisper

Add code
Sep 14, 2024
Viaarxiv icon

Improving Speaker Verification with Self-Pretrained Transformer Models

Add code
May 17, 2023
Viaarxiv icon

Neural Target Speech Extraction: An Overview

Add code
Jan 31, 2023
Viaarxiv icon

ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications

Add code
Nov 08, 2022
Viaarxiv icon

Parameter-efficient transfer learning of pre-trained Transformer models for speaker verification using adapters

Add code
Oct 28, 2022
Viaarxiv icon

Analysis of impact of emotions on target speech extraction and speech separation

Add code
Aug 15, 2022
Figure 1 for Analysis of impact of emotions on target speech extraction and speech separation
Figure 2 for Analysis of impact of emotions on target speech extraction and speech separation
Figure 3 for Analysis of impact of emotions on target speech extraction and speech separation
Figure 4 for Analysis of impact of emotions on target speech extraction and speech separation
Viaarxiv icon

MultiSV: Dataset for Far-Field Multi-Channel Speaker Verification

Add code
Nov 11, 2021
Figure 1 for MultiSV: Dataset for Far-Field Multi-Channel Speaker Verification
Figure 2 for MultiSV: Dataset for Far-Field Multi-Channel Speaker Verification
Figure 3 for MultiSV: Dataset for Far-Field Multi-Channel Speaker Verification
Viaarxiv icon