Picture for Aleksei Romanenko

Aleksei Romanenko

STCON System for the CHiME-8 Challenge

Add code
Oct 17, 2024
Viaarxiv icon

Uconv-Conformer: High Reduction of Input Sequence Length for End-to-End Speech Recognition

Add code
Aug 16, 2022
Figure 1 for Uconv-Conformer: High Reduction of Input Sequence Length for End-to-End Speech Recognition
Figure 2 for Uconv-Conformer: High Reduction of Input Sequence Length for End-to-End Speech Recognition
Figure 3 for Uconv-Conformer: High Reduction of Input Sequence Length for End-to-End Speech Recognition
Figure 4 for Uconv-Conformer: High Reduction of Input Sequence Length for End-to-End Speech Recognition
Viaarxiv icon

LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring

Add code
Apr 06, 2021
Figure 1 for LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring
Figure 2 for LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring
Figure 3 for LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring
Figure 4 for LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring
Viaarxiv icon

Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario

Add code
May 14, 2020
Figure 1 for Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario
Figure 2 for Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario
Figure 3 for Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario
Figure 4 for Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario
Viaarxiv icon