Picture for Ralf Schlüter

Ralf Schlüter

The Conformer Encoder May Reverse the Time Dimension

Add code
Oct 01, 2024
Viaarxiv icon

On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition

Add code
Jul 31, 2024
Viaarxiv icon

On the Effect of Purely Synthetic Training Data for Different Automatic Speech Recognition Architectures

Add code
Jul 25, 2024
Viaarxiv icon

Investigating the Effect of Label Topology and Training Criterion on ASR Performance and Alignment Quality

Add code
Jul 16, 2024
Viaarxiv icon

On the Relevance of Phoneme Duration Variability of Synthesized Training Data for Automatic Speech Recognition

Add code
Oct 12, 2023
Viaarxiv icon

Investigating the Effect of Language Models in Sequence Discriminative Training for Neural Transducers

Add code
Oct 11, 2023
Viaarxiv icon

End-to-End Training of a Neural HMM with Label and Transition Probabilities

Add code
Oct 09, 2023
Viaarxiv icon

On the Relation between Internal Language Model and Sequence Discriminative Training for Neural Transducers

Add code
Sep 25, 2023
Viaarxiv icon

Mixture Encoder Supporting Continuous Speech Separation for Meeting Recognition

Add code
Sep 15, 2023
Viaarxiv icon

Chunked Attention-based Encoder-Decoder Model for Streaming Speech Recognition

Add code
Sep 15, 2023
Viaarxiv icon