Picture for Ilja Baumann

Ilja Baumann

Outlier Reduction with Gated Attention for Improved Post-training Quantization in Large Sequence-to-sequence Speech Foundation Models

Add code
Jun 16, 2024
Viaarxiv icon

Large Language Models for Dysfluency Detection in Stuttered Speech

Add code
Jun 16, 2024
Figure 1 for Large Language Models for Dysfluency Detection in Stuttered Speech
Figure 2 for Large Language Models for Dysfluency Detection in Stuttered Speech
Viaarxiv icon

Optimized Speculative Sampling for GPU Hardware Accelerators

Add code
Jun 16, 2024
Figure 1 for Optimized Speculative Sampling for GPU Hardware Accelerators
Figure 2 for Optimized Speculative Sampling for GPU Hardware Accelerators
Figure 3 for Optimized Speculative Sampling for GPU Hardware Accelerators
Figure 4 for Optimized Speculative Sampling for GPU Hardware Accelerators
Viaarxiv icon

A Survey of Music Generation in the Context of Interaction

Add code
Feb 23, 2024
Viaarxiv icon

Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks

Add code
Jun 10, 2023
Viaarxiv icon

A Stutter Seldom Comes Alone -- Cross-Corpus Stuttering Detection as a Multi-label Problem

Add code
May 30, 2023
Viaarxiv icon

Speaker Adaptation for End-To-End Speech Recognition Systems in Noisy Environments

Add code
Nov 16, 2022
Viaarxiv icon

The Importance of Speech Stimuli for Pathologic Speech Classification

Add code
Oct 28, 2022
Figure 1 for The Importance of Speech Stimuli for Pathologic Speech Classification
Figure 2 for The Importance of Speech Stimuli for Pathologic Speech Classification
Figure 3 for The Importance of Speech Stimuli for Pathologic Speech Classification
Figure 4 for The Importance of Speech Stimuli for Pathologic Speech Classification
Viaarxiv icon

Multi-class Detection of Pathological Speech with Latent Features: How does it perform on unseen data?

Add code
Oct 27, 2022
Figure 1 for Multi-class Detection of Pathological Speech with Latent Features: How does it perform on unseen data?
Figure 2 for Multi-class Detection of Pathological Speech with Latent Features: How does it perform on unseen data?
Figure 3 for Multi-class Detection of Pathological Speech with Latent Features: How does it perform on unseen data?
Figure 4 for Multi-class Detection of Pathological Speech with Latent Features: How does it perform on unseen data?
Viaarxiv icon

Nonwords Pronunciation Classification in Language Development Tests for Preschool Children

Add code
Jun 17, 2022
Figure 1 for Nonwords Pronunciation Classification in Language Development Tests for Preschool Children
Figure 2 for Nonwords Pronunciation Classification in Language Development Tests for Preschool Children
Figure 3 for Nonwords Pronunciation Classification in Language Development Tests for Preschool Children
Figure 4 for Nonwords Pronunciation Classification in Language Development Tests for Preschool Children
Viaarxiv icon