Picture for Timo Gerkmann

Timo Gerkmann

Department of Informatics, University of Hamburg, Hamburg, Germany

Real-Time Streaming Mel Vocoding with Generative Flow Matching

Add code
Sep 18, 2025
Figure 1 for Real-Time Streaming Mel Vocoding with Generative Flow Matching
Figure 2 for Real-Time Streaming Mel Vocoding with Generative Flow Matching
Figure 3 for Real-Time Streaming Mel Vocoding with Generative Flow Matching
Viaarxiv icon

Self-Steering Deep Non-Linear Spatially Selective Filters for Efficient Extraction of Moving Speakers under Weak Guidance

Add code
Jul 03, 2025
Viaarxiv icon

ReverbFX: A Dataset of Room Impulse Responses Derived from Reverb Effect Plugins for Singing Voice Dereverberation

Add code
May 26, 2025
Viaarxiv icon

Steering Deep Non-Linear Spatially Selective Filters for Weakly Guided Extraction of Moving Speakers in Dynamic Scenarios

Add code
May 20, 2025
Viaarxiv icon

Normalize Everything: A Preconditioned Magnitude-Preserving Architecture for Diffusion-Based Speech Enhancement

Add code
May 08, 2025
Viaarxiv icon

FlowDec: A flow-based full-band general audio codec with high perceptual quality

Add code
Mar 03, 2025
Viaarxiv icon

Mask-Weighted Spatial Likelihood Coding for Speaker-Independent Joint Localization and Mask Estimation

Add code
Oct 25, 2024
Figure 1 for Mask-Weighted Spatial Likelihood Coding for Speaker-Independent Joint Localization and Mask Estimation
Figure 2 for Mask-Weighted Spatial Likelihood Coding for Speaker-Independent Joint Localization and Mask Estimation
Figure 3 for Mask-Weighted Spatial Likelihood Coding for Speaker-Independent Joint Localization and Mask Estimation
Viaarxiv icon

Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech

Add code
Oct 23, 2024
Figure 1 for Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech
Figure 2 for Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech
Figure 3 for Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech
Figure 4 for Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech
Viaarxiv icon

HRTF Estimation using a Score-based Prior

Add code
Oct 02, 2024
Figure 1 for HRTF Estimation using a Score-based Prior
Figure 2 for HRTF Estimation using a Score-based Prior
Figure 3 for HRTF Estimation using a Score-based Prior
Figure 4 for HRTF Estimation using a Score-based Prior
Viaarxiv icon

Investigating Training Objectives for Generative Speech Enhancement

Add code
Sep 16, 2024
Figure 1 for Investigating Training Objectives for Generative Speech Enhancement
Figure 2 for Investigating Training Objectives for Generative Speech Enhancement
Figure 3 for Investigating Training Objectives for Generative Speech Enhancement
Viaarxiv icon