Picture for Timo Gerkmann

Timo Gerkmann

Department of Informatics, University of Hamburg, Hamburg, Germany

Mask-Weighted Spatial Likelihood Coding for Speaker-Independent Joint Localization and Mask Estimation

Add code
Oct 25, 2024
Viaarxiv icon

Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech

Add code
Oct 23, 2024
Figure 1 for Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech
Figure 2 for Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech
Figure 3 for Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech
Figure 4 for Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech
Viaarxiv icon

HRTF Estimation using a Score-based Prior

Add code
Oct 02, 2024
Figure 1 for HRTF Estimation using a Score-based Prior
Figure 2 for HRTF Estimation using a Score-based Prior
Figure 3 for HRTF Estimation using a Score-based Prior
Figure 4 for HRTF Estimation using a Score-based Prior
Viaarxiv icon

Investigating Training Objectives for Generative Speech Enhancement

Add code
Sep 16, 2024
Figure 1 for Investigating Training Objectives for Generative Speech Enhancement
Figure 2 for Investigating Training Objectives for Generative Speech Enhancement
Figure 3 for Investigating Training Objectives for Generative Speech Enhancement
Viaarxiv icon

Dynamics of Collective Group Affect: Group-level Annotations and the Multimodal Modeling of Convergence and Divergence

Add code
Sep 13, 2024
Figure 1 for Dynamics of Collective Group Affect: Group-level Annotations and the Multimodal Modeling of Convergence and Divergence
Figure 2 for Dynamics of Collective Group Affect: Group-level Annotations and the Multimodal Modeling of Convergence and Divergence
Figure 3 for Dynamics of Collective Group Affect: Group-level Annotations and the Multimodal Modeling of Convergence and Divergence
Figure 4 for Dynamics of Collective Group Affect: Group-level Annotations and the Multimodal Modeling of Convergence and Divergence
Viaarxiv icon

Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models

Add code
Aug 14, 2024
Viaarxiv icon

Robustness of Speech Separation Models for Similar-pitch Speakers

Add code
Jul 22, 2024
Viaarxiv icon

EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation

Add code
Jun 11, 2024
Figure 1 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Figure 2 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Figure 3 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Figure 4 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Viaarxiv icon

The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement

Add code
Jun 05, 2024
Figure 1 for The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement
Figure 2 for The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement
Figure 3 for The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement
Figure 4 for The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement
Viaarxiv icon

BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models

Add code
May 07, 2024
Figure 1 for BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models
Figure 2 for BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models
Viaarxiv icon