Picture for Julius Richter

Julius Richter

Pushing the Frontier of Audiovisual Perception with Large-Scale Multimodal Correspondence Learning

Add code
Dec 22, 2025
Viaarxiv icon

SAM Audio: Segment Anything in Audio

Add code
Dec 19, 2025
Figure 1 for SAM Audio: Segment Anything in Audio
Figure 2 for SAM Audio: Segment Anything in Audio
Figure 3 for SAM Audio: Segment Anything in Audio
Figure 4 for SAM Audio: Segment Anything in Audio
Viaarxiv icon

ReverbFX: A Dataset of Room Impulse Responses Derived from Reverb Effect Plugins for Singing Voice Dereverberation

Add code
May 26, 2025
Viaarxiv icon

LipDiffuser: Lip-to-Speech Generation with Conditional Diffusion Models

Add code
May 16, 2025
Viaarxiv icon

Normalize Everything: A Preconditioned Magnitude-Preserving Architecture for Diffusion-Based Speech Enhancement

Add code
May 08, 2025
Viaarxiv icon

Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech

Add code
Oct 23, 2024
Figure 1 for Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech
Figure 2 for Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech
Figure 3 for Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech
Figure 4 for Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech
Viaarxiv icon

Investigating Training Objectives for Generative Speech Enhancement

Add code
Sep 16, 2024
Figure 1 for Investigating Training Objectives for Generative Speech Enhancement
Figure 2 for Investigating Training Objectives for Generative Speech Enhancement
Figure 3 for Investigating Training Objectives for Generative Speech Enhancement
Viaarxiv icon

EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation

Add code
Jun 11, 2024
Figure 1 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Figure 2 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Figure 3 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Figure 4 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Viaarxiv icon

The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement

Add code
Jun 05, 2024
Figure 1 for The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement
Figure 2 for The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement
Figure 3 for The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement
Figure 4 for The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement
Viaarxiv icon

Diffusion Models for Audio Restoration

Add code
Feb 15, 2024
Figure 1 for Diffusion Models for Audio Restoration
Figure 2 for Diffusion Models for Audio Restoration
Figure 3 for Diffusion Models for Audio Restoration
Figure 4 for Diffusion Models for Audio Restoration
Viaarxiv icon