Picture for Alexander Richard

Alexander Richard

University of Bonn

FlowDec: A flow-based full-band general audio codec with high perceptual quality

Add code
Mar 03, 2025
Viaarxiv icon

AV-Flow: Transforming Text to Audio-Visual Human-like Interactions

Add code
Feb 18, 2025
Viaarxiv icon

ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling

Add code
Feb 04, 2025
Figure 1 for ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling
Figure 2 for ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling
Figure 3 for ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling
Figure 4 for ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling
Viaarxiv icon

Modeling and Driving Human Body Soundfields through Acoustic Primitives

Add code
Jul 18, 2024
Viaarxiv icon

EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation

Add code
Jun 11, 2024
Figure 1 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Figure 2 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Figure 3 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Figure 4 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Viaarxiv icon

Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark

Add code
Mar 27, 2024
Viaarxiv icon

ScoreDec: A Phase-preserving High-Fidelity Audio Codec with A Generalized Score-based Diffusion Post-filter

Add code
Jan 22, 2024
Viaarxiv icon

From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations

Add code
Jan 03, 2024
Figure 1 for From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Figure 2 for From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Figure 3 for From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Figure 4 for From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Viaarxiv icon

Sounding Bodies: Modeling 3D Spatial Sound of Humans Using Body Pose and Audio

Add code
Nov 01, 2023
Figure 1 for Sounding Bodies: Modeling 3D Spatial Sound of Humans Using Body Pose and Audio
Figure 2 for Sounding Bodies: Modeling 3D Spatial Sound of Humans Using Body Pose and Audio
Figure 3 for Sounding Bodies: Modeling 3D Spatial Sound of Humans Using Body Pose and Audio
Figure 4 for Sounding Bodies: Modeling 3D Spatial Sound of Humans Using Body Pose and Audio
Viaarxiv icon

AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec

Add code
May 26, 2023
Figure 1 for AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec
Figure 2 for AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec
Figure 3 for AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec
Figure 4 for AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec
Viaarxiv icon