Fabio Antonacci

MambaFoley: Foley Sound Generation using Selective State-Space Models

Sep 13, 2024

A Physics-Informed Neural Network-Based Approach for the Spatial Upsampling of Spherical Microphone Arrays

Jul 26, 2024

PAGURI: a user experience study of creative interaction with text-to-music models

Jul 05, 2024

Interpreting End-to-End Deep Learning Models for Speech Source Localization Using Layer-wise Relevance Propagation

Apr 04, 2024

Synthesizing Soundscapes: Leveraging Text-to-Audio Models for Environmental Sound Classification

Mar 26, 2024

Physics-Informed Neural Network for Volumetric Sound field Reconstruction of Speech Signals

Mar 14, 2024

HOMULA-RIR: A Room Impulse Response Dataset for Teleconferencing and Spatial Audio Applications Acquired Through Higher-Order Microphones and Uniform Linear Microphone Arrays

Feb 21, 2024

Room transfer function reconstruction using complex-valued neural networks and irregularly distributed microphones

Feb 01, 2024

Reconstruction of Sound Field through Diffusion Models

Dec 14, 2023

Timbre transfer using image-to-image denoising diffusion implicit models

Jul 28, 2023