Picture for Davide Berghi

Davide Berghi

Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation

Add code
Oct 29, 2024
Figure 1 for Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation
Figure 2 for Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation
Figure 3 for Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation
Figure 4 for Leveraging Reverberation and Visual Depth Cues for Sound Event Localization and Detection with Distance Estimation
Viaarxiv icon

Text-Queried Target Sound Event Localization

Add code
Jun 23, 2024
Figure 1 for Text-Queried Target Sound Event Localization
Figure 2 for Text-Queried Target Sound Event Localization
Figure 3 for Text-Queried Target Sound Event Localization
Figure 4 for Text-Queried Target Sound Event Localization
Viaarxiv icon

Audio-Visual Talker Localization in Video for Spatial Sound Reproduction

Add code
Jun 01, 2024
Figure 1 for Audio-Visual Talker Localization in Video for Spatial Sound Reproduction
Figure 2 for Audio-Visual Talker Localization in Video for Spatial Sound Reproduction
Figure 3 for Audio-Visual Talker Localization in Video for Spatial Sound Reproduction
Figure 4 for Audio-Visual Talker Localization in Video for Spatial Sound Reproduction
Viaarxiv icon

Leveraging Visual Supervision for Array-based Active Speaker Detection and Localization

Add code
Dec 21, 2023
Viaarxiv icon

Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection

Add code
Dec 14, 2023
Viaarxiv icon

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions

Add code
Oct 23, 2023
Viaarxiv icon

Audio Inputs for Active Speaker Detection and Localization via Microphone Array

Add code
Jul 27, 2023
Figure 1 for Audio Inputs for Active Speaker Detection and Localization via Microphone Array
Figure 2 for Audio Inputs for Active Speaker Detection and Localization via Microphone Array
Figure 3 for Audio Inputs for Active Speaker Detection and Localization via Microphone Array
Figure 4 for Audio Inputs for Active Speaker Detection and Localization via Microphone Array
Viaarxiv icon

Tragic Talkers: A Shakespearean Sound- and Light-Field Dataset for Audio-Visual Machine Learning Research

Add code
Dec 04, 2022
Viaarxiv icon

Visually Supervised Speaker Detection and Localization via Microphone Array

Add code
Mar 07, 2022
Figure 1 for Visually Supervised Speaker Detection and Localization via Microphone Array
Figure 2 for Visually Supervised Speaker Detection and Localization via Microphone Array
Figure 3 for Visually Supervised Speaker Detection and Localization via Microphone Array
Figure 4 for Visually Supervised Speaker Detection and Localization via Microphone Array
Viaarxiv icon

Naturalistic audio-visual volumetric sequences dataset of sounding actions for six degree-of-freedom interaction

Add code
May 03, 2021
Figure 1 for Naturalistic audio-visual volumetric sequences dataset of sounding actions for six degree-of-freedom interaction
Figure 2 for Naturalistic audio-visual volumetric sequences dataset of sounding actions for six degree-of-freedom interaction
Viaarxiv icon