Picture for Hengyi Hong

Hengyi Hong

MVANet: Multi-Stage Video Attention Network for Sound Event Localization and Detection with Source Distance Estimation

Add code
Nov 21, 2024
Viaarxiv icon