Picture for Zhibin Wen

Zhibin Wen

Learning to Unify Audio, Visual and Text for Audio-Enhanced Multilingual Visual Answer Localization

Add code
Nov 05, 2024
Viaarxiv icon