Picture for Peipei Wu

Peipei Wu

Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection

Add code
Dec 14, 2023
Viaarxiv icon

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions

Add code
Oct 23, 2023
Viaarxiv icon

CM-PIE: Cross-modal perception for interactive-enhanced audio-visual video parsing

Add code
Oct 11, 2023
Viaarxiv icon

Text-Driven Foley Sound Generation With Latent Diffusion Model

Add code
Jun 23, 2023
Viaarxiv icon