Picture for Yiyang Nan

Yiyang Nan

SaSR-Net: Source-Aware Semantic Representation Network for Enhancing Audio-Visual Question Answering

Add code
Nov 07, 2024
Figure 1 for SaSR-Net: Source-Aware Semantic Representation Network for Enhancing Audio-Visual Question Answering
Figure 2 for SaSR-Net: Source-Aware Semantic Representation Network for Enhancing Audio-Visual Question Answering
Figure 3 for SaSR-Net: Source-Aware Semantic Representation Network for Enhancing Audio-Visual Question Answering
Figure 4 for SaSR-Net: Source-Aware Semantic Representation Network for Enhancing Audio-Visual Question Answering
Viaarxiv icon

Continual Audio-Visual Sound Separation

Add code
Nov 05, 2024
Viaarxiv icon

Learning to Generate Instruction Tuning Datasets for Zero-Shot Task Adaptation

Add code
Feb 28, 2024
Viaarxiv icon