Picture for Yuxin Mao

Yuxin Mao

Towards Open-Vocabulary Audio-Visual Event Localization

Add code
Nov 18, 2024
Viaarxiv icon

Label-anticipated Event Disentanglement for Audio-Visual Video Parsing

Add code
Jul 11, 2024
Viaarxiv icon

You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet

Add code
May 31, 2024
Figure 1 for You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet
Figure 2 for You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet
Figure 3 for You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet
Figure 4 for You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet
Viaarxiv icon

TAVGBench: Benchmarking Text to Audible-Video Generation

Add code
Apr 22, 2024
Viaarxiv icon

Multimodal Variational Auto-encoder based Audio-Visual Segmentation

Add code
Oct 12, 2023
Viaarxiv icon

RPEFlow: Multimodal Fusion of RGB-PointCloud-Event for Joint Optical Flow and Scene Flow Estimation

Add code
Sep 26, 2023
Viaarxiv icon

Decomposed Guided Dynamic Filters for Efficient RGB-Guided Depth Completion

Add code
Sep 05, 2023
Viaarxiv icon

Improving Audio-Visual Segmentation with Bidirectional Generation

Add code
Aug 16, 2023
Viaarxiv icon

Contrastive Conditional Latent Diffusion for Audio-visual Segmentation

Add code
Jul 31, 2023
Viaarxiv icon

Mutual Information Regularization for Weakly-supervised RGB-D Salient Object Detection

Add code
Jun 06, 2023
Viaarxiv icon