Picture for Wenrui Li

Wenrui Li

SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting

Add code
Aug 25, 2024
Viaarxiv icon

Riemann-based Multi-scale Attention Reasoning Network for Text-3D Retrieval

Add code
Aug 25, 2024
Viaarxiv icon

Sample-agnostic Adversarial Perturbation for Vision-Language Pre-training Models

Add code
Aug 06, 2024
Viaarxiv icon

A Unified Understanding of Adversarial Vulnerability Regarding Unimodal Models and Vision-Language Pre-training Models

Add code
Jul 25, 2024
Viaarxiv icon

Spiking Tucker Fusion Transformer for Audio-Visual Zero-Shot Learning

Add code
Jul 11, 2024
Figure 1 for Spiking Tucker Fusion Transformer for Audio-Visual Zero-Shot Learning
Figure 2 for Spiking Tucker Fusion Transformer for Audio-Visual Zero-Shot Learning
Figure 3 for Spiking Tucker Fusion Transformer for Audio-Visual Zero-Shot Learning
Figure 4 for Spiking Tucker Fusion Transformer for Audio-Visual Zero-Shot Learning
Viaarxiv icon

SHMamba: Structured Hyperbolic State Space Model for Audio-Visual Question Answering

Add code
Jun 14, 2024
Viaarxiv icon

Weakly-supervised causal discovery based on fuzzy knowledge and complex data complementarity

Add code
May 14, 2024
Viaarxiv icon

Design and Optimization of Cooperative Sensing With Limited Backhaul Capacity

Add code
Apr 04, 2024
Figure 1 for Design and Optimization of Cooperative Sensing With Limited Backhaul Capacity
Figure 2 for Design and Optimization of Cooperative Sensing With Limited Backhaul Capacity
Figure 3 for Design and Optimization of Cooperative Sensing With Limited Backhaul Capacity
Figure 4 for Design and Optimization of Cooperative Sensing With Limited Backhaul Capacity
Viaarxiv icon

SpikeMba: Multi-Modal Spiking Saliency Mamba for Temporal Video Grounding

Add code
Apr 01, 2024
Viaarxiv icon

MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations

Add code
Mar 20, 2024
Viaarxiv icon