Picture for Xuesong Zhang

Xuesong Zhang

Agent Journey Beyond RGB: Unveiling Hybrid Semantic-Spatial Environmental Representations for Vision-and-Language Navigation

Add code
Dec 10, 2024
Viaarxiv icon

Seeing is Believing? Enhancing Vision-Language Navigation using Visual Perturbations

Add code
Sep 09, 2024
Viaarxiv icon

Adversarial Score Distillation: When score distillation meets GAN

Add code
Dec 01, 2023
Viaarxiv icon

Multimodal Feature Extraction and Fusion for Emotional Reaction Intensity Estimation and Expression Classification in Videos with Transformers

Add code
Mar 16, 2023
Viaarxiv icon

Super-Resolution Neural Operator

Add code
Mar 05, 2023
Viaarxiv icon