Xiangbo Shu

EventCrab: Harnessing Frame and Point Synergy for Event-based Action Recognition and Beyond

Nov 27, 2024

FTMoMamba: Motion Generation with Frequency and Text State Space Models

Nov 26, 2024

UnitedVLN: Generalizable Gaussian Splatting for Continuous Vision-Language Navigation

Nov 25, 2024

FedMLLM: Federated Fine-tuning MLLM on Multimodal Heterogeneity Data

Nov 22, 2024

HazyDet: Open-source Benchmark for Drone-view Object Detection with Depth-cues in Hazy Scenes

Sep 30, 2024

GrokLST: Towards High-Resolution Benchmark and Toolkit for Land Surface Temperature Downscaling

Sep 30, 2024

Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization

Sep 12, 2024

The SkatingVerse Workshop & Challenge: Methods and Results

May 27, 2024

AdaFPP: Adapt-Focused Bi-Propagating Prototype Learning for Panoramic Activity Recognition

May 04, 2024

MVP-Shot: Multi-Velocity Progressive-Alignment Framework for Few-Shot Action Recognition

May 03, 2024