Picture for Shutao Li

Shutao Li

Fellow, IEEE

2nd of the 5th PVUW MeViS-Audio Track: ASR-SaSaSa2VA

Add code
Apr 27, 2026
Viaarxiv icon

ASSR-Net: Anisotropic Structure-Aware and Spectrally Recalibrated Network for Hyperspectral Image Fusion

Add code
Apr 07, 2026
Viaarxiv icon

SARCLIP: A Vision Language Foundation Model for Semantic Understanding and Target Recognition in SAR Imagery

Add code
Oct 26, 2025
Viaarxiv icon

Physics-Driven Spatiotemporal Modeling for AI-Generated Video Detection

Add code
Oct 09, 2025
Figure 1 for Physics-Driven Spatiotemporal Modeling for AI-Generated Video Detection
Figure 2 for Physics-Driven Spatiotemporal Modeling for AI-Generated Video Detection
Figure 3 for Physics-Driven Spatiotemporal Modeling for AI-Generated Video Detection
Figure 4 for Physics-Driven Spatiotemporal Modeling for AI-Generated Video Detection
Viaarxiv icon

MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook

Add code
Sep 17, 2025
Figure 1 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Figure 2 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Figure 3 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Figure 4 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Viaarxiv icon

Multimodal Prompt Alignment for Facial Expression Recognition

Add code
Jun 26, 2025
Viaarxiv icon

NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment

Add code
May 22, 2025
Viaarxiv icon

Panoramic Out-of-Distribution Segmentation

Add code
May 06, 2025
Viaarxiv icon

Learning from Noisy Pseudo-labels for All-Weather Land Cover Mapping

Add code
Apr 18, 2025
Figure 1 for Learning from Noisy Pseudo-labels for All-Weather Land Cover Mapping
Figure 2 for Learning from Noisy Pseudo-labels for All-Weather Land Cover Mapping
Figure 3 for Learning from Noisy Pseudo-labels for All-Weather Land Cover Mapping
Viaarxiv icon

DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection

Add code
Oct 23, 2024
Figure 1 for DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection
Figure 2 for DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection
Figure 3 for DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection
Figure 4 for DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection
Viaarxiv icon