Picture for Shutao Li

Shutao Li

Fellow, IEEE

SARCLIP: A Vision Language Foundation Model for Semantic Understanding and Target Recognition in SAR Imagery

Add code
Oct 26, 2025
Viaarxiv icon

Physics-Driven Spatiotemporal Modeling for AI-Generated Video Detection

Add code
Oct 09, 2025
Figure 1 for Physics-Driven Spatiotemporal Modeling for AI-Generated Video Detection
Figure 2 for Physics-Driven Spatiotemporal Modeling for AI-Generated Video Detection
Figure 3 for Physics-Driven Spatiotemporal Modeling for AI-Generated Video Detection
Figure 4 for Physics-Driven Spatiotemporal Modeling for AI-Generated Video Detection
Viaarxiv icon

MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook

Add code
Sep 17, 2025
Figure 1 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Figure 2 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Figure 3 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Figure 4 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Viaarxiv icon

Multimodal Prompt Alignment for Facial Expression Recognition

Add code
Jun 26, 2025
Viaarxiv icon

NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment

Add code
May 22, 2025
Viaarxiv icon

Panoramic Out-of-Distribution Segmentation

Add code
May 06, 2025
Viaarxiv icon

Learning from Noisy Pseudo-labels for All-Weather Land Cover Mapping

Add code
Apr 18, 2025
Viaarxiv icon

DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection

Add code
Oct 23, 2024
Viaarxiv icon

Agriculture-Vision Challenge 2024 -- The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble

Add code
Jun 18, 2024
Figure 1 for Agriculture-Vision Challenge 2024 -- The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble
Figure 2 for Agriculture-Vision Challenge 2024 -- The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble
Figure 3 for Agriculture-Vision Challenge 2024 -- The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble
Figure 4 for Agriculture-Vision Challenge 2024 -- The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble
Viaarxiv icon

DrVideo: Document Retrieval Based Long Video Understanding

Add code
Jun 18, 2024
Figure 1 for DrVideo: Document Retrieval Based Long Video Understanding
Figure 2 for DrVideo: Document Retrieval Based Long Video Understanding
Figure 3 for DrVideo: Document Retrieval Based Long Video Understanding
Figure 4 for DrVideo: Document Retrieval Based Long Video Understanding
Viaarxiv icon