Picture for Hongze Shen

Hongze Shen

Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision

Add code
Jan 27, 2026
Viaarxiv icon

HERM: Benchmarking and Enhancing Multimodal LLMs for Human-Centric Understanding

Add code
Oct 09, 2024
Viaarxiv icon

How Drones Look: Crowdsourced Knowledge Transfer for Aerial Video Saliency Prediction

Add code
Nov 14, 2018
Figure 1 for How Drones Look: Crowdsourced Knowledge Transfer for Aerial Video Saliency Prediction
Figure 2 for How Drones Look: Crowdsourced Knowledge Transfer for Aerial Video Saliency Prediction
Figure 3 for How Drones Look: Crowdsourced Knowledge Transfer for Aerial Video Saliency Prediction
Figure 4 for How Drones Look: Crowdsourced Knowledge Transfer for Aerial Video Saliency Prediction
Viaarxiv icon