Picture for Rui Zhao

Rui Zhao

Department of Radiology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China

Just Noticeable Difference Modeling for Deep Visual Features

Add code
Jan 29, 2026
Viaarxiv icon

RoamScene3D: Immersive Text-to-3D Scene Generation via Adaptive Object-aware Roaming

Add code
Jan 27, 2026
Viaarxiv icon

Performance-guided Reinforced Active Learning for Object Detection

Add code
Jan 22, 2026
Viaarxiv icon

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Add code
Dec 18, 2025
Viaarxiv icon

What Happens Next? Next Scene Prediction with a Unified Video Model

Add code
Dec 15, 2025
Figure 1 for What Happens Next? Next Scene Prediction with a Unified Video Model
Figure 2 for What Happens Next? Next Scene Prediction with a Unified Video Model
Figure 3 for What Happens Next? Next Scene Prediction with a Unified Video Model
Figure 4 for What Happens Next? Next Scene Prediction with a Unified Video Model
Viaarxiv icon

VIDEOP2R: Video Understanding from Perception to Reasoning

Add code
Nov 14, 2025
Viaarxiv icon

Hyperbolic Hierarchical Alignment Reasoning Network for Text-3D Retrieval

Add code
Nov 14, 2025
Viaarxiv icon

Human-in-the-loop Online Rejection Sampling for Robotic Manipulation

Add code
Oct 30, 2025
Viaarxiv icon

GTR-Bench: Evaluating Geo-Temporal Reasoning in Vision-Language Models

Add code
Oct 09, 2025
Figure 1 for GTR-Bench: Evaluating Geo-Temporal Reasoning in Vision-Language Models
Figure 2 for GTR-Bench: Evaluating Geo-Temporal Reasoning in Vision-Language Models
Figure 3 for GTR-Bench: Evaluating Geo-Temporal Reasoning in Vision-Language Models
Figure 4 for GTR-Bench: Evaluating Geo-Temporal Reasoning in Vision-Language Models
Viaarxiv icon

Language-Instructed Reasoning for Group Activity Detection via Multimodal Large Language Model

Add code
Sep 19, 2025
Viaarxiv icon