Picture for Rui Zhao

Rui Zhao

Department of Radiology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China

GTR-Bench: Evaluating Geo-Temporal Reasoning in Vision-Language Models

Add code
Oct 09, 2025
Figure 1 for GTR-Bench: Evaluating Geo-Temporal Reasoning in Vision-Language Models
Figure 2 for GTR-Bench: Evaluating Geo-Temporal Reasoning in Vision-Language Models
Figure 3 for GTR-Bench: Evaluating Geo-Temporal Reasoning in Vision-Language Models
Figure 4 for GTR-Bench: Evaluating Geo-Temporal Reasoning in Vision-Language Models
Viaarxiv icon

Language-Instructed Reasoning for Group Activity Detection via Multimodal Large Language Model

Add code
Sep 19, 2025
Viaarxiv icon

Fishing for Answers: Exploring One-shot vs. Iterative Retrieval Strategies for Retrieval Augmented Generation

Add code
Sep 05, 2025
Viaarxiv icon

Reasoning RAG via System 1 or System 2: A Survey on Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges

Add code
Jun 12, 2025
Viaarxiv icon

Attention-based Learning for 3D Informative Path Planning

Add code
Jun 10, 2025
Viaarxiv icon

PHRASED: Phrase Dictionary Biasing for Speech Translation

Add code
Jun 10, 2025
Viaarxiv icon

SORCE: Small Object Retrieval in Complex Environments

Add code
May 30, 2025
Viaarxiv icon

SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams

Add code
May 26, 2025
Viaarxiv icon

DiffE2E: Rethinking End-to-End Driving with a Hybrid Action Diffusion and Supervised Policy

Add code
May 26, 2025
Viaarxiv icon

Unlocking the Power of SAM 2 for Few-Shot Segmentation

Add code
May 21, 2025
Figure 1 for Unlocking the Power of SAM 2 for Few-Shot Segmentation
Figure 2 for Unlocking the Power of SAM 2 for Few-Shot Segmentation
Figure 3 for Unlocking the Power of SAM 2 for Few-Shot Segmentation
Figure 4 for Unlocking the Power of SAM 2 for Few-Shot Segmentation
Viaarxiv icon