Picture for Jingyi Zhang

Jingyi Zhang

AI Guide Dog: Egocentric Path Prediction on Smartphone

Add code
Jan 14, 2025
Viaarxiv icon

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Add code
Dec 24, 2024
Viaarxiv icon

RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation

Add code
Dec 11, 2024
Viaarxiv icon

Open-Vocabulary Object Detection via Language Hierarchy

Add code
Oct 27, 2024
Figure 1 for Open-Vocabulary Object Detection via Language Hierarchy
Figure 2 for Open-Vocabulary Object Detection via Language Hierarchy
Figure 3 for Open-Vocabulary Object Detection via Language Hierarchy
Figure 4 for Open-Vocabulary Object Detection via Language Hierarchy
Viaarxiv icon

Historical Test-time Prompt Tuning for Vision Foundation Models

Add code
Oct 27, 2024
Figure 1 for Historical Test-time Prompt Tuning for Vision Foundation Models
Figure 2 for Historical Test-time Prompt Tuning for Vision Foundation Models
Figure 3 for Historical Test-time Prompt Tuning for Vision Foundation Models
Figure 4 for Historical Test-time Prompt Tuning for Vision Foundation Models
Viaarxiv icon

A Survey on Evaluation of Multimodal Large Language Models

Add code
Aug 28, 2024
Viaarxiv icon

NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results

Add code
Apr 22, 2024
Figure 1 for NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results
Figure 2 for NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results
Figure 3 for NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results
Figure 4 for NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results
Viaarxiv icon

The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

Add code
Apr 16, 2024
Figure 1 for The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report
Figure 2 for The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report
Figure 3 for The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report
Figure 4 for The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report
Viaarxiv icon

Learning Expressive And Generalizable Motion Features For Face Forgery Detection

Add code
Mar 08, 2024
Viaarxiv icon

Diving Deep into Regions: Exploiting Regional Information Transformer for Single Image Deraining

Add code
Feb 25, 2024
Viaarxiv icon