Picture for Linjie Li

Linjie Li

RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents

Add code
Feb 02, 2026
Viaarxiv icon

AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning

Add code
Jan 26, 2026
Viaarxiv icon

ProImage-Bench: Rubric-Based Evaluation for Professional Image Generation

Add code
Dec 13, 2025
Viaarxiv icon

Computer-Use Agents as Judges for Generative User Interface

Add code
Nov 19, 2025
Viaarxiv icon

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Add code
Oct 30, 2025
Figure 1 for ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning
Figure 2 for ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning
Figure 3 for ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning
Figure 4 for ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning
Viaarxiv icon

SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models

Add code
Oct 08, 2025
Figure 1 for SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models
Figure 2 for SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models
Figure 3 for SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models
Figure 4 for SHANKS: Simultaneous Hearing and Thinking for Spoken Language Models
Viaarxiv icon

ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs

Add code
Jun 11, 2025
Viaarxiv icon

Synthetic Visual Genome

Add code
Jun 09, 2025
Viaarxiv icon

Audio-Aware Large Language Models as Judges for Speaking Styles

Add code
Jun 06, 2025
Viaarxiv icon

Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations

Add code
Jun 05, 2025
Viaarxiv icon