Picture for Zhaoxiang Zhang

Zhaoxiang Zhang

SpatialVID: A Large-Scale Video Dataset with Spatial Annotations

Add code
Sep 11, 2025
Viaarxiv icon

HLG: Comprehensive 3D Room Construction via Hierarchical Layout Generation

Add code
Aug 25, 2025
Viaarxiv icon

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Add code
Jul 10, 2025
Viaarxiv icon

A Survey on Latent Reasoning

Add code
Jul 08, 2025
Viaarxiv icon

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

Add code
Jul 08, 2025
Viaarxiv icon

DexVLG: Dexterous Vision-Language-Grasp Model at Scale

Add code
Jul 03, 2025
Viaarxiv icon

Unified Vision-Language-Action Model

Add code
Jun 24, 2025
Viaarxiv icon

TC-Light: Temporally Consistent Relighting for Dynamic Long Videos

Add code
Jun 23, 2025
Viaarxiv icon

MLLM-CL: Continual Learning for Multimodal Large Language Models

Add code
Jun 05, 2025
Viaarxiv icon

KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation

Add code
May 21, 2025
Viaarxiv icon