Xiaohui Li

Thinking Without Images: Internalizing Visual Manipulation with On-Policy Self-Distillation

Jun 07, 2026
Via arXiv

What Makes Interaction Trajectories Effective for Training Terminal Agents?

Jun 02, 2026
Via arXiv

Robust Subspace-Constrained Quadratic Models for Low-Dimensional Structure Learning

May 19, 2026
Via arXiv

BitLM: Unlocking Multi-Token Language Generation with Bitwise Continuous Diffusion

May 12, 2026
Via arXiv

Memorize Theorems, Not Instances: Probing SFT Generalization through Mathematical Reasoning

May 10, 2026
Via arXiv

StableI2I: Spotting Unintended Changes in Image-to-Image Transition

May 06, 2026
Via arXiv

Deeper Thought, Weaker Aim: Understanding and Mitigating Perceptual Impairment during Reasoning in Multimodal Large Language Models

Mar 15, 2026
Via arXiv

Accelerating Masked Image Generation by Learning Latent Controlled Dynamics

Feb 27, 2026
Via arXiv

UniWeTok: An Unified Binary Tokenizer with Codebook Size $\mathit{2^{128}}$ for Unified Multimodal Large Language Model

Feb 15, 2026
Via arXiv

Toward Generalizable Deblurring: Leveraging Massive Blur Priors with Linear Attention for Real-World Scenarios

Jan 10, 2026
Via arXiv