Liang Li

Key Lab of Intell. Info. Process., Inst. of Comput. Tech., Chinese Academy of Sciences

UniDDT: Unifying Multimodal Understanding and Generation with Decoupled Diffusion Transformer

Jun 15, 2026
Via arXiv

VAIC: Vision-Guided Humanoid Agile Object Interaction Control via Decoupled Commands

Jun 08, 2026
Via arXiv

MetaPoint: Unlocking Precise Spatial Control in Agentic Visual Generation

Jun 03, 2026
Via arXiv

GARDEN: Gravity-Aligned Reconstruction of Disentangled ENvironments from RGB images

Jun 02, 2026
Via arXiv

What to Format and How: A Benchmark and Workflow Approach for Document Formatting

Jun 01, 2026
Via arXiv

3D Magnetic Field Reconstruction and Mapping with Physics-Informed Neural Networks

May 25, 2026
Via arXiv

Cluster-Aware Neural Collapse Prompt Tuning for Long-Tailed Generalization of Vision-Language Models

May 12, 2026
Via arXiv

CoSyncDiT: Cognitive Synchronous Diffusion Transformer for Movie Dubbing

Apr 14, 2026
Via arXiv

XRZero-G0: Pushing the Frontier of Dexterous Robotic Manipulation with Interfaces, Quality and Ratios

Apr 14, 2026
Via arXiv

Weather-Conditioned Branch Routing for Robust LiDAR-Radar 3D Object Detection

Apr 07, 2026
Via arXiv