Hongsheng Li

ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework

Mar 21, 2026
Via arXiv

AR-CoPO: Align Autoregressive Video Generation with Contrastive Policy Optimization

Mar 18, 2026
Via arXiv

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Mar 10, 2026
Via arXiv

PIRA-Bench: A Transition from Reactive GUI Agents to GUI-based Proactive Intent Recommendation Agents

Mar 09, 2026
Via arXiv

MADCrowner: Margin Aware Dental Crown Design with Template Deformation and Refinement

Mar 05, 2026
Via arXiv

From Solver to Tutor: Evaluating the Pedagogical Intelligence of LLMs with KMP-Bench

Mar 03, 2026
Via arXiv

From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors

Feb 25, 2026
Via arXiv

GA-Drive: Geometry-Appearance Decoupled Modeling for Free-viewpoint Driving Scene Generation

Feb 24, 2026
Via arXiv

MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning

Feb 11, 2026
Via arXiv

UI-Mem: Self-Evolving Experience Memory for Online Reinforcement Learning in Mobile GUI Agents

Feb 05, 2026
Via arXiv