Picture for Chen Li

Chen Li

Beihang University

OmniHuman: A Large-scale Dataset and Benchmark for Human-Centric Video Generation

Add code
Apr 20, 2026
Viaarxiv icon

Audio-Omni: Extending Multi-modal Understanding to Versatile Audio Generation and Editing

Add code
Apr 12, 2026
Viaarxiv icon

From Understanding to Erasing: Towards Complete and Stable Video Object Removal

Add code
Apr 02, 2026
Viaarxiv icon

AndroTMem: From Interaction Trajectories to Anchored Memory in Long-Horizon GUI Agents

Add code
Mar 19, 2026
Viaarxiv icon

Recurrent Reasoning with Vision-Language Models for Estimating Long-Horizon Embodied Task Progress

Add code
Mar 18, 2026
Viaarxiv icon

Identity as Presence: Towards Appearance and Voice Personalized Joint Audio-Video Generation

Add code
Mar 18, 2026
Viaarxiv icon

NOVA: Sparse Control, Dense Synthesis for Pair-Free Video Editing

Add code
Mar 03, 2026
Viaarxiv icon

CARE: A Molecular-Guided Foundation Model with Adaptive Region Modeling for Whole Slide Image Analysis

Add code
Feb 25, 2026
Viaarxiv icon

GLM-5: from Vibe Coding to Agentic Engineering

Add code
Feb 17, 2026
Viaarxiv icon

FAIL: Flow Matching Adversarial Imitation Learning for Image Generation

Add code
Feb 12, 2026
Viaarxiv icon