Hongsheng Li

High-Fidelity Diffusion Face Swapping with ID-Constrained Facial Conditioning

Mar 28, 2025
Via arXiv

UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning

Mar 27, 2025
Via arXiv

Lumina-Image 2.0: A Unified and Efficient Image Generative Framework

Mar 27, 2025
Via arXiv

LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis

Mar 27, 2025
Via arXiv

Empowering LLMs in Decision Games through Algorithmic Data Synthesis

Mar 18, 2025
Via arXiv

Adversarial Data Collection: Human-Collaborative Perturbations for Efficient and Robust Robotic Imitation Learning

Mar 14, 2025
Via arXiv

CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models

Mar 13, 2025
Via arXiv

GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing

Mar 13, 2025
Via arXiv

TIDE: Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation

Mar 10, 2025
Via arXiv

DirectTriGS: Triplane-based Gaussian Splatting Field Representation for 3D Generation

Mar 10, 2025
Via arXiv