Picture for Ran Yi

Ran Yi

InpaintDPO: Mitigating Spatial Relationship Hallucinations in Foreground-conditioned Inpainting via Diverse Preference Optimization

Add code
Dec 16, 2025
Viaarxiv icon

PoseAnything: Universal Pose-guided Video Generation with Part-aware Temporal Coherence

Add code
Dec 15, 2025
Viaarxiv icon

Uncovering and Mitigating Transient Blindness in Multimodal Model Editing

Add code
Nov 17, 2025
Viaarxiv icon

IAR2: Improving Autoregressive Visual Generation with Semantic-Detail Associated Token Prediction

Add code
Oct 08, 2025
Viaarxiv icon

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning

Add code
Sep 26, 2025
Figure 1 for MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning
Figure 2 for MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning
Figure 3 for MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning
Figure 4 for MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning
Viaarxiv icon

PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement

Add code
Jun 09, 2025
Viaarxiv icon

3D Gaussian Head Avatars with Expressive Dynamic Appearances by Compact Tensorial Representations

Add code
Apr 21, 2025
Viaarxiv icon

SIGMAN:Scaling 3D Human Gaussian Generation with Millions of Assets

Add code
Apr 09, 2025
Viaarxiv icon

A$^\text{T}$A: Adaptive Transformation Agent for Text-Guided Subject-Position Variable Background Inpainting

Add code
Apr 02, 2025
Figure 1 for A$^\text{T}$A: Adaptive Transformation Agent for Text-Guided Subject-Position Variable Background Inpainting
Figure 2 for A$^\text{T}$A: Adaptive Transformation Agent for Text-Guided Subject-Position Variable Background Inpainting
Figure 3 for A$^\text{T}$A: Adaptive Transformation Agent for Text-Guided Subject-Position Variable Background Inpainting
Figure 4 for A$^\text{T}$A: Adaptive Transformation Agent for Text-Guided Subject-Position Variable Background Inpainting
Viaarxiv icon

MOS: Modeling Object-Scene Associations in Generalized Category Discovery

Add code
Mar 15, 2025
Viaarxiv icon