Picture for Ran Yi

Ran Yi

UniICL: Systematizing Unified Multimodal In-context Learning through a Capability-Oriented Taxonomy

Add code
Mar 25, 2026
Viaarxiv icon

StdGEN++: A Comprehensive System for Semantic-Decomposed 3D Character Generation

Add code
Jan 12, 2026
Viaarxiv icon

HeadLighter: Disentangling Illumination in Generative 3D Gaussian Heads via Lightstage Captures

Add code
Jan 05, 2026
Viaarxiv icon

InpaintDPO: Mitigating Spatial Relationship Hallucinations in Foreground-conditioned Inpainting via Diverse Preference Optimization

Add code
Dec 16, 2025
Figure 1 for InpaintDPO: Mitigating Spatial Relationship Hallucinations in Foreground-conditioned Inpainting via Diverse Preference Optimization
Figure 2 for InpaintDPO: Mitigating Spatial Relationship Hallucinations in Foreground-conditioned Inpainting via Diverse Preference Optimization
Figure 3 for InpaintDPO: Mitigating Spatial Relationship Hallucinations in Foreground-conditioned Inpainting via Diverse Preference Optimization
Figure 4 for InpaintDPO: Mitigating Spatial Relationship Hallucinations in Foreground-conditioned Inpainting via Diverse Preference Optimization
Viaarxiv icon

PoseAnything: Universal Pose-guided Video Generation with Part-aware Temporal Coherence

Add code
Dec 15, 2025
Viaarxiv icon

Uncovering and Mitigating Transient Blindness in Multimodal Model Editing

Add code
Nov 17, 2025
Viaarxiv icon

IAR2: Improving Autoregressive Visual Generation with Semantic-Detail Associated Token Prediction

Add code
Oct 08, 2025
Viaarxiv icon

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning

Add code
Sep 26, 2025
Figure 1 for MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning
Figure 2 for MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning
Figure 3 for MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning
Figure 4 for MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning
Viaarxiv icon

PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement

Add code
Jun 09, 2025
Viaarxiv icon

3D Gaussian Head Avatars with Expressive Dynamic Appearances by Compact Tensorial Representations

Add code
Apr 21, 2025
Viaarxiv icon