Xiu Li

Cross-Domain Offline Policy Adaptation via Selective Transition Correction

Feb 05, 2026

Making Avatars Interact: Towards Text-Driven Human-Object Interaction for Controllable Talking Avatars

Feb 02, 2026

Visual Language Hypothesis

Dec 31, 2025

Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning

Dec 30, 2025

DiverseGRPO: Mitigating Mode Collapse in Image Generation via Diversity-Aware GRPO

Dec 25, 2025

MVInverse: Feed-forward Multi-view Inverse Rendering in Seconds

Dec 24, 2025

FAIR: Focused Attention Is All You Need for Generative Recommendation

Dec 17, 2025

PROF: An LLM-based Reward Code Preference Optimization Framework for Offline Imitation Learning

Nov 14, 2025

Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models

Sep 30, 2025

TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making

Sep 10, 2025