Picture for Xiangnan He

Xiangnan He

Bridging Perception and Reasoning: Token Reweighting for RLVR in Multimodal LLMs

Add code
Mar 26, 2026
Viaarxiv icon

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Add code
Mar 23, 2026
Viaarxiv icon

Principled Steering via Null-space Projection for Jailbreak Defense in Vision-Language Models

Add code
Mar 23, 2026
Viaarxiv icon

Breaking User-Centric Agency: A Tri-Party Framework for Agent-Based Recommendation

Add code
Mar 11, 2026
Viaarxiv icon

GuardAlign: Test-time Safety Alignment in Multimodal Large Language Models

Add code
Feb 27, 2026
Viaarxiv icon

Enhancing Multi-Modal LLMs Reasoning via Difficulty-Aware Group Normalization

Add code
Feb 26, 2026
Viaarxiv icon

Fine-grained Semantics Integration for Large Language Model-based Recommendation

Add code
Feb 26, 2026
Viaarxiv icon

Uncertainty-aware Generative Recommendation

Add code
Feb 12, 2026
Viaarxiv icon

Towards Sample-Efficient and Stable Reinforcement Learning for LLM-based Recommendation

Add code
Jan 31, 2026
Viaarxiv icon

UniGRec: Unified Generative Recommendation with Soft Identifiers for End-to-End Optimization

Add code
Jan 24, 2026
Viaarxiv icon