Picture for Tingting Gao

Tingting Gao

DIVA-GRPO: Enhancing Multimodal Reasoning through Difficulty-Adaptive Variant Advantage

Add code
Mar 01, 2026
Viaarxiv icon

ContextRL: Enhancing MLLM's Knowledge Discovery Efficiency with Context-Augmented RL

Add code
Feb 26, 2026
Viaarxiv icon

CREM: Compression-Driven Representation Enhancement for Multimodal Retrieval and Comprehension

Add code
Feb 22, 2026
Viaarxiv icon

UniRef-Image-Edit: Towards Scalable and Consistent Multi-Reference Image Editing

Add code
Feb 15, 2026
Viaarxiv icon

Awakening Dormant Users: Generative Recommendation with Counterfactual Functional Role Reasoning

Add code
Feb 13, 2026
Viaarxiv icon

Spatial Chain-of-Thought: Bridging Understanding and Generation Models for Spatial Reasoning Generation

Add code
Feb 12, 2026
Viaarxiv icon

QARM V2: Quantitative Alignment Multi-Modal Recommendation for Reasoning User Sequence Modeling

Add code
Feb 09, 2026
Viaarxiv icon

VideoTemp-o3: Harmonizing Temporal Grounding and Video Understanding in Agentic Thinking-with-Videos

Add code
Feb 08, 2026
Viaarxiv icon

Joint Reward Modeling: Internalizing Chain-of-Thought for Efficient Visual Reward Models

Add code
Feb 07, 2026
Viaarxiv icon

SpatialReward: Bridging the Perception Gap in Online RL for Image Editing via Explicit Spatial Reasoning

Add code
Feb 07, 2026
Viaarxiv icon