Picture for Sirui Han

Sirui Han

Learning While Staying Curious: Entropy-Preserving Supervised Fine-Tuning via Adaptive Self-Distillation for Large Reasoning Models

Add code
Feb 02, 2026
Viaarxiv icon

Pushing the Boundaries of Natural Reasoning: Interleaved Bonus from Formal-Logic Verification

Add code
Jan 30, 2026
Viaarxiv icon

Glance-or-Gaze: Incentivizing LMMs to Adaptively Focus Search via Reinforcement Learning

Add code
Jan 20, 2026
Viaarxiv icon

LRAS: Advanced Legal Reasoning with Agentic Search

Add code
Jan 12, 2026
Viaarxiv icon

AM$^3$Safety: Towards Data Efficient Alignment of Multi-modal Multi-turn Safety for MLLMs

Add code
Jan 08, 2026
Viaarxiv icon

Wow, wo, val! A Comprehensive Embodied World Model Evaluation Turing Test

Add code
Jan 07, 2026
Viaarxiv icon

MedInsightBench: Evaluating Medical Analytics Agents Through Multi-Step Insight Discovery in Multimodal Medical Data

Add code
Dec 15, 2025
Viaarxiv icon

WristWorld: Generating Wrist-Views via 4D World Models for Robotic Manipulation

Add code
Oct 08, 2025
Viaarxiv icon

Can World Models Benefit VLMs for World Dynamics?

Add code
Oct 01, 2025
Viaarxiv icon

WoW: Towards a World omniscient World model Through Embodied Interaction

Add code
Sep 26, 2025
Viaarxiv icon